orrzohar/README.md

I'm Orr Zohar 👋

My research focuses on Large Multimodal Models, with the aim of making these models more capable of understanding images and videos.

  • SmolVLM2: The tiniest video-LMM ever!
  • Apollo: Exploring video understanding in LMMs.
  • 💫 Video-STaR: A method that lets any labeled video dataset be used for instruction tuning.
  • 🤖 VideoAgent: An agent-based system that uses a large language model to iteratively identify and compile crucial information from long-form videos.

Pinned

  1. Video-STaR (Public)

    [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

    Python · 59 stars · 4 forks

  2. PROB (Public)

    [CVPR 2023] Official PyTorch code for PROB: Probabilistic Objectness for Open World Object Detection

    Python · 120 stars · 16 forks

  3. FOMO (Public)

    Official PyTorch code for Open World Object Detection in the Era of Foundation Models

    Python · 75 stars · 4 forks

  4. huggingface/transformers (Public)

    🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

    Python · 141k stars · 28.2k forks

  5. LOVM (Public)

    [NeurIPS 2023] Official PyTorch code for LOVM: Language-Only Vision Model Selection

    Python · 20 stars