Pinned Loading
Repositories
Showing 10 of 50 repositories
- VideoRefer Public
[CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"
- FineReason Public
FineReason: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving
- VideoLLaMA2 Public
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
- multimodal_textbook Public
The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"