Johns Hopkins University
- Baltimore, MD
Stars
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
An official PyTorch implementation of the paper "MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval".
Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval
Video Summarization With Spatiotemporal Vision Transformer
A lightweight library to support the development of applications using LLMs
Code release for ActionFormer (ECCV 2022)
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
Official repo for BMVC2021 paper ASFormer: Transformer for action segmentation
End to End Streaming Video Temporal Segmentation
Official PyTorch code of GroundVQA (CVPR'24)
Awesome papers & datasets specifically focused on long-term videos.
EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"
[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.