reddyav1
  • Johns Hopkins University
  • Baltimore, MD


This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 162 16 Updated Feb 23, 2025

This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)

Python 26 6 Updated Jun 28, 2024

PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)

Python 63 1 Updated Jun 7, 2024

Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"

Python 99 3 Updated Jan 28, 2024
Python 29 2 Updated Aug 14, 2023

An official PyTorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].

Python 13 2 Updated Jul 27, 2024

Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval

Python 28 1 Updated Mar 3, 2025

Video Summarization With Spatiotemporal Vision Transformer

Python 21 7 Updated Jul 5, 2023
Python 50 2 Updated Jun 4, 2024

A lightweight library to support the development of applications using LLMs

Python 5 Updated Apr 25, 2024
Python 16 Updated Jul 26, 2023

Code release for ActionFormer (ECCV 2022)

Python 474 81 Updated Apr 11, 2024

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Python 40 Updated Apr 15, 2024

Official repo for BMVC2021 paper ASFormer: Transformer for action segmentation

Python 99 18 Updated Feb 19, 2022

End to End Streaming Video Temporal Segmentation

Python 25 6 Updated Mar 10, 2025

Official PyTorch code of GroundVQA (CVPR'24)

Python 58 2 Updated Sep 13, 2024
Python 126 20 Updated Jan 3, 2024

Awesome papers & datasets specifically focused on long-term videos.

263 12 Updated Nov 15, 2024

EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

Python 122 9 Updated Nov 10, 2024

Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation

476 36 Updated Mar 28, 2025

[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

Python 50 3 Updated Mar 6, 2023
Python 6 2 Updated Feb 8, 2024

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Jupyter Notebook 189 21 Updated Nov 13, 2023

Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"

Python 93 4 Updated Oct 27, 2024

Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"

Python 23 Updated Aug 28, 2023

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 353 32 Updated Nov 19, 2024
Python 86 2 Updated Dec 30, 2024

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,201 259 Updated Jan 18, 2025