FreeVA: Offline MLLM as Training-Free Video Assistant
-
Updated
Jun 9, 2024 - Python
FreeVA: Offline MLLM as Training-Free Video Assistant
[2023 IJCAI] The PyTorch implementation of the paper "Timestamp-Supervised Action Segmentation from the Perspective of Clustering".
VELOCITI Benchmark Evaluation and Visualisation Code
The code for DwTNL-Net with Pytorch
The code for PB-Net with Pytorch
[ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".
The code for 3DTDS-Net with Pytorch
Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"
implementation of Inflated 3D ConvNet in TensorFlow
[ICCV 2023] Code implementation for "Leaping Into Memories: Space-Time Deep Feature Synthesis"
Streamlined video understanding with the help of language models
PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.
Official implementation of "An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition", BMVC 2022
Temporal Action Localization with Multi-granularity Feature Aggregation and Cross-level Boundary Modeling
【AAAI 2022】Temporal Action Proposal Generation with Background Constraint
The source code for the journal paper: Spatio-Temporal Perturbations for Video Attribution, TCSVT-2021
Code for the FitCLIP method
ACM Multimedia 2023 - Temporal Sentence in Streaming Videos
Add a description, image, and links to the video-understanding topic page so that developers can more easily learn about it.
To associate your repository with the video-understanding topic, visit your repo's landing page and select "manage topics."