video-understanding

Here are 161 public repositories matching this topic...

whwu95 / FreeVA

FreeVA: Offline MLLM as Training-Free Video Assistant

chatbot video-understanding zero-shot-video-captioning video-question-answering chatgpt vision-language-model llava training-free multimodal-large-language-models

Updated Jun 9, 2024
Python

ddz16 / TSASPC

Star

[2023 IJCAI] The PyTorch implementation of the paper "Timestamp-Supervised Action Segmentation from the Perspective of Clustering".

deep-learning pytorch video-processing clustering-algorithm video-understanding tcn video-action-segmentation

Updated Apr 24, 2023
Python

katha-ai / VELOCITI

Star

VELOCITI Benchmark Evaluation and Visualisation Code

benchmarking benchmark video artificial-intelligence dataset awesome-list clip evaluation-metrics video-understanding vlm semantic-role-labeling llm chain-of-thought vision-language-model llm-inference llama3

Updated Sep 2, 2024
Python

SCUT-BIP-Lab / DwTNL-Net

Star

The code for DwTNL-Net with Pytorch

biometrics human-computer-interaction attention-mechanism video-understanding hand-gesture-authentication

Updated Feb 27, 2023
Python

SCUT-BIP-Lab / PB-Net

Star

The code for PB-Net with Pytorch

biometrics human-computer-interaction video-understanding biometrics-authentication hand-gesture-authentication behavioral-characteristic-analysis

Updated Feb 27, 2023
Python

UCSC-VLAA / Image-Pretraining-for-Video

Star

[ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".

image-classification action-recognition video-understanding 3d-convolutional-network eccv2022

Updated Dec 22, 2022
Python

unitaryai / VTC-dataset

Star

dataset video-understanding video-text-retrieval vision-language-pretraining vision-language-dataset

Updated May 1, 2024
Python

SCUT-BIP-Lab / 3DTDS-Net

Star

The code for 3DTDS-Net with Pytorch

biometrics human-computer-interaction video-understanding hand-gesture-authentication

Updated Mar 21, 2022
Python

sakibreza / ECCV24-HAT

Star

Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"

computer-vision transformers video-understanding action-localization egocentric-vision eccv2024

Updated Aug 23, 2024
Python

wanjinchang / I3D_Master

Star

implementation of Inflated 3D ConvNet in TensorFlow

tensorflow action-recognition video-understanding inceptionv2 inflated-network

Updated Apr 16, 2018
Python

alexandrosstergiou / Leaping-Into-Memories

Star

[ICCV 2023] Code implementation for "Leaping Into Memories: Space-Time Deep Feature Synthesis"

video-understanding interpretability feature-visualization feature-synthesis spatiotemporal-features

Updated Jul 25, 2023
Python

davidhaas6 / digest

Star

Streamlined video understanding with the help of language models

youtube transcription language-model video-understanding transcript-generator transcript-analysis llm

Updated Sep 13, 2024
Python

sarthak268 / nesca-pytorch

Sponsor

Star

PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.

computer-vision pytorch attention human-robot-interaction video-understanding human-robot-collaboration action-anticipation short-context robot-and-automation

Updated Aug 9, 2024
Python

kiyoon / verb_ambiguity

Sponsor

Star

Official implementation of "An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition", BMVC 2022

video activity-recognition action-recognition video-understanding multi-label-learning label-noise label-ambiguity single-positive-multi-label-learning multi-label-action-recognition-dataset

Updated Dec 16, 2022
Python

MGCBM / TAL-MGCBM

Star

Temporal Action Localization with Multi-granularity Feature Aggregation and Cross-level Boundary Modeling

video-understanding temporal-action-localization

Updated Nov 5, 2023
Python

happy-hsy / BCNet

Star

【AAAI 2022】Temporal Action Proposal Generation with Background Constraint

video-understanding temporal-action-proposals temporal-action-detection temporal-action-localization vision-transformers

Updated May 13, 2022
Python

shinkyo0513 / Spatio-Temporal-Perturbations-for-Video-Attribution

Star

The source code for the journal paper: Spatio-Temporal Perturbations for Video Attribution, TCSVT-2021

action-recognition video-understanding interpretable-deep-learning attribution-methods weakly-supervised-action-detection

Updated Oct 3, 2023
Python

bryant1410 / fitclip

Star

Code for the FitCLIP method

computer-vision pytorch video-understanding zero-shot video-classification zero-shot-learning text-to-video-retrieval

Updated Jan 27, 2023
Python

SCZwangxiao / TSGVs-MM2023

Star

ACM Multimedia 2023 - Temporal Sentence in Streaming Videos

streaming-video video-understanding vision-and-language temporal-action-localization video-moment-retrieval temporal-sentence-grounding

Updated Mar 17, 2024
Python

escorciav / moments-retrieval

Star

computer-vision python3 pytorch artificial-intelligence video-understanding

Updated Mar 2, 2023
Python

Improve this page

Add a description, image, and links to the video-understanding topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the video-understanding topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

video-understanding

Here are 161 public repositories matching this topic...

whwu95 / FreeVA

ddz16 / TSASPC

katha-ai / VELOCITI

SCUT-BIP-Lab / DwTNL-Net

SCUT-BIP-Lab / PB-Net

UCSC-VLAA / Image-Pretraining-for-Video

unitaryai / VTC-dataset

SCUT-BIP-Lab / 3DTDS-Net

sakibreza / ECCV24-HAT

wanjinchang / I3D_Master

alexandrosstergiou / Leaping-Into-Memories

davidhaas6 / digest

sarthak268 / nesca-pytorch

kiyoon / verb_ambiguity

MGCBM / TAL-MGCBM

happy-hsy / BCNet

shinkyo0513 / Spatio-Temporal-Perturbations-for-Video-Attribution

bryant1410 / fitclip

SCZwangxiao / TSGVs-MM2023

escorciav / moments-retrieval

Improve this page

Add this topic to your repo