video-understanding

Star

Here are 183 public repositories matching this topic...

crim-ca / FrVD

Star

FrVD: French Video Description dataset

annotations dataset action-recognition video-understanding video-description

Updated Jun 22, 2023

unitaryai / VTC-dataset

Star

dataset video-understanding video-text-retrieval vision-language-pretraining vision-language-dataset

Updated May 1, 2024
Python

dukaenea / unintentional_actions

Star

Leveraging Self-Supervised Training for Unintentional Action Recognition (ECCVW 2022)

computer-vision deep-learning video-understanding

Updated Apr 24, 2023
Python

fpv-iplab / Quasi-Online-Detection-Take-Release

Star

Code for the Paper: Quasi-Online Detection of Take and Release Actions from Egocentric Videos. International Conference on Image Analysis and Processing 2023.

video-understanding action-detection

Updated May 28, 2024
Python

chajchaj / models

Star

Pre-trained and Reproduced Deep Learning Models （『飞桨』官方模型库，包含多种学术前沿和工业场景验证的深度学习模型）

video-understanding video-classification action-classification

Updated Sep 1, 2020
Python

crim-ca / FrVD-visualization-tool

Star

Tool employed to visualize synchronized FrVD metadata and videos simultaneously.

visualization annotations dataset action-recognition video-understanding video-description

Updated Apr 1, 2024
Python

InvincibleWyq / VBA

Star

Undergraduate Thesis @ Department of Automation, Tsinghua -- Understanding Few-shot Video with Pretrained Image-Text Models

transfer-learning video-understanding image-text-pretraining

Updated Dec 18, 2023
Python

SCUT-BIP-Lab / 3DTDS-Net

Star

The code for 3DTDS-Net with Pytorch

biometrics human-computer-interaction video-understanding hand-gesture-authentication

Updated Mar 21, 2022
Python

engindeniz / DialogSummary-VideoQA

Star

[ICCV 2021] On the hidden treasure of dialog in video question answering

language-models video-understanding vision-language video-question-answering knowledge-base-videoqa

Updated Mar 30, 2022
Python

XFeiF / ComputerVision_PaperNotes

Star

📚 Paper Notes (Computer vision)

computer-vision notes paper cv representation-learning cvpr action-recognition iccv video-understanding eccv video-representation-learning self-supervised-learning video-representation video-retrieval tpami video-papernotes

Updated Mar 23, 2021

carriex / C3D

Star

Video understanding with C3D

video-understanding c3d pytorch-implementation

Updated Jun 10, 2020
Python

SCUT-BIP-Lab / FSTA-Net

Star

The code for FSTA-Net with Pytorch

biometrics human-computer-interaction video-understanding biometric-authentication hand-gesture-authentication behavioral-characteristic-analysis

Updated May 23, 2023
Python

mx-mark / SPMNet

Star

Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)

synchronization video-understanding audioset vas cross-modality visual-audio audio-generation visual-to-sound

Updated Apr 12, 2022

ZJCV / Non-local

Star

[CVPR 2018] Non-local Neural Networks

pytorch action-recognition video-understanding video-recognition non-local i3d c2d resnet3d

Updated Dec 15, 2020
Python

Fsoft-AIC / UGLF

Star

[IJCNN 2024] Unifying Global and Local Scene Entities Modelling for Precise Action Spotting

video-processing video-understanding vision-language-model action-spotting

Updated May 4, 2024
Python

SCUT-BIP-Lab / PB-Net

Star

The code for PB-Net with Pytorch

biometrics human-computer-interaction video-understanding biometrics-authentication hand-gesture-authentication behavioral-characteristic-analysis

Updated Feb 27, 2023
Python

We use visual data alone to learn a control policy for a robotic arm by observing expert video demonstrations. We implement and test several models, accomplishing an 85% success rate for a pick-and-place task.

machine-learning video computer-vision deep-learning robotics video-understanding visuomotor-control