FrVD: French Video Description dataset
-
Updated
Jun 22, 2023
FrVD: French Video Description dataset
Leveraging Self-Supervised Training for Unintentional Action Recognition (ECCVW 2022)
Code for the Paper: Quasi-Online Detection of Take and Release Actions from Egocentric Videos. International Conference on Image Analysis and Processing 2023.
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
Tool employed to visualize synchronized FrVD metadata and videos simultaneously.
Undergraduate Thesis @ Department of Automation, Tsinghua -- Understanding Few-shot Video with Pretrained Image-Text Models
The code for 3DTDS-Net with Pytorch
[ICCV 2021] On the hidden treasure of dialog in video question answering
📚 Paper Notes (Computer vision)
Video understanding with C3D
The code for FSTA-Net with Pytorch
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
[CVPR 2018] Non-local Neural Networks
[IJCNN 2024] Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
The code for PB-Net with Pytorch
We use visual data alone to learn a control policy for a robotic arm by observing expert video demonstrations. We implement and test several models, accomplishing an 85% success rate for a pick-and-place task.
Official implementation of "Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition", BMVC 2022
Temporal Action Localization with Multi-granularity Feature Aggregation and Cross-level Boundary Modeling
The code for L3AM loss with Pytorch
Add a description, image, and links to the video-understanding topic page so that developers can more easily learn about it.
To associate your repository with the video-understanding topic, visit your repo's landing page and select "manage topics."