3D ResNets for Action Recognition (CVPR 2018)
-
Updated
Jan 20, 2021 - Python
3D ResNets for Action Recognition (CVPR 2018)
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.
This is an official implementation for "Video Swin Transformers".
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Detects license plate of car and recognizes its characters
AutoVideo: An Automated Video Action Recognition System
PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
YAPO e+ - Yet Another Porn Organizer (extended)
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!
Video Recognition using Mixed Convolutional Tube (MiCT) on PyTorch with a ResNet backbone
Frame Flexible Network (CVPR2023)
My experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
State of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classifier as a plug & play solution . Provides alert if an item in an alert list is detected.
[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
Add a description, image, and links to the video-recognition topic page so that developers can more easily learn about it.
To associate your repository with the video-recognition topic, visit your repo's landing page and select "manage topics."