This repository contains a compilation of code implementations for numerous works in Activity Recognition.
- Video attributions Code
- SlowFast by Facebook Research Link
- MMAction Link
- MMAction2 Link
- ClassyVision Link
- Facebook Video Modelling Zoo Link
- PyVideoResearch Link
- GluonCV Link
- M-PACT Link
- PyTorch Video Recognition Link
- Convolutional Two-Stream Network Fusion for Video Action Recognition Code | Paper
- Temporal Segment Networks: Towards Good Practices for Deep Action Recognition Code | Paper
- ActionVLAD: Learning spatio-temporal aggregation for action classification Code | Paper
- Hidden Two-Stream Convolutional Networks for Action Recognition Code | Paper
- Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset Code | Code | Code | Paper
LSTM Methods:
- Long-term Recurrent Convolutional Networks for Visual Recognition and Description Code | Paper
- What Would You Expect? Anticipating Egocentric Actions With Rolling-Unrolling LSTMs and Modality Attention Code | Paper
- TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition Code | Paper
CNN Methods:
- Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification Code | Paper
- TSM: Temporal Shift Module for Efficient Video Understanding Code | Paper
- Temporal Relational Reasoning in Videos Code | Paper
- Non-Local Neural Networks Code | Paper
- Video Classification with Channel-Separated Convolutional Networks Unofficial Code | Paper
- Gate-Shift Networks for Video Action Recognition Code | Paper
- V4D: 4D Convolutional Neural Networks for Video-level Representation Learning Code | Paper
- Temporal Interlacing Network Code | Paper
Attention-based Methods:
- Action Recognition using Visual Attention Code | Code | Paper
- Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification Unofficial Code | Paper
- Video Modeling with Correlation Networks Code | Paper
- Attentional Pooling for Action Recognition Code | Paper
- Temporal Convolutional Networks: A Unified Approach to Action Segmentation and Detection Code | Paper
- Long-term Temporal Convolutions Code | Paper
- Dynamic Image Networks for Action Recognition Code | Paper
- Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks Code | Paper
- Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? Code | Paper
- Long-Term Feature Banks for Detailed Video Understanding Code | Paper
- Learning Correspondence from the Cycle-consistency of Time Code | Paper
- Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition Code | Paper
- Learning Actor Relation Graphs for Group Activity Recognition Code | Paper
- Asynchronous Temporal Fields for Action Recognition Code | Paper
- TEA: Temporal Excitation and Aggregation for Action Recognition Code | Paper
- MotionSqueeze: Neural Motion Feature Learning for Video Understanding Code | Paper
- ECO: Efficient Convolutional Network for Online Video Understanding Code | Paper
- End-to-End Learning of Motion Representation for Video Understanding Code | Paper
- AR-Net: Adaptive Frame Resolution for Efficient Action Recognition Code | Paper
- Learn to cycle: Time-consistent feature discovery for action recognition Code | Paper
- VideoGraph: Recognizing Minutes-Long Human Activities in Videos Code | Paper
- Timeception for Complex Action Recognition Code | Paper
- An Evaluation of Action Recognition Models on EPIC-Kitchens Code | Paper
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection Code | Paper
- Appearance-and-Relation Networks for Video Classification Code | Paper
- End-to-end Video-level Representation Learning for Action Recognition Code | Paper
- Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors Code | Paper
- Real-time Action Recognition with Enhanced Motion Vector CNNs Code | Paper
- Temporal-Relational CrossTransformers for Few-Shot Action Recognition Code | Paper