Activity-Recognition and Video-Understanding

This repository compiles code implementations for published work in Activity Recognition and Video Understanding.

Below is a general-purpose template for Activity Recognition:

Visual Attributions:

Video attributions Code

General code bases:

SlowFast by Facebook Research Link
MMAction Link
MMAction2 Link
ClassyVision Link
Facebook Video Modelling Zoo Link
PyVideoResearch Link
GluonCV Link
M-PACT Link
PyTorch Video Recognition Link

Multi-stream Methods:

Convolutional Two-Stream Network Fusion for Video Action Recognition Code | Paper
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition Code | Paper
ActionVLAD: Learning spatio-temporal aggregation for action classification Code | Paper
Hidden Two-Stream Convolutional Networks for Action Recognition Code | Paper
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset Code | Code | Code | Paper

Single-Stream Methods:

LSTM Methods:

Long-term Recurrent Convolutional Networks for Visual Recognition and Description Code | Paper
What Would You Expect? Anticipating Egocentric Actions With Rolling-Unrolling LSTMs and Modality Attention Code | Paper
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition Code | Paper

CNN Methods:

Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification Code | Paper
TSM: Temporal Shift Module for Efficient Video Understanding Code | Paper
Temporal Relational Reasoning in Videos Code | Paper
Non-Local Neural Networks Code | Paper
Video Classification with Channel-Separated Convolutional Networks Unofficial Code | Paper
Gate-Shift Networks for Video Action Recognition Code | Paper
V4D: 4D Convolutional Neural Networks for Video-level Representation Learning Code | Paper
Temporal Interlacing Network Code | Paper

Attention-based Methods:

Action Recognition using Visual Attention Code | Code | Paper
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification Unofficial Code | Paper
Video Modeling with Correlation Networks Code | Paper
Attentional Pooling for Action Recognition Code | Paper

Miscellaneous:

Temporal Convolutional Networks: A Unified Approach to Action Segmentation and Detection Code | Paper
Long-term Temporal Convolutions Code | Paper
Dynamic Image Networks for Action Recognition Code | Paper
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks Code | Paper
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? Code | Paper
Long-Term Feature Banks for Detailed Video Understanding Code | Paper
Learning Correspondence from the Cycle-consistency of Time Code | Paper
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition Code | Paper
Learning Actor Relation Graphs for Group Activity Recognition Code | Paper
Asynchronous Temporal Fields for Action Recognition Code | Paper
TEA: Temporal Excitation and Aggregation for Action Recognition Code | Paper
MotionSqueeze: Neural Motion Feature Learning for Video Understanding Code | Paper
ECO: Efficient Convolutional Network for Online Video Understanding Code | Paper
End-to-End Learning of Motion Representation for Video Understanding Code | Paper
AR-Net: Adaptive Frame Resolution for Efficient Action Recognition Code | Paper
Learn to cycle: Time-consistent feature discovery for action recognition Code | Paper
VideoGraph: Recognizing Minutes-Long Human Activities in Videos Code | Paper
Timeception for Complex Action Recognition Code | Paper
An Evaluation of Action Recognition Models on EPIC-Kitchens Code | Paper
STEP: Spatio-Temporal Progressive Learning for Video Action Detection Code | Paper
Appearance-and-Relation Networks for Video Classification Code | Paper
End-to-end Video-level Representation Learning for Action Recognition Code | Paper
Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors Code | Paper
Real-time Action Recognition with Enhanced Motion Vector CNNs Code | Paper
Temporal-Relational CrossTransformers for Few-Shot Action Recognition Code | Paper

SHI-Labs/Activity-Recognition