Simple pipeline to get started with video classification and action recognition

sithu31296/video-classification

Video Classification

Supported Models & Datasets

Action Recognition

Action Detection

Datasets

Pre-trained Models & Benchmarks

Refer to MODELS.md for supported pre-trained models and benchmarks of SOTA models.

Installation

  • python >= 3.6
  • torch >= 1.9.0
  • torchvision >= 0.10.0

With these requirements in place, clone and install this repo:

$ git clone https://github.com/sithu31296/video-classification
$ cd video-classification
$ pip install -e .
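
To confirm that an environment meets the minimum versions listed above, a small check like the following may help. It is only a sketch and not part of this repo: the helper names (`version_tuple`, `meets_minimum`, `installed_versions`) are hypothetical, and it assumes `torch` and `torchvision` expose a `__version__` attribute. Versions are compared as integer tuples, because a plain string comparison would rank "0.9.0" above "0.10.0".

```python
import importlib
import re
import sys

# Minimum versions copied from the requirements list in this README.
MINIMUMS = {"python": "3.6", "torch": "1.9.0", "torchvision": "0.10.0"}


def version_tuple(version):
    """Turn a string such as '1.9.0+cu111' into (1, 9, 0).

    Local build suffixes after '+' are ignored, and parsing stops at the
    first component that does not start with a digit.
    """
    parts = []
    for piece in version.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True if `installed` is at least `minimum`, compared numerically."""
    a, b = version_tuple(installed), version_tuple(minimum)
    # Pad the shorter tuple with zeros so that '3.6' equals '3.6.0'.
    size = max(len(a), len(b))
    a += (0,) * (size - len(a))
    b += (0,) * (size - len(b))
    return a >= b


def installed_versions():
    """Collect installed versions; a missing package is reported as None."""
    found = {"python": "{}.{}.{}".format(*sys.version_info[:3])}
    for name in ("torch", "torchvision"):
        try:
            found[name] = importlib.import_module(name).__version__
        except ImportError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, version in installed_versions().items():
        if version is None:
            status = "not installed"
        elif meets_minimum(version, MINIMUMS[name]):
            status = "ok"
        else:
            status = "too old, need >= " + MINIMUMS[name]
        print("{:<12} {:<16} {}".format(name, str(version), status))
```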

Inference

Use the following script to test the pre-trained model:

$ python tools/infer.py \
    --source VIDEO_FILE_NAME \
    --model MODEL_NAME \
    --model_path PRETRAINED_MODEL_PATH \
    --num_classes DATASET_NUM_CLASSES

The script prints the top-5 predicted classes with their scores, similar to this:

Class            Score (%)
-------------  -----------
archery              91.49
throwing axe          0.11
slacklining           0.06
feeding fish          0.06
rock climbing         0.05
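
For readers curious how a top-5 table like this can be derived from raw model outputs, here is a minimal sketch. It assumes the scores are softmax probabilities expressed as percentages, which this README does not confirm. It uses only the Python standard library rather than the repo's actual `tools/infer.py` logic, and the function names are hypothetical.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, labels, k=5):
    """Return the k most probable (label, score in percent) pairs."""
    probs = softmax(logits)
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return [(label, 100.0 * prob) for label, prob in ranked[:k]]


def format_table(rows):
    """Render rows in the same two-column layout as the sample output."""
    width = max([len("Class")] + [len(label) for label, _ in rows])
    lines = [
        "{:<{w}}  {:>11}".format("Class", "Score (%)", w=width),
        "{}  {}".format("-" * width, "-" * 11),
    ]
    for label, score in rows:
        lines.append("{:<{w}}  {:>11.2f}".format(label, score, w=width))
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up logits and labels, for illustration only.
    labels = ["archery", "throwing axe", "slacklining", "feeding fish"]
    logits = [9.1, 2.4, 1.8, 1.7]
    print(format_table(top_k(logits, labels, k=3)))
```

Because the percentages come from a softmax over all classes, the five printed scores need not sum to 100; the remainder is spread over the classes that are not shown.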

Training & Evaluation

Dataset Preparation

Follow the steps provided in DATASETS.md.

Configuration

Coming Soon

Training

Coming Soon

Evaluation

Coming Soon

References

Citations

@misc{li2022uniformer,
      title={UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning}, 
      author={Kunchang Li and Yali Wang and Peng Gao and Guanglu Song and Yu Liu and Hongsheng Li and Yu Qiao},
      year={2022},
      eprint={2201.04676},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
