
APN

This is the official repository of the paper Progression-Guided Temporal Action Detection in Videos. Our model achieves 58% mAP@0.5 on THUMOS14 in an end-to-end manner.

(Figure: overview of the APN framework, apn_framework_v2)

Build Environment

# 2080ti
conda create -n open-mmlab -y
conda activate open-mmlab
conda install pytorch torchvision -c pytorch
conda install pandas h5py scipy
pip install openmim future tensorboard timm pytorchvideo
mim install mmengine mmaction2 mmdet 
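You can optionally verify that the OpenMMLab packages resolved correctly before moving on; this sanity check is an addition to the original instructions:

# list installed OpenMMLab packages and their versions
mim list
python -c "import mmaction; print(mmaction.__version__)"
python -c "import mmdet; print(mmdet.__version__)"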

Prepare Data

Download the pre-processed THUMOS14 raw frames and the annotations (APN format), and put them under the repo root. We suggest storing the data somewhere else (an SSD is best) and creating a symbolic link here that points to the data path (see the example after the notes below). The folder structure should look like:

APN
|-- configs
|-- ...
|-- my_data
|   |-- thumos14
|   |   |-- annotations
|   |   |   |-- apn
|   |   |   |   |-- apn_train.csv
|   |   |   |   |-- apn_val.csv
|   |   |   |   |-- apn_test.csv
|   |   |-- rawframes
|   |   |   |-- train
|   |   |   |   |-- v_BaseballPitch_g01_c01
|   |   |   |   |   |-- img_00000.jpg
|   |   |   |   |   |-- img_00001.jpg
|   |   |   |   |   |-- ...
|   |   |   |   |   |-- img_00106.jpg
|   |   |   |   |   |-- flow_x_00000.jpg
|   |   |   |   |   |-- flow_x_00001.jpg
|   |   |   |   |   |-- ...
|   |   |   |   |   |-- flow_x_00105.jpg
|   |   |   |   |   |-- flow_y_00000.jpg
|   |   |   |   |   |-- flow_y_00001.jpg
|   |   |   |   |   |-- ...
|   |   |   |   |   |-- flow_y_00105.jpg
|   |   |   |   |-- ...
|   |   |   |-- val
|   |   |   |   |-- video_validation_0000051
|   |   |   |   |-- ...
|   |   |   |-- test
|   |   |   |   |-- video_test_0000004
|   |   |   |   |-- ...
  • Optical flow (TVL1) and RGB frames are included.
  • Only videos with temporal annotations (20 classes) are kept.
  • Some wrongly annotated videos have been removed.
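For example, the symbolic link can be set up from the repo root as follows (the source path /mnt/ssd/thumos14 is only a placeholder; substitute your own data location):

# link the externally stored dataset into the location the configs expect
mkdir -p my_data
ln -s /mnt/ssd/thumos14 my_data/thumos14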

Training

As an example, let's train APN on THUMOS14 optical flow, using I3D as the backbone with a clip setting of 32 frames x stride 4:

train.sh configs/localization/apn/apn_r3dsony_32x4_10e_thumos14_flow.py 2

*Replace the 2 with the number of GPUs you want to use.
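If you would rather launch a plain single-GPU run without the launcher script, repositories built on MMAction2 usually keep the standard training entry point; the sketch below assumes this repo follows that convention with tools/train.py (not confirmed by this README):

# hypothetical single-GPU run, assuming the standard MMAction2-style tools/train.py is present
python tools/train.py configs/localization/apn/apn_r3dsony_32x4_10e_thumos14_flow.py --work-dir work_dirs/apn_r3dsony_32x4_10e_thumos14_flow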

Test

After training has finished, you may use the command below to test the trained checkpoint.

test.sh configs/localization/apn/apn_r3dsony_32x4_10e_thumos14_flow.py work_dirs/apn_r3dsony_32x4_10e_thumos14_flow/latest.pth 2

*Replace the 2 with the number of GPUs you want to use.
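latest.pth refers to the most recent checkpoint saved during training. To evaluate a specific epoch instead, pass its checkpoint file explicitly; the file name below is an assumption, so check work_dirs/ for the actual names:

# evaluating an explicit epoch checkpoint (epoch_10.pth is an assumed name)
test.sh configs/localization/apn/apn_r3dsony_32x4_10e_thumos14_flow.py work_dirs/apn_r3dsony_32x4_10e_thumos14_flow/epoch_10.pth 2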

Acknowledgement

Our code is based on MMAction2.

Citation

If you find our work useful, please cite:

@article{lu2023progression,
  title={Progression-Guided Temporal Action Detection in Videos},
  author={Lu, Chongkai and Mak, Man-Wai and Li, Ruimin and Chi, Zheru and Fu, Hong},
  journal={arXiv preprint arXiv:2308.09268},
  year={2023}
}
