The training and testing pipeline of PRSA-Net is reimplemented in PyTorch for ease of use.
Other minor Python modules can be installed by running
pip install -r requirements.txt
We support experimenting with two publicly available datasets for temporal action detection: THUMOS14 and ActivityNet v1.3. Follow the steps below to download them.
- THUMOS14: download the videos from the THUMOS14 challenge website.
- ActivityNet v1.3: the dataset is provided as a list of YouTube URLs; use the official ActivityNet downloader to fetch the videos from YouTube.
You can download the TSN features for training and testing from the G-TAD Google Drive. I3D features will be provided later.
Install Align1D layers
cd aligner/
python setup.py install
Set the path of features in config/cfg.yaml
feature_path: $PATH_OF_FEATURES
video_info_path: $PATH_OF_ANNOTATIONS
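Putting the two settings together, the relevant part of config/cfg.yaml might look like the sketch below; the paths are placeholders, and any field other than feature_path and video_info_path is illustrative rather than taken from the repo.

```yaml
# Hypothetical fragment of config/cfg.yaml; replace the paths
# with the locations of your downloaded features and annotations.
feature_path: /data/thumos14/tsn_features
video_info_path: /data/thumos14/annotations
```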
Then, use the following command to train PRSA-Net:
python main.py --mode train --cfg $PATH_TO_CONFIG_FILE
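The entry point presumably dispatches on the --mode flag. A minimal sketch of such a command-line interface is shown below; the flag names follow the commands in this README, but everything else is illustrative and not the repo's actual code.

```python
import argparse

def build_parser():
    # Illustrative PRSA-Net-style CLI: --mode selects training vs. inference,
    # --cfg points at the YAML config file.
    p = argparse.ArgumentParser(description="PRSA-Net entry point (sketch)")
    p.add_argument("--mode", choices=["train", "infer"], required=True,
                   help="train the model or run proposal/detection inference")
    p.add_argument("--cfg", required=True, help="path to config/cfg.yaml")
    return p

if __name__ == "__main__":
    args = build_parser().parse_args()
    # A real main.py would load the config and dispatch to training
    # or inference here; we just echo the parsed arguments.
    print(args.mode, args.cfg)
```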
You can evaluate the model's action proposal generation performance and action detection performance at the same time by running:
python main.py --mode infer --cfg $PATH_TO_CONFIG_FILE
By default, we use the weights from the 4th epoch for model evaluation; you can change this via the eval_model field in the config file. The script reports proposal generation performance in terms of AR (average recall) under various numbers of proposals, and detection performance in terms of mAP (mean average precision) at different tIoU thresholds.
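For reference, both metrics are built on the temporal IoU between predicted and ground-truth segments; recall@k then checks what fraction of ground-truth actions are covered by any of the top-k proposals. The sketch below illustrates these two building blocks only; it is not the repo's evaluation code, and the function names are our own.

```python
def temporal_iou(a, b):
    """IoU between two temporal segments, each given as (start, end) in seconds."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0

def recall_at_k(ground_truth, proposals, k, tiou_thresh=0.5):
    """Fraction of ground-truth segments matched (tIoU >= threshold)
    by at least one of the top-k proposals."""
    top_k = proposals[:k]
    matched = sum(
        any(temporal_iou(gt, p) >= tiou_thresh for p in top_k)
        for gt in ground_truth
    )
    return matched / len(ground_truth) if ground_truth else 0.0
```

AR@100 as reported below averages this recall over a range of tIoU thresholds with k = 100.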
AR@100 | RGB+Flow |
---|---|
PRSA-Net (I3D) | 56.12 |
mAP@0.5IoU (%) | RGB+Flow |
---|---|
PRSA-Net (I3D + two-stream) | 55.0 |
PRSA-Net (I3D + PGCN) | 58.7 |
Our implementation borrows ideas from previous works:
BMN: BMN: Boundary-Matching Network for Temporal Action Proposal Generation.
G-TAD: Sub-Graph Localization for Temporal Action Detection.
Contact: lishuaicheng@sensetime.com