Context-aware RCNNs: a Baseline for Action Detection in Videos

Source code for the following paper(arXiv link):

Context-aware RCNNs: a Baseline for Action Detection in Videos
Jianchao Wu, Zhanghui Kuang, Limin Wang, Wayne Zhang, Gangshan Wu
in ECCV 2020

Our implementation is based on Video-long-term-feature-banks.

Prepare dataset

Please follow LFB on how to prepare AVA dataset.

Prepare environment

Please follow LFB on how to prepare Caffe2 environment.

Download pre-trained weights

Please download R50-I3D-NL, and put it in [code root]/pretrained_weights folder.

Train a baseline model without scene feature and long-term feature

Run:

bash train_baseline.sh configs/avabox_r50_baseline_32x2_scale1_5.yaml

Train a model with scene feature

Run:

bash train_baseline.sh configs/avabox_r50_baseline_16x4_scale1_5_withScene.yaml

Train a model with scene feature and long-term feature

Stage1. Train a baseline model that will be used to infer LFB:

bash train_baseline.sh configs/avabox_r50_baseline_16x4_scale1_5.yaml

Stage2. Train a model with scene feature and LFB:

bash train_lfb.sh configs/avabox_r50_lfb_win60_L3_16x4_withScene.yaml [path to baseline model weight from step1]

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
caffe2_customized_ops/video		caffe2_customized_ops/video
configs		configs
dataset_tools		dataset_tools
lib		lib
tools		tools
README.md		README.md
test_baseline.sh		test_baseline.sh
train_baseline.sh		train_baseline.sh
train_lfb.sh		train_lfb.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Context-aware RCNNs: a Baseline for Action Detection in Videos

Prepare dataset

Prepare environment

Download pre-trained weights

Train a baseline model without scene feature and long-term feature

Train a model with scene feature

Train a model with scene feature and long-term feature

About

Releases

Packages

Languages

MCG-NJU/CRCNN-Action

Folders and files

Latest commit

History

Repository files navigation

Context-aware RCNNs: a Baseline for Action Detection in Videos

Prepare dataset

Prepare environment

Download pre-trained weights

Train a baseline model without scene feature and long-term feature

Train a model with scene feature

Train a model with scene feature and long-term feature

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages