Actor-Action Detection

UPDATE: Please update this repository (see README.md in the parent folder) and follow the new installation guide to compile the code again.

Please remember to activate the virtual environment first by running conda activate pytorch_0_4_1.

Installation

Please run the following commands before running the code:

```
cd mask_rcnn
bash make.sh
cd model/roi_align
bash make.sh
cd ../../../../A2D/Annotations
wget https://www.cs.rochester.edu/u/zli82/files/gt_val_det.pkl
cd ../../csc_249_final_proj_a2d_det
```

Usage

Training

The shell script scripts/train.sh calls train.py with a set of command-line arguments (e.g., learning rate, number of training epochs). You can change these arguments in scripts/train.sh. For details on each argument, run python train.py --help.

To start training, please run:
```
bash scripts/train.sh
```

Evaluation

Step 1: Generate the detection results

In scripts/gen_det_val.sh, set --load_ckpt to the checkpoint model (.pth file) to be evaluated. Checkpoints are saved under the --output_dir given in scripts/train.sh. You may also change the output file path given by --det_result_pkl.

```
bash scripts/gen_det_val.sh
```

Step 2: Evaluate the detection results

```
# --det_cls_pkl: the detection result generated by step 1 (--det_result_pkl in scripts/gen_det_val.sh)
# --mode: use 'actor' or 'action' instead to see the mAP when only actor labels or only action labels are considered
python eval/baseline_pascal_voc_map.py \
	--gt_cls_pkl ../A2D/Annotations/gt_val_det.pkl \
	--det_cls_pkl path_to_the_detection_result \
	--mode actor_action
```
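To make the reported metric concrete, here is a minimal sketch of a Pascal VOC style average precision computation for a single class. It is an illustration only, not the code in eval/baseline_pascal_voc_map.py: the function names box_iou and average_precision are hypothetical, the all-points interpolation variant is an assumption (the script may use the 11-point variant instead), and matching detections to ground truth (usually at an IoU threshold of 0.5) is assumed to have been done already.

```python
def box_iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2) in continuous coordinates."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(scored_hits, num_gt):
    """AP for one class (hypothetical helper, all-points interpolation).

    scored_hits: list of (confidence, is_true_positive) pairs, one per detection.
    num_gt: number of ground-truth boxes of this class.
    """
    if num_gt == 0:
        return 0.0
    # Rank detections from most to least confident.
    ranked = sorted(scored_hits, key=lambda d: d[0], reverse=True)
    tp = fp = 0
    recalls, precisions = [], []
    for _, hit in ranked:
        if hit:
            tp += 1
        else:
            fp += 1
        recalls.append(tp / num_gt)
        precisions.append(tp / (tp + fp))
    # Replace each precision by the maximum precision at any higher recall.
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])
    # Sum the area under the resulting step curve.
    ap, prev_recall = 0.0, 0.0
    for r, p in zip(recalls, precisions):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap
```

The mAP is then the mean of the per-class AP values, where a class is an actor, an action, or an actor-action pair depending on --mode.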

Generate Results on the Test Set

This is similar to step 1 of the evaluation. Please run:
```
bash scripts/gen_det_test.sh
```
Please make sure to set --load_ckpt to the new checkpoint model.

Please rename your pickle file to det.pkl in the submission.
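Before submitting, it can help to confirm that det.pkl loads cleanly. The sketch below is a generic check using only the standard library. The helper name describe_pickle is hypothetical, and nothing here assumes a particular layout for the detection results, since this README does not specify one.

```python
import pickle


def describe_pickle(path):
    """Load a pickle file and return (type name, length or None) of its top-level object."""
    with open(path, "rb") as f:
        obj = pickle.load(f)
    # Not every object has a length, so fall back to None.
    size = len(obj) if hasattr(obj, "__len__") else None
    return type(obj).__name__, size
```

For example, print(describe_pickle("det.pkl")) reports the container type and the number of top-level entries.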

Data Processing

We provide the data loading code. It currently supports loading neighboring frames to form a segment, which may be helpful for action recognition. The optical flow corresponding to the loaded segment is also computed, using the Gunnar-Farneback optical flow estimation algorithm.
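The idea can be sketched as follows. This is an illustration, not the code in dataset/: the helper names segment_indices and farneback_flow, the segment length of 5, and the clamping behavior at video boundaries are all assumptions, and the Farneback parameters are the values commonly used in OpenCV examples rather than the ones used in this repository.

```python
def segment_indices(center, num_frames, segment_len=5):
    """Indices of segment_len neighboring frames around center, clamped to the video bounds."""
    half = segment_len // 2
    return [
        min(max(center + offset, 0), num_frames - 1)
        for offset in range(-half, segment_len - half)
    ]


def farneback_flow(prev_gray, next_gray):
    """Dense optical flow between two grayscale frames (H x W uint8 arrays).

    Returns an H x W x 2 float32 array of per-pixel (dx, dy) displacements.
    """
    import cv2  # imported lazily so segment_indices works without OpenCV installed

    # Positional arguments after the two frames: initial flow (None), pyr_scale,
    # levels, winsize, iterations, poly_n, poly_sigma, flags.
    return cv2.calcOpticalFlowFarneback(
        prev_gray, next_gray, None, 0.5, 3, 15, 3, 5, 1.2, 0
    )
```

With these helpers, a flow field would be computed for each pair of consecutive frames in a segment, so a segment of N frames yields N - 1 flow fields.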

Model

The baseline model is Faster R-CNN with a Feature Pyramid Network, using ResNet-50 as the backbone. You can switch to another model by choosing a different configuration file in model_cfgs/ and passing it to the --cfg flag in scripts/train.sh. You are also free to use any other model, as long as it achieves better performance.

COCO-pretrained model

If you want to use an MS-COCO pretrained model, please download it from https://github.com/facebookresearch/Detectron/blob/master/MODEL_ZOO.md. Then add the --load_detectron flag in scripts/train.sh with the path to the downloaded pickle file.

Acknowledgement

Thanks to @roytseng-tw for https://github.com/roytseng-tw/Detectron.pytorch
