Meta-Faster-R-CNN

This repo contains the official PyTorch implementation for the AAAI 2022 Oral paper: 'Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with Attentive Feature Alignment' (paper).

Highlights

Our model is a natural extension of Faster R-CNN for few-shot scenario with the prototype based metric-learning.
Our meta-learning based models achieve strong few-shot object detection performance without fine-tuning.
Our model can keep the knowledge of base classes by learning a separate Faster R-CNN detection head for base classes.

Installation

Our codebase is built upon detectron2. You only need to install detectron2 following their instructions.

Please note that we used detectron 0.2.1 in this project. Higher versions of detectron might report errors.

Data Preparation

We evaluate our model on two FSOD benchmarks PASCAL VOC and MSCOCO following the previous work TFA.
Please prepare the original PASCAL VOC and MSCOCO datasets and also the few-shot datasets following TFA in the folder ./datasets/coco and ./datasets/pascal_voc respectively.
Please run the scripts in ./datasets/coco and ./datasets/pascal_voc step by step to generate the support images for both many-shot base classes (used during meta-training) and few-shot classes (used during few-shot fine-tuning).

Model training and evaluation on MSCOCO

We have three training stages, first meta-training, then training the base-classes detection head, and finally few-shot fine-tuning.
During meta-training, we have three training steps. First, we train the baseline model following FewX. Then we add the whole feature fusion network in both Meta-RPN and Meta-Classifier, and finally add the proposed attentive feature alignment. The training script is

sh scripts/meta_training_coco_resnet101_multi_stages.sh

after meta-training, the model are directly evaluated on novel classes without fine-tuning.

We a separate Faster R-CNN detection head for base classes, using the shared feature backbone as the first step. The training script is

sh scripts/faster_rcnn_with_fpn_coco_base_classes_branch.sh

We perform 1/2/3/5/10/30-shot fine-tuning over novel classes after the three-step meta-training, using the exact same few-shot datasets as TFA. The training script is

sh scripts/few_shot_finetune_coco_resnet101.sh

Model training and evaluation on PASCAL VOC

We evaluate our model on the three splits as TFA.
Similar as MSCOCO, we have three training stages, and three training steps during meta-training.
The training scripts for VOC split1 is

sh scripts/meta_training_pascalvoc_split1_resnet101_multi_stages.sh
sh scripts/faster_rcnn_with_fpn_pascalvoc_split1_base_classes_branch.sh
sh scripts/few_shot_finetune_pascalvoc_split1_resnet101.sh

The training scripts for VOC split2 is

sh scripts/meta_training_pascalvoc_split2_resnet101_multi_stages.sh
sh scripts/faster_rcnn_with_fpn_pascalvoc_split2_base_classes_branch.sh
sh scripts/few_shot_finetune_pascalvoc_split2_resnet101.sh

The training scripts for VOC split3 is

sh scripts/meta_training_pascalvoc_split3_resnet101_multi_stages.sh
sh scripts/faster_rcnn_with_fpn_pascalvoc_split3_base_classes_branch.sh
sh scripts/few_shot_finetune_pascalvoc_split3_resnet101.sh

Model Zoo

We provided the meta-trained models over base classes for both MSCOCO dataset and the 3 splits on VOC dataset. The model links are Google Drive and Tencent Weiyun.

Citing Meta-Faster-R-CNN

If you use this work in your research or wish to refer to the baseline results published here, please use the following BibTeX entries:

@inproceedings{han2022meta,
  title={Meta faster r-cnn: Towards accurate few-shot object detection with attentive feature alignment},
  author={Han, Guangxing and Huang, Shiyuan and Ma, Jiawei and He, Yicheng and Chang, Shih-Fu},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={36},
  number={1},
  pages={780--789},
  year={2022}
}
@inproceedings{fan2020few,
  title={Few-shot object detection with attention-RPN and multi-relation detector},
  author={Fan, Qi and Zhuo, Wei and Tang, Chi-Keung and Tai, Yu-Wing},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={4013--4022},
  year={2020}
}
@inproceedings{wang2020frustratingly,
  title={Frustratingly simple few-shot object detection},
  author={Wang, Xin and Huang, Thomas E and Darrell, Trevor and Gonzalez, Joseph E and Yu, Fisher},
  booktitle={Proceedings of the 37th International Conference on Machine Learning},
  pages={9919--9928},
  year={2020}
}

Acknowledgement

This repo is developed based on FewX, TFA and detectron2. Thanks for their wonderful codebases.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
assets		assets
configs/fsod		configs/fsod
datasets		datasets
meta_faster_rcnn		meta_faster_rcnn
scripts		scripts
README.md		README.md
faster_rcnn_train_net.py		faster_rcnn_train_net.py
fsod_train_net.py		fsod_train_net.py
update_params_resnet101_fpn.py		update_params_resnet101_fpn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Meta-Faster-R-CNN

Highlights

Installation

Data Preparation

Model training and evaluation on MSCOCO

Model training and evaluation on PASCAL VOC

Model Zoo

Citing Meta-Faster-R-CNN

Acknowledgement

About

Releases

Packages

Languages

GuangxingHan/Meta-Faster-R-CNN

Folders and files

Latest commit

History

Repository files navigation

Meta-Faster-R-CNN

Highlights

Installation

Data Preparation

Model training and evaluation on MSCOCO

Model training and evaluation on PASCAL VOC

Model Zoo

Citing Meta-Faster-R-CNN

Acknowledgement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages