
Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery

Xavier Bou, Rafael Grompone, Thibaud Ehret, Gabriele Facciolo, Jean-Michel Morel

Centre Borelli, ENS Paris-Saclay


arXiv · Google Drive · Project

This repository is the official implementation of the paper Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery.

🎉 Our Paper Has Been Accepted to the EarthVision Workshop at CVPR 2024! 🌍

The goal of this paper is to perform object detection in satellite imagery with only a few examples, thus enabling users to specify any object class with minimal annotation. To this end, we explore recent methods and ideas from open-vocabulary detection for the remote sensing domain. We develop a few-shot object detector based on a traditional two-stage architecture, where the classification block is replaced by a prototype-based classifier. A large-scale pre-trained model is used to build class-reference embeddings or prototypes, which are compared to region proposal contents for label prediction. In addition, we propose to fine-tune prototypes on available training images to boost performance and learn differences between similar classes, such as aircraft types. We perform extensive evaluations on two remote sensing datasets containing challenging and rare objects. Moreover, we study the performance of both visual and image-text features, namely DINOv2 and CLIP, including two CLIP models specifically tailored for remote sensing applications. Results indicate that visual features are largely superior to vision-language models, as the latter lack the necessary domain-specific vocabulary. Lastly, the developed detector outperforms fully supervised and few-shot methods evaluated on the SIMD and DIOR datasets, despite minimal training parameters.


Contents

  1. Overview
  2. Requirements
  3. Data preparation and weights
  4. Create prototypes
  5. Fine-tune prototypes
  6. Evaluate
  7. Citation
  8. License and Acknowledgement

Overview


Detect your desired objects in optical remote sensing data via a few simple steps:

  1. Prepare the data with N labelled examples per category (we provide examples for N={5, 10, 30})
  2. Create class-reference prototypes and background prototypes
  3. Fine-tune class-reference embeddings
  4. Detect objects via the RPN and the learned embeddings (see the sketch below)!
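
To make step 4 concrete, here is a minimal sketch of the prototype-matching idea. It is an illustration under our reading of the method, not the repository's actual classifier; the function name, the temperature tau, and the tensor shapes are assumptions:

  # Sketch: classify RPN proposals by cosine similarity to the prototypes.
  import torch
  import torch.nn.functional as F

  def classify_proposals(region_feats, prototypes, bg_prototypes, tau=0.1):
      # region_feats:  (R, D) embeddings of RPN proposals
      # prototypes:    (C, D) class-reference embeddings
      # bg_prototypes: (B, D) background embeddings
      all_protos = torch.cat([prototypes, bg_prototypes], dim=0)   # (C+B, D)
      sims = F.cosine_similarity(region_feats.unsqueeze(1),
                                 all_protos.unsqueeze(0), dim=-1)  # (R, C+B)
      # Proposals whose best match is a background slot are discarded.
      return (sims / tau).softmax(dim=-1)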

Requirements

Create a conda environment and install the required packages as follows. You may need to adapt package versions to your hardware:

  conda create -n ovdsat python=3.9 -y
  conda activate ovdsat
  pip install torch==1.13.0+cu116 torchvision==0.14.0+cu116 torchaudio==0.13.0 --extra-index-url https://download.pytorch.org/whl/cu116
  python -m pip install 'git+https://github.com/facebookresearch/detectron2.git'
  pip install opencv-python albumentations transformers

Data preparation and weights

To set up the data and pre-trained weights, download the contents of the following Google Drive folder. We provide the same splits and labels we use in our article for the SIMD dataset (N = {5, 10, 30}). Add the data/ and weights/ directories into the project directory. The data path should follow the structure below for each dataset, e.g. simd, dior or your own:

data/
│
├── simd/
│   ├── train_coco_subset_N5.json
│   ├── train_coco_subset_N10.json
│   ├── train_coco_subset_N30.json
│   ├── val_coco.json
│   ├── train/
│   │   ├── image1.jpg
│   │   ├── image2.jpg
│   │   └── ...
│   └── val/
│       ├── image1.jpg
│       ├── image2.jpg
│       └── ...
│
├── dior/
│   ├── train_coco_subset_N5.json
│   ├── train_coco_subset_N10.json
│   ├── train_coco_subset_N30.json
│   └── ...
...
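
Before running anything, you can optionally sanity-check that the annotation files load correctly. A minimal check with pycocotools (installed as a detectron2 dependency) might look like this; the N5 path is just one example from the layout above:

  # Load a COCO annotation file and print its basic statistics.
  from pycocotools.coco import COCO

  coco = COCO('data/simd/train_coco_subset_N5.json')
  cats = coco.loadCats(coco.getCatIds())
  print('categories:', [c['name'] for c in cats])
  print('images:', len(coco.getImgIds()), '| annotations:', len(coco.getAnnIds()))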

Weights

We pre-trained a Faster R-CNN model on DOTA to obtain the RPN, using the code from DroneDetectron2. If you plan to use any of the remote sensing CLIP models tested in the paper, download the pre-trained weights (RemoteCLIP and GeoRSCLIP) and add them to the weights/ directory.

Create prototypes

To generate the class-reference and background prototypes using DINOv2 features, run the following command:

bash scripts/init_prototypes.sh

Important: Set the path to your data in the DATA_DIR variable in the bash files. You can also adapt the datasets used and the value of N. If your data, files, or paths differ from ours, adapt the contents of the bash file to your own structure.
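
For intuition, prototype initialization boils down to averaging frozen backbone features over the N labelled examples of each class. The sketch below is illustrative only (the real logic lives in the script above); it assumes DINOv2 loaded via torch.hub and pre-normalized image crops:

  # Sketch: a class prototype as the mean DINOv2 embedding of its N example crops.
  import torch

  dinov2 = torch.hub.load('facebookresearch/dinov2', 'dinov2_vitb14').eval()

  @torch.no_grad()
  def build_prototype(crops):
      # crops: (N, 3, H, W) normalized crops of one class; H, W multiples of 14
      feats = dinov2(crops)           # (N, D) CLS-token embeddings
      proto = feats.mean(dim=0)       # average over the N labelled examples
      return proto / proto.norm()     # unit norm for cosine comparison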

Fine-tune prototypes

Train the pre-initialised class-reference prototypes on the available data:

bash scripts/train_prototypes_bbox.sh
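
Conceptually, this stage turns the prototypes into learnable parameters and trains them to separate visually similar classes (e.g. aircraft types). A heavily simplified sketch, where init_prototypes and train_loader stand in for the outputs of the previous steps and all hyper-parameters are placeholders:

  # Sketch: fine-tune prototypes with a cosine-similarity classification loss.
  import torch
  import torch.nn.functional as F

  prototypes = torch.nn.Parameter(init_prototypes.clone())   # (C, D), from initialization
  optimizer = torch.optim.AdamW([prototypes], lr=1e-3)

  for feats, labels in train_loader:                          # box features, class ids
      logits = F.normalize(feats, dim=-1) @ F.normalize(prototypes, dim=-1).T / 0.1
      loss = F.cross_entropy(logits, labels)
      optimizer.zero_grad()
      loss.backward()
      optimizer.step()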

Evaluate

Evaluate the learned prototypes on unseen data:

bash scripts/eval_detection.sh
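
The script reports standard COCO detection metrics on the validation split. If you want to score a predictions file yourself, a minimal pycocotools evaluation could look like this (predictions.json is a hypothetical detector output in COCO result format):

  # Compute COCO mAP for a detections file against the validation annotations.
  from pycocotools.coco import COCO
  from pycocotools.cocoeval import COCOeval

  gt = COCO('data/simd/val_coco.json')
  dt = gt.loadRes('predictions.json')
  ev = COCOeval(gt, dt, iouType='bbox')
  ev.evaluate()
  ev.accumulate()
  ev.summarize()   # prints AP, AP50, AP75, etc.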

Citation

If you find our work useful, please cite it as follows:

@article{Bou:2024,
  title={Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery},
  author={Bou, Xavier and Facciolo, Gabriele and von Gioi, Rafael Grompone and Morel, Jean-Michel and Ehret, Thibaud},
  journal={arXiv preprint arXiv:2403.05381},
  year={2024}
}

License and Acknowledgement

This project is licensed under the GNU Affero General Public License v3.0 - see the LICENSE file for details.
