A strong DETR-based detector named Domain Adaptive detection TRansformer (DATR) for unsupervised domain adaptation in object detection.


DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment

By Liang Chen, Jianhong Han and Yupei Wang.

This repository contains the implementation accompanying our paper DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment.

If you find it helpful for your research, please consider citing:

@article{han2024datr,
  title={DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment},
  author={Han, Jianhong and Chen, Liang and Wang, Yupei},
  journal={arXiv preprint arXiv:2405.11765},
  year={2024}
}

Acknowledgment

This implementation is built upon DINO and RemoteSensingTeacher.

Installation

Please refer to the instructions here. We list our system information for reference.

  • OS: Ubuntu 16.04
  • Python: 3.10.9
  • CUDA: 11.8
  • PyTorch: 2.0.1
  • torchvision: 0.15.2

Dataset Preparation

Please construct the datasets following these steps:

  • Download the datasets from their sources.

  • Convert the annotation files into COCO-format annotations.

  • Modify the dataset path settings within the script DAcoco.py:

    # ---- source domain
    PATHS_Source = {
        "train": ("",  # train image dir
                  ""), # train COCO-format json file
        "val": ("",    # val image dir
                ""),   # val COCO-format json file
    }
    # ---- target domain
    PATHS_Target = {
        "train": ("",  # train image dir
                  ""), # train COCO-format json file
        "val": ("",    # val image dir
                ""),   # val COCO-format json file
    }
  • Add the domain adaptation direction within the script __init__.py. For example:
    if args.dataset_file == 'city':
        return build_city_DA(image_set, args, strong_aug)
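
The COCO-format conversion in the second step is dataset-specific, but the target layout is always the same. Below is a minimal sketch of assembling COCO-style detection annotations; the field names follow the COCO detection format, while the helper name, its argument layout, and the example file name are hypothetical:

```python
import json

def to_coco(image_infos, box_records, categories):
    """Assemble a COCO-style detection annotation dict.

    image_infos : list of (file_name, width, height)
    box_records : list of (image_index, category_id, x, y, w, h)
    categories  : list of class names, indexed by category_id
    """
    return {
        "images": [
            {"id": i, "file_name": f, "width": w, "height": h}
            for i, (f, w, h) in enumerate(image_infos)
        ],
        "annotations": [
            {
                "id": j,
                "image_id": img_id,
                "category_id": cat_id,
                "bbox": [x, y, w, h],  # COCO uses [x, y, width, height]
                "area": w * h,
                "iscrowd": 0,
            }
            for j, (img_id, cat_id, x, y, w, h) in enumerate(box_records)
        ],
        "categories": [{"id": i, "name": n} for i, n in enumerate(categories)],
    }

# Example: one image containing a single 'car' box.
coco = to_coco([("munich_000000.png", 2048, 1024)],
               [(0, 0, 100, 200, 50, 80)],
               ["car"])
json.dumps(coco)  # serializable, ready to write as the train/val json above
```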

Training / Evaluation / Inference

We provide training scripts as follows. The training process is divided into two stages; the settings for each stage can be found in the config folder.

(1) For the Burn-In stage:

  • Training with single GPU
sh scripts/DINO_train.sh
  • Training with Multi-GPU
sh scripts/DINO_train_dist.sh

(2) For the Teacher-Student Mutual Learning stage, use the best model obtained from the Burn-In stage as the starting point.

  • Training with single GPU
sh scripts/DINO_train_self_training.sh
  • Training with Multi-GPU
sh scripts/DINO_train_self_training_dist.sh
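
In teacher-student mutual learning, the teacher is typically maintained as an exponential moving average (EMA) of the student's weights, which is why a separate EMA model is evaluated below. The update has the following form; this is a generic sketch with plain floats rather than tensors, and the momentum value is illustrative, not taken from the repository's configs:

```python
def ema_update(teacher_params, student_params, momentum=0.999):
    """In-place EMA update: teacher <- m * teacher + (1 - m) * student.

    Both arguments map parameter names to floats (tensors in a real model).
    """
    for name, w_student in student_params.items():
        teacher_params[name] = (momentum * teacher_params[name]
                                + (1.0 - momentum) * w_student)
    return teacher_params

# Applied after each student optimizer step:
teacher = {"w": 1.0}
student = {"w": 0.0}
ema_update(teacher, student, momentum=0.9)  # teacher["w"] -> 0.9
```

A high momentum makes the teacher evolve slowly, smoothing out noisy student updates and yielding more stable pseudo-labels for the student to learn from.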

We provide evaluation scripts to evaluate pre-trained models.

  • Evaluate the model:
sh scripts/DINO_eval.sh
  • Evaluate the EMA model:
sh scripts/DINO_eval_for_EMAmodel.sh

We provide inference scripts to visualize detection results; see inference.py for details.

  • Inference with the model:
python inference.py
  • Inference with the EMA model:
python inference_ema_model.py 

Pre-trained models

We provide specific experimental configurations and pre-trained models to facilitate the reproduction of our results. You can learn the details of DATR from the paper; please cite it if the code is useful for your research. Thank you!

| Task | mAP50 | Config | Model |
| --- | --- | --- | --- |
| Cityscapes to Foggy Cityscapes | 52.8% | cfg | model |
| Sim10k to Cityscapes | 66.3% | cfg | model |
| Cityscapes to BDD100K-daytime | 41.9% | cfg | model |

Reference

https://github.com/IDEA-Research/DINO

https://github.com/h751410234/RemoteSensingTeacher
