
Task-specific Inconsistency Alignment for Domain Adaptive Object Detection
(CVPR 2022)

Liang Zhao, Limin Wang

✔️ Requirements

Remember to compile the CUDA dependencies, following Faster R-CNN, with these simple commands:

cd lib
python setup.py build develop
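
Before compiling, it can help to confirm that your PyTorch build can see CUDA. This is a minimal check, assuming PyTorch is already installed:

# Minimal sanity check before compiling the CUDA extensions:
# prints the installed PyTorch version and whether a CUDA device is visible.
import torch

print("PyTorch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())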

📚 Datasets

cd data
bash download_datasets.sh

Still under construction; we will upload the datasets to Google Drive soon!

You can also prepare all the datasets following DA-Faster RCNN, SCL, and HTCN. Note that all of our code expects the Pascal VOC format. The pixel-level adaptation/interpolation/augmentation is implemented via CycleGAN, following HTCN, for a fair comparison. We use the default parameters for training; better parameters or networks may give better results. A quick layout check is sketched below.
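
As a rough guide, a dataset converted to the Pascal VOC format usually contains Annotations/, ImageSets/Main/, and JPEGImages/ subdirectories. The snippet below is a minimal sketch for verifying such a layout; the data/clipart path is only an assumed example:

# Minimal sketch: check that a converted dataset follows the standard
# Pascal VOC layout. "data/clipart" is an assumed example path.
import os

voc_root = "data/clipart"
for sub in ("Annotations", "ImageSets/Main", "JPEGImages"):
    path = os.path.join(voc_root, sub)
    print(f"{path}: {'found' if os.path.isdir(path) else 'missing'}")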

🔔 Pre-trained Models

Following convention, two models pre-trained on ImageNet, i.e., VGG16 and ResNet101, are employed. Please download these two models from the following sources and place them in pretrained_models/:

🌳 Authors' checkpoints

cd checkpoints
bash download_checkpoints_tia.sh

The file structure should be as follows:

TIA/
├── cfgs/
├── data/
├── lib/
├── pretrained_models/
│   ├── resnet101_caffe.pth
│   └── vgg16_caffe.pth
├── models/
│   ├── cityscape/vgg16/TIA/da_faster_rcnn_cityscape_7.pth
│   ├── clipart/res101/TIA/da_faster_rcnn_clipart_7.pth
│   └── kitti2cityscape/vgg16/TIA/da_faster_rcnn_kitti2cityscape_7.pth
├── scripts/
├── _init_paths.py
├── parse.py
├── README.md
├── test_net.py
└── trainval_net.py
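
As a quick sanity check after downloading, the following sketch verifies that the pre-trained backbones and the released checkpoints sit where the layout above expects them (paths copied from the tree; adjust if yours differ):

# Sketch: verify that the pre-trained backbones and released checkpoints
# are located where the file structure above expects them.
import os

expected = [
    "pretrained_models/vgg16_caffe.pth",
    "pretrained_models/resnet101_caffe.pth",
    "models/cityscape/vgg16/TIA/da_faster_rcnn_cityscape_7.pth",
    "models/clipart/res101/TIA/da_faster_rcnn_clipart_7.pth",
    "models/kitti2cityscape/vgg16/TIA/da_faster_rcnn_kitti2cityscape_7.pth",
]
for path in expected:
    print(f"{path}: {'ok' if os.path.isfile(path) else 'MISSING'}")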

📌 Quick start: testing scripts

Run the following command directly to test:

bash scripts/{dataset_name}/test.sh

For example, to test TIA on the clipart dataset:

bash scripts/clipart/test.sh
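
To evaluate all released checkpoints in one go, a small wrapper such as the one below also works; the three benchmark directory names are assumed to match the checkpoint layout above:

# Sketch: run the provided test scripts for each benchmark in sequence.
# The directory names are assumed to match the checkpoint layout above.
import subprocess

for dataset in ("clipart", "cityscape", "kitti2cityscape"):
    subprocess.run(["bash", f"scripts/{dataset}/test.sh"], check=True)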

🔥 Training scripts

Run the following command directly to train:

bash scripts/{dataset_name}/train.sh

For example, to train TIA on the clipart dataset:

bash scripts/clipart/train.sh

❓ Q & A

Q: Why are the training results unstable and inconsistent?

A: First, I think this is an intrinsic problem of DAOD, since GRL-based adversarial training is essentially a compromise of GAN training on large models such as detectors. DAOD and the original GAN therefore share similar properties: both are unstable to train, occasionally even suffering mode collapse, and neither loss is indicative, which makes it difficult to determine convergence. Second, the pixel-level adaptation, i.e., the data augmentation, further reinforces this instability, because we simply wrap both the original and the generated target-like images into the source domain, which can confuse the discriminator's judgment, even though this gives the best performance. Finally, multi-view learning like TIA relies to some extent on the randomness of initialization: if there are large discrepancies between the different classifiers or localizers, they will capture even higher inconsistencies. We have tried to manually constrain the initialization with some success; if you are interested in this, you can also try more diverse initialization constraints.
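
If you want to reduce the run-to-run variance coming from random initialization, fixing the random seeds is a generic first step. The recipe below is a common PyTorch pattern, not the initialization constraint scheme used in the paper:

# Generic PyTorch seeding recipe for reducing run-to-run variance;
# not the initialization constraint scheme used in the paper.
import random

import numpy as np
import torch

def set_seed(seed=42):
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    torch.backends.cudnn.deterministic = True  # slower, but more reproducible
    torch.backends.cudnn.benchmark = False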

Q: Why are there some inconsistencies between the test results of the provided models and the results reported in the paper?

A: After the paper was submitted, I attempted to impose some constraints on the initialization randomness in the adaptation from cityscape to foggy cityscape and obtained better results. I also tried to overcome the severe overfitting problem in the adaptation from kitti to cityscape and found more stable results with the help of early stopping.
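
Early stopping here simply means tracking validation mAP across epochs and keeping the best checkpoint. Below is a minimal, framework-agnostic sketch, where train_one_epoch, evaluate_map, and the patience value are hypothetical placeholders for your own routines:

# Minimal early-stopping sketch: stop when validation mAP has not improved
# for `patience` consecutive epochs. `train_one_epoch` and `evaluate_map`
# are hypothetical placeholders for your own training and evaluation code.
def train_with_early_stopping(train_one_epoch, evaluate_map, max_epochs=10, patience=3):
    best_map, best_epoch = float("-inf"), -1
    for epoch in range(max_epochs):
        train_one_epoch(epoch)
        current_map = evaluate_map(epoch)
        if current_map > best_map:
            best_map, best_epoch = current_map, epoch  # keep this checkpoint
        elif epoch - best_epoch >= patience:
            print(f"Early stop at epoch {epoch}; best mAP {best_map:.3f} at epoch {best_epoch}")
            break
    return best_epoch, best_map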

Q: Why was the code open-sourced so late?

A: I suffer from severe procrastination. If no one pushes me, I will keep procrastinating :), and if someone pushes me, I will keep procrastinating with pressure :(.

🔍 Related repos

Our project references the code in the following repos:

📜 Citing TIA

If you find our code or paper useful for your research, please consider citing our work using the following BibTeX:

@inproceedings{tiadaod22cvpr,
  author    = {Liang Zhao and
               Limin Wang},
  title     = {Task-specific Inconsistency Alignment for Domain Adaptive Object Detection},
  booktitle = {{CVPR}},
  year      = {2022}
}
@article{zhao2022tia,
  author    = {Zhao, Liang and 
               Wang, Limin},
  title     = {Task-specific Inconsistency Alignment for Domain Adaptive Object Detection},
  journal   = {arXiv preprint arXiv:2203.15345},
  year      = {2022}
}
