TIRDet

Source Code for 'TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation'

Accepted by ACM MM'23

This repo highly inherits the mmdetection framework.

Abstract

Cross-modality images that combine visible-infrared spectra can provide complementary information for object detection. In particular, they are well-suited for autonomous vehicle applications in dark environments with limited illumination. However, it is time-consuming to acquire a large number of pixel-aligned visible-thermal image pairs, and real-time alignment is challenging in practical driving systems. Furthermore, the quality of visible-spectrum images can be adversely affected by complex environmental conditions. In this paper, we propose a novel neural network called TIRDet, which only utilizes Thermal InfraRed (TIR) images for mono-modality object detection. To compensate for the lacked visible-band information, we adopt a prior Thermal-To-Visible (T2V) translation model to obtain the translated visible images and the latent T2V codes. In addition, we introduce a novel attention-based Cross-Modality Aggregation (CMA) module, which can augment the modality-translation awareness of TIRDet by preserving the T2V semantic information. Extensive experiments on FLIR and LLVIP datasets demonstrate that our TIRDet significantly outperforms all mono-modality detection methods based on thermal images, and it even surpasses most State-Of-The-Art (SOTA) multispectral methods using visible-thermal image pairs. Code is available at https://github.com/zeyuwang-zju/TIRDet.

Requirements

torch=1.9.1
torchvision=0.9.1
cuda=11.1
mmdet=2.28.2

Usage

Download the FLIR and LLVIP datasets. FLIR: https://www.flir.eu/oem/adas/adas-dataset-form/ LLVIP: https://bupt-ai-cz.github.io/LLVIP/
Prepare the FLIR and LLVIP datasets into Microsoft COCO version.
Use the provided ''mmdet'' and ''configs''.
Download the Pearl-GAN pretrained weights from https://github.com/FuyaLuo/PearlGAN/. Place them into configs/tirdet/pearlgan_ckpt/FLIR_NTIR2DC/.
Follow the implementations of mmdetection to train and test our model.

Pretrained Model Weights

We provide the pretrained model weights on FLIR dataset:

Model	Code
TIRDet-S	2fgt
TIRDet-M	6G3A
TIRDet-L	2fbT

Experiments

Citation

If you are interested this repo for your research, welcome to cite our paper:

@inproceedings{wang2023tirdet,
  title={TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation},
  author={Wang, Zeyu and Colonnier, Fabien and Zheng, Jinghong and Acharya, Jyotibdha and Jiang, Wenyu and Huang, Kejie},
  booktitle={Proceedings of the 31st ACM International Conference on Multimedia},
  pages={2663--2672},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
anchor		anchor
configs		configs
docker		docker
mmdet		mmdet
requirements		requirements
tests		tests
tools		tools
README.md		README.md
model-index.yml		model-index.yml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TIRDet

Abstract

Requirements

Usage

Pretrained Model Weights

Experiments

Citation

About

Releases

Packages

Languages

zeyuwang-zju/TIRDet

Folders and files

Latest commit

History

Repository files navigation

TIRDet

Abstract

Requirements

Usage

Pretrained Model Weights

Experiments

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages