Skip to content

Source Code - "TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation" - ACM MM 2023

Notifications You must be signed in to change notification settings

zeyuwang-zju/TIRDet

Repository files navigation

TIRDet

Source Code for 'TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation'

Accepted by ACM MM'23

image

image

This repo highly inherits the mmdetection framework.

Abstract

Cross-modality images that combine visible-infrared spectra can provide complementary information for object detection. In particular, they are well-suited for autonomous vehicle applications in dark environments with limited illumination. However, it is time-consuming to acquire a large number of pixel-aligned visible-thermal image pairs, and real-time alignment is challenging in practical driving systems. Furthermore, the quality of visible-spectrum images can be adversely affected by complex environmental conditions. In this paper, we propose a novel neural network called TIRDet, which only utilizes Thermal InfraRed (TIR) images for mono-modality object detection. To compensate for the lacked visible-band information, we adopt a prior Thermal-To-Visible (T2V) translation model to obtain the translated visible images and the latent T2V codes. In addition, we introduce a novel attention-based Cross-Modality Aggregation (CMA) module, which can augment the modality-translation awareness of TIRDet by preserving the T2V semantic information. Extensive experiments on FLIR and LLVIP datasets demonstrate that our TIRDet significantly outperforms all mono-modality detection methods based on thermal images, and it even surpasses most State-Of-The-Art (SOTA) multispectral methods using visible-thermal image pairs. Code is available at https://github.com/zeyuwang-zju/TIRDet.

Requirements

  • torch=1.9.1
  • torchvision=0.9.1
  • cuda=11.1
  • mmdet=2.28.2

Usage

  1. Download the FLIR and LLVIP datasets. FLIR: https://www.flir.eu/oem/adas/adas-dataset-form/ LLVIP: https://bupt-ai-cz.github.io/LLVIP/
  2. Prepare the FLIR and LLVIP datasets into Microsoft COCO version.
  3. Use the provided ''mmdet'' and ''configs''.
  4. Download the Pearl-GAN pretrained weights from https://github.com/FuyaLuo/PearlGAN/. Place them into configs/tirdet/pearlgan_ckpt/FLIR_NTIR2DC/.
  5. Follow the implementations of mmdetection to train and test our model.

Pretrained Model Weights

We provide the pretrained model weights on FLIR dataset:

Model Code
TIRDet-S 2fgt
TIRDet-M 6G3A
TIRDet-L 2fbT

Experiments

image

Citation

If you are interested this repo for your research, welcome to cite our paper:

@inproceedings{wang2023tirdet,
  title={TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation},
  author={Wang, Zeyu and Colonnier, Fabien and Zheng, Jinghong and Acharya, Jyotibdha and Jiang, Wenyu and Huang, Kejie},
  booktitle={Proceedings of the 31st ACM International Conference on Multimedia},
  pages={2663--2672},
  year={2023}
}

About

Source Code - "TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation" - ACM MM 2023

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages