[IEEE FG'23] T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models

Modern-day surveillance systems perform person recognition using deep learning-based face verification networks. Most state-of-the-art facial verification systems are trained using visible spectrum images. But, acquiring images in the visible spectrum is impractical in scenarios of low-light and nighttime conditions, and often images are captured in an alternate domain such as the thermal infrared domain. Facial verification in thermal images is often performed after retrieving the corresponding visible domain images. This is a well-established problem often known as the Thermal-to-Visible (T2V) image translation. In this paper, we propose a Denoising Diffusion Probabilistic Model (DDPM) based solution for T2V translation specifically for facial images. During training, the model learns the conditional distribution of visible facial images given their corresponding thermal image through the diffusion process. During inference, the visible domain image is obtained by starting from Gaussian noise and performing denoising repeatedly. The existing inference process for DDPMs is stochastic and time-consuming. Hence, we propose a novel inference strategy for speeding up the inference time of DDPMs, specifically for the problem of T2V image translation. We achieve the state-of-the-art results on multiple datasets.

Prerequisites:

Create a conda environment and activate using

conda env create -f environment.yml
conda activate T2V-diff

Data Preparation

Prepare Data in the following format

    ├── data 
    |   ├── train # Training  
    |   |   ├── TH              # thermal images 
    |   |   └── VIS             # visible images
    |   └── test  # Testing
    |       ├── TH              # thermal images 
    |       ├── sample.png      # one visible sample from training set to take colour bit from

Preprocess the testdata using:

python preprocess_test.py

Training and Testing

Run following commands to train and test

For training:
export PYTHONPATH=$PYTHONPATH:$(pwd)
CUDA_VISIBLE_DEVICES="0" NCCL_P2P_DISABLE=1  torchrun --nproc_per_node=1 --master_port=4326 scripts/T2V_train.py 

For testing:
export PYTHONPATH=$PYTHONPATH:$(pwd)
CUDA_VISIBLE_DEVICES="0" NCCL_P2P_DISABLE=1  torchrun --nproc_per_node=1 --master_port=4326 scripts/T2V_test.py --weights /pathtoweights/ --data_dir /pathtodata/

Model weights

The weights for THVIS dataset can be found in THVIS weights

The weights for ARL-VTF dataset can be found in ARL-VTF weights

Citation

If you use our work, please use the following citation

@article{nair2022t2v,
  title={T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models},
  author={Nair, Nithin Gopalakrishnan and Patel, Vishal M},
  journal={arXiv preprint arXiv:2209.08814},
  year={2022}
}

Acknowledgements

Thanks to authors of Diffusion Models Beat GANs on Image Synthesis sharing their code. Most of the code is borrowed from the guided diffusion

https://github.com/openai/guided-diffusion

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
core		core
guided_diffusion		guided_diffusion
scripts		scripts
LICENSE.md		LICENSE.md
README.md		README.md
environment.yml		environment.yml
preprocess_test.py		preprocess_test.py
run_diff.sh		run_diff.sh
test_diff.sh		test_diff.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core

core

guided_diffusion

guided_diffusion

scripts

scripts

LICENSE.md

LICENSE.md

README.md

README.md

environment.yml

environment.yml

preprocess_test.py

preprocess_test.py

run_diff.sh

run_diff.sh

test_diff.sh

test_diff.sh

Repository files navigation

[IEEE FG'23] T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models

Prerequisites:

Data Preparation

Training and Testing

Model weights

Citation

Acknowledgements

About

Releases

Packages

Languages

License

Nithin-GK/T2V-DDPM

Folders and files

Latest commit

History

Repository files navigation

[IEEE FG'23] T2V-DDPM: Thermal to Visible Face Translation using Denoising Diffusion Probabilistic Models

Prerequisites:

Data Preparation

Training and Testing

Model weights

Citation

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Languages