Xformer: Hybrid X-Shaped Transformer for Image Denoising

Jiale Zhang, Yulun Zhang, Jinjin Gu, Jiahua Dong, Linghe Kong, and Xiaokang Yang, "Xformer: Hybrid X-Shaped Transformer for Image Denoising", ICLR, 2024

[arXiv] [pretrained models]

🔥🔥🔥 News

2024-02-01: We open source model training and testing details, and the pretrained models can be downloaded.

2023-10-21: We release this repository.

Abstract: In this paper, we present a hybrid X-shaped vision Transformer, named Xformer, which performs notably on image denoising tasks. We explore strengthening the global representation of tokens from different scopes. In detail, we adopt two types of Transformer blocks. The spatialwise Transformer block performs fine-grained local patches interactions across tokens defined by spatial dimension. The channel-wise Transformer block performs direct global context interactions across tokens defined by channel dimension. Based on the concurrent network structure, we design two branches to conduct these two interaction fashions. Within each branch, we employ an encoder-decoder architecture to capture multi-scale features. Besides, we propose the Bidirectional Connection Unit (BCU) to couple the learned representations from these two branches while providing enhanced information fusion. The joint designs make our Xformer powerful to conduct global information modeling in both spatial and channel dimensions. Extensive experiments show that Xformer, under the comparable model complexity, achieves state-of-the-art performance on the synthetic and real-world image denoising tasks.

HQ	LQ	SwinIR	Restormer	Xformer (ours)

Installation

This repository is built in PyTorch 1.8.0. (Python3.8, CUDA11.6, cuDNN~).

Make conda environment

conda create -n pytorch18 python=3.8
conda activate pytorch18

Install dependencies

git clone https://github.com/gladzhang/Xformer.git
cd Xformer
pip install torch==1.8.0+cu111 torchvision==0.9.0+cu111 torchaudio==0.8.0 -f https://download.pytorch.org/whl/torch_stable.html
pip install -r requirements.txt

Install basicsr

python setup.py develop --no_cuda_ext

Models

Task	Method	Params (M)	FLOPs (G)	Dataset	PSNR	SSIM	Model Zoo
Color-DN	Xformer	25.23	42.2	Urban100	30.36	0.8941	Google Drive
Gray-DN	Xformer	25.23	42.2	Urban100	28.71	0.8629	Google Drive
Real-DN	Xformer	25.23	42.2	DND	40.19	0.957	Google Drive

We provide the performance Urban100 (level=50, Color-DN) Urban100 (level=50, Gray-DN), and DND (Real-DN). We use the input 3 × 128 × 128 to calculate FLOPS.
Download the models and put them into the folder experiments/pretrained_models . Go to the folder to find details of directory structure.

Datasets

Used training and training sets can be downloaded as follows:

Task	Training Set	Testing Set
GaussionColor image denoising	DIV2K + Flickr2K (2650 images) + BSD500 (400 training&testing images) + WED (4744 images)	CBSD68 + Kodak24 + McMaster + Urban100 [download]
GaussionGrayscale image denoising	DIV2K + Flickr2K (2650 images) + BSD500 (400 training&testing images) + WED (4744 images)	Set12 + BSD68 + Urban100 [download]
real image denoising	SIDD_train	SIDD_val + SIDD_test + DND
Single image motion deblurring	GoPro_train	GoPro + HIDE + RealBlur_J + RealBlur_R

Download training datasets and put them into the folder datasets/ . Go to the folder to find details of directory structure. You should go to datasets/data_prepare to prepare training and validating data in advance.

Download testing datasets and put them into the folder datasets/test/. Go to the folder to find details of directory structure.

Training

Train on GaussionColor image denoising

Please download the corresponding training datasets and put them in the folder datasets\DFWB.
Follow the instructions below to begin training our model.

# using 4 GPU with initial batch size 16 each GPU
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2418 basicsr/train.py -opt options/GaussianColorDenoising_X_FormerSigma15.yml --launcher pytorch
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2416 basicsr/train.py -opt options/GaussianColorDenoising_X_FormerSigma25.yml --launcher pytorch
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2414 basicsr/train.py -opt options/GaussianColorDenoising_X_FormerSigma50.yml --launcher pytorch
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2412 basicsr/train.py -opt options/GaussianColorDenoising_X_Former_Blind.yml --launcher pytorch

Train on GaussionGrayscale image denoising

Please download the corresponding training datasets and put them in the folder datasets\DFWB.
Follow the instructions below to begin training our model.

# using 4 GPU with initial batch size 16 each GPU
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2418 basicsr/train.py -opt options/GaussianGrayscaleDenoising_X_FormerSigma15.yml --launcher pytorch
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2416 basicsr/train.py -opt options/GaussianGrayscaleDenoising_X_FormerSigma25.yml --launcher pytorch
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2414 basicsr/train.py -opt options/GaussianGrayscaleDenoising_X_FormerSigma50.yml --launcher pytorch
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2412 basicsr/train.py -opt options/GaussianGrayscaleDenoising_X_Former_Blind.yml --launcher pytorch

Train on Real Image Denoising

Please download the corresponding training datasets and put them in the folder datasets\SIDD.
Follow the instructions below to begin training our model.

# using 4 GPU with initial batch size 16 each GPU
python -m torch.distributed.launch --nproc_per_node=4 --master_port=6414 basicsr/train.py -opt options/RealDenoising_X_Former.yml --launcher pytorch

Train on Single image motion deblurring

Please download the corresponding training datasets and put them in the folder datasets\GoPro.

Follow the instructions below to begin training our model.

# using 4 GPU with initial batch size 16 each GPU
python -m torch.distributed.launch --nproc_per_node=4 --master_port=2412 basicsr/train.py -opt options/MotionDeblurring_X_Former.yml --launcher pytorch

Testing

Test on Gaussian Color Image Denoising

Please download the corresponding testing datasets and put them in the folder datasets/test/ColorDN. Download the corresponding models and put them in the folder experiments/pretrained_models.
Follow the instructions below to begin testing our Xformer model.

#Xformer model for color image denoising. You can find corresponding results in Table 5 of the main paper.
##noise 15
python evaluate/test_gaussian_color_denoising.py --sigma 15
python evaluate/evaluate_gaussian_color_denoising.py --sigma 15
##noise 25
python evaluate/test_gaussian_color_denoising.py --sigma 25
python evaluate/evaluate_gaussian_color_denoising.py --sigma 25
##noise 50
python evaluate/test_gaussian_color_denoising.py --sigma 50
python evaluate/evaluate_gaussian_color_denoising.py --sigma 50

#Xformer for learning a single model to handle various noise levels. You can find corresponding results in Table 3 of the supplementary material.
##noise 15, 25, 50
python evaluate/test_gaussian_color_denoising.py --model_type blind --sigma 15
python evaluate/test_gaussian_color_denoising.py --model_type blind --sigma 25
python evaluate/test_gaussian_color_denoising.py --model_type blind --sigma 50
python evaluate/evaluate_gaussian_color_denoising.py

Test on Gaussian Grayscale Image Denoising

Please download the corresponding testing datasets and put them in the folder datasets/test/GrayDN. Download the corresponding models and put them in the folder experiments/pretrained_models.
Follow the instructions below to begin testing our Xformer model.

#Xformer model for gray image denoising. You can find corresponding results in Table 4 of the main paper.
##noise 15
python evaluate/test_gaussian_gray_denoising.py --sigma 15
python evaluate/evaluate_gaussian_gray_denoising.py --sigma 15
##noise 25
python evaluate/test_gaussian_gray_denoising.py --sigma 25
python evaluate/evaluate_gaussian_gray_denoising.py --sigma 25
##noise 50
python evaluate/test_gaussian_gray_denoising.py --sigma 50
python evaluate/evaluate_gaussian_gray_denoising.py --sigma 50

#Xformer for learning a single model to handle various noise levels. You can find corresponding results in Table 3 of the supplementary material.
##noise 15, 25, 50
python evaluate/test_gaussian_gray_denoising.py --model_type blind --sigma 15
python evaluate/test_gaussian_gray_denoising.py --model_type blind --sigma 25
python evaluate/test_gaussian_gray_denoising.py --model_type blind --sigma 50
python evaluate/evaluate_gaussian_gray_denoising.py

Test on Real Image Denoising

Please download the corresponding testing datasets and put them in the folder datasets/test/SIDD and datasets/test/SIDD. Download the corresponding models and put them in the folder experiments/pretrained_models.
Follow the instructions below to begin testing our Xformer model.

#Xformer model for real image denoising. You can find corresponding results in Table 6 of the main paper.
##sidd
python evaluate/test_real_denoising_sidd.py
run evaluate_sidd.m
##dnd (You should upload the generated mat files to the online server to get PSNR and SSIM.)
python evaluate/test_real_denoising_dnd.py

🔎 Results

We achieved state-of-the-art performance on Gaussian image denoising and real-world image denoising tasks. More results can be found in the paper.

Quantitative Comparison (click to expan)

results in Table 2-4 of the main paper

Visual Comparison (click to expan)

results in Figure 4 of the main paper

results in Figure 6 of the main paper

📎 Citation

If you find the code helpful in your research or work, please cite the following paper(s).

@inproceedings{zhang2024xformer,
      title={Xformer: Hybrid X-Shaped Transformer for Image Denoising}, 
      author={Jiale Zhang and Yulun Zhang and Jinjin Gu and Jiahua Dong and Linghe Kong and Xiaokang Yang},
      booktitle={ICLR},
      year={2024},
}

💡 Acknowledgements

This work is released under the Apache 2.0 license. The codes are based on Restormer and BasicSR. Please also follow their licenses. Thanks for their awesome works.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
basicsr		basicsr
datasets		datasets
evaluate		evaluate
experiments/pretrained_models		experiments/pretrained_models
figs		figs
options		options
README.md		README.md
VERSION		VERSION
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

gladzhang/Xformer

Folders and files

Latest commit

History

Repository files navigation