ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023, Spotlight)

Zongsheng Yue, Jianyi Wang, Chen Change Loy

Conference Paper | Journal Paper | Project Page | Video

⭐ If ResShift is helpful to your images or projects, please help star this repo. Thanks! 🤗

Diffusion-based image super-resolution (SR) methods are mainly limited by the low inference speed due to the requirements of hundreds or even thousands of sampling steps. Existing acceleration sampling techniques inevitably sacrifice performance to some extent, leading to over-blurry SR results. To address this issue, we propose a novel and efficient diffusion model for SR that significantly reduces the number of diffusion steps, thereby eliminating the need for post-acceleration during inference and its associated performance deterioration. Our method constructs a Markov chain that transfers between the high-resolution image and the low-resolution image by shifting the residual between them, substantially improving the transition efficiency. Additionally, an elaborate noise schedule is developed to flexibly control the shifting speed and the noise strength during the diffusion process. Extensive experiments demonstrate that the proposed method obtains superior or at least comparable performance to current state-of-the-art methods on both synthetic and real-world datasets, even only with 15 sampling steps.

Update

2024.03.11: Update the code for the Journal paper
2023.12.02: Add configurations for the x2 super-resolution task.
2023.08.15: Add .
2023.08.15: Add Gradio Demo.
2023.08.14: Add bicubic (matlab resize) model.
2023.08.14: Add Project Page.
2023.08.02: Add Replicate demo .
2023.07.31: Add Colab demo .
2023.07.24: Create this repo.

Requirements

Python 3.10, Pytorch 2.1.2, xformers 0.0.23
More detail (See environment.yml) A suitable conda environment named resshift can be created and activated with:

conda env create -n resshift python=3.10
conda activate resshift
pip install -r requirements.txt

or

conda env create -f environment.yml
conda activate resshift

Applications

👉 Real-world image super-resolution

👉 Image inpainting

👉 Blind Face Restoration

Online Demo

You can try our method through an online demo:

python app.py

Fast Testing

🐯 Real-world image super-resolution

python inference_resshift.py -i [image folder/image path] -o [result folder] --task realsr --scale 4 --version v3

🦁 Bicubic (resize by Matlab) image super-resolution

python inference_resshift.py -i [image folder/image path] -o [result folder] --task bicsr --scale 4

🐍 Natural image inpainting

python inference_resshift.py -i [image folder/image path] -o [result folder] --mask_path [mask path] --task inpaint_imagenet --scale 1

🐊 Face image inpainting

python inference_resshift.py -i [image folder/image path] -o [result folder] --mask_path [mask path] --task inpaint_face --scale 1

🐙 Blind Face Restoration

python inference_resshift.py -i [image folder/image path] -o [result folder] --task faceir --scale 1

Training

🐢 Preparing stage

Download the pre-trained VQGAN model from this link and put it in the folder of 'weights'
Adjust the data path in the config file.
Adjust batchsize according your GPUS.
- configs.train.batch: [training batchsize, validation batchsize]
- configs.train.microbatch: total batchsize = microbatch * #GPUS * num_grad_accumulation

🐬 Real-world Image Super-resolution for NeurIPS

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --standalone --nproc_per_node=8 --nnodes=1 main.py --cfg_path configs/realsr_swinunet_realesrgan256.yaml --save_dir [Logging Folder]

🐳 Real-world Image Super-resolution for Journal

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --standalone --nproc_per_node=8 --nnodes=1 main.py --cfg_path configs/realsr_swinunet_realesrgan256_journal.yaml --save_dir [Logging Folder]

🐂 Image inpainting (Natural) for Journal

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --standalone --nproc_per_node=8 --nnodes=1 main.py --cfg_path configs/inpaint_lama256_imagenet.yaml --save_dir [Logging Folder]

🐝 Image inpainting (Face) for Journal

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --standalone --nproc_per_node=8 --nnodes=1 main.py --cfg_path configs/inpaint_lama256_face.yaml --save_dir [Logging Folder]

🐸 Blind face restoration for Journal

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --standalone --nproc_per_node=8 --nnodes=1 main.py --cfg_path configs/faceir_gfpgan512_lpips.yaml --save_dir [Logging Folder]

Reproducing the results in our paper

🚗 Prepare data

Synthetic data for image super-resolution: Link
Real data for image super-resolution: RealSet65 | RealSet80
Synthetic data for natural image inpainting: Link
Synthetic data for face image inpainting: Link
Synthetic data for blind face restoration: Link

🚀 Image super-resolution

Reproduce the results in Table 3 of our NeurIPS paper:

python inference_resshift.py -i [image folder/image path] -o [result folder] --task realsr --scale 4 --version v1 --chop_size 64 --chop_stride 64 --bs 64

Reproduce the results in Table 4 of our NeurIPS paper:

python inference_resshift.py -i [image folder/image path] -o [result folder] --task realsr --scale 4 --version v1 --chop_size 512 --chop_stride 448

Reproduce the results in Table 2 of our Journal paper:

python inference_resshift.py -i [image folder/image path] -o [result folder] --task realsr --scale 4 --version v3 --chop_size 64 --chop_stride 64 --bs 64

Reproduce the results in Table 3 of our Journal paper:

python inference_resshift.py -i [image folder/image path] -o [result folder] --task realsr --scale 4 --version v3 --chop_size 512 --chop_stride 448

Model card:

version-1: Conference paper, 15 diffusion steps, trained with 300k iterations.
version-2: Conference paper, 15 diffusion steps, trained with 500k iterations.
version-3: Journal paper, 4 diffusion steps.

✈️ Image inpainting

Reproduce the results in Table 4 of our Journal paper:

python inference_resshift.py -i [image folder/image path] -o [result folder] --mask_path [mask path] --task inpaint_imagenet --scale 1 --chop_size 256 --chop_stride 256 --bs 32

Reproduce the results in Table 5 of our Journal paper:

python inference_resshift.py -i [image folder/image path] -o [result folder] --mask_path [mask path] --task inpaint_face --scale 1 --chop_size 256 --chop_stride 256 --bs 32

⛵ Blind Face Restoration

Reproduce the results in Table 6 of our Journal paper (arXiv):

python inference_resshift.py -i [image folder/image path] -o [result folder] --task faceir --scale 1 --chop_size 256 --chop_stride 256 --bs 16

License

This project is licensed under NTU S-Lab License 1.0. Redistribution and use should follow this license.

Acknowledgement

This project is based on Improved Diffusion Model, LDM, and BasicSR. We also adopt Real-ESRGAN to synthesize the training data for real-world super-resolution. Thanks for their awesome works.

Contact

If you have any questions, please feel free to contact me via zsyzam@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
assets		assets
basicsr		basicsr
configs		configs
datapipe		datapipe
ldm		ldm
models		models
scripts		scripts
testdata		testdata
utils		utils
LICENSE		LICENSE
README.md		README.md
app.py		app.py
cog.yaml		cog.yaml
environment.yml		environment.yml
inference_resshift.py		inference_resshift.py
main.py		main.py
predict.py		predict.py
requirements.txt		requirements.txt
sampler.py		sampler.py
trainer.py		trainer.py

License

zsyOAOA/ResShift

Folders and files

Latest commit

History

Repository files navigation

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS 2023, Spotlight)

Update

Requirements

Applications

👉 Real-world image super-resolution

👉 Image inpainting

👉 Blind Face Restoration

Online Demo

Fast Testing

🐯 Real-world image super-resolution

🦁 Bicubic (resize by Matlab) image super-resolution

🐍 Natural image inpainting

🐊 Face image inpainting

🐙 Blind Face Restoration

Training

🐢 Preparing stage

🐬 Real-world Image Super-resolution for NeurIPS

🐳 Real-world Image Super-resolution for Journal

🐂 Image inpainting (Natural) for Journal

🐝 Image inpainting (Face) for Journal

🐸 Blind face restoration for Journal

Reproducing the results in our paper

🚗 Prepare data

🚀 Image super-resolution

Model card:

✈️ Image inpainting

⛵ Blind Face Restoration

License

Acknowledgement

Contact

About

Resources

License

Stars

Watchers

Forks

Languages