
DyNet

Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration

Mohamed bin Zayed University of AI, Inception Institute of AI, Australian National University, University of California - Merced, Yonsei University, Google Research, Linköping University

Paper: https://arxiv.org/abs/2404.02154

Latest

  • 2024/04/02: We released our paper on arXiv. Stay tuned for our Million-IRD dataset, code, and trained models.

Abstract: All-in-one image restoration tackles different types of degradations with a unified model instead of having task-specific, non-generic models for each degradation. The requirement to tackle multiple degradations with the same model can lead to high-complexity designs with fixed configurations that lack adaptability to more efficient alternatives. We propose DyNet, a dynamic family of networks designed in an encoder-decoder style for all-in-one image restoration tasks. DyNet can seamlessly switch between its bulkier and lightweight variants, thereby offering flexibility for efficient model deployment with a single round of training. This seamless switching is enabled by our weights-sharing mechanism, which forms the core of our architecture and facilitates the reuse of initialized module weights. Further, to establish robust weights initialization, we introduce a dynamic pre-training strategy that trains variants of the proposed DyNet concurrently, thereby achieving a 50% reduction in GPU hours. To tackle the unavailability of the large-scale dataset required for pre-training, we curate a high-quality, high-resolution image dataset named Million-IRD, containing 2M image samples. We validate our DyNet for image denoising, deraining, and dehazing in an all-in-one setting, achieving state-of-the-art results with a 31.34% reduction in GFlops and a 56.75% reduction in parameters compared to baseline models.

Dynamic Network Architecture

Proposed Weights-Sharing Strategy
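
To make the idea concrete, below is a minimal, hypothetical PyTorch sketch of weights-sharing: a single pool of modules is initialized once, and the bulkier variant simply reuses the same shared weights for more passes than the lightweight variant, so switching between variants requires no retraining. The class name SharedEncoderStage, the block design, and the depths are illustrative assumptions, not the authors' implementation.

import torch
import torch.nn as nn

class SharedEncoderStage(nn.Module):
    """Illustrative stage whose block weights are shared across variants."""
    def __init__(self, dim=48, shared_depth=2):
        super().__init__()
        # One pool of modules, initialized once and reused by every variant.
        self.blocks = nn.ModuleList(
            nn.Sequential(nn.LayerNorm(dim), nn.Linear(dim, dim), nn.GELU())
            for _ in range(shared_depth)
        )

    def forward(self, x, num_passes=1):
        # Lightweight variant: few passes. Bulkier variant: more passes
        # over the very same weights, so switching needs no retraining.
        for _ in range(num_passes):
            for block in self.blocks:
                x = x + block(x)  # residual reuse of the shared block
        return x

stage = SharedEncoderStage()
tokens = torch.randn(1, 64, 48)        # (batch, tokens, channels)
light = stage(tokens, num_passes=1)    # lightweight variant
bulky = stage(tokens, num_passes=3)    # bulkier variant, same parameter count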

Sample Visualization of Dynamic Pre-training
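
The abstract describes a dynamic pre-training strategy that trains variants of DyNet concurrently on shared weights. Below is a hedged sketch of one plausible reading: at each step a variant depth is sampled, so every configuration's shared weights receive gradient updates within a single training run. The uniform sampling, L1 loss, and depths here are assumptions for illustration, not the paper's recipe.

import random
import torch
import torch.nn as nn

# A shared pool of blocks, reused by every variant (as in the sketch above).
blocks = nn.ModuleList(nn.Linear(48, 48) for _ in range(2))

def run_variant(x, num_passes):
    # More passes = bulkier variant; the weights are the same either way.
    for _ in range(num_passes):
        for block in blocks:
            x = x + torch.relu(block(x))
    return x

optimizer = torch.optim.AdamW(blocks.parameters(), lr=2e-4)
for step in range(100):
    degraded = torch.randn(4, 48)       # stand-in for a degraded batch
    clean = torch.randn(4, 48)          # stand-in for the clean targets
    passes = random.choice([1, 3])      # sample a variant at each step
    loss = nn.functional.l1_loss(run_variant(degraded, passes), clean)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()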

Samples from our Million-IRD Dataset

Installation and Data Preparation

Refer to INSTALL.md for instructions on installing dependencies and preparing the datasets needed to use this codebase.

Training

Once the training data is organized in the data/ directory, run

python train.py

To begin training the model, use the --de_type argument to select the combination of degradation types to train on. By default, it is configured to include all three degradation types: noise, rain, and haze.

Example Usage: to train on deraining and dehazing:

python train.py --de_type derain dehaze
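
Assuming --de_type accepts any subset of the three documented degradation types (the exact token for denoising, written as denoise below, is a guess; check the accepted choices in train.py), training on all three explicitly would look like:

python train.py --de_type denoise derain dehaze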

Testing

After arranging the test data in the test/ directory, move the model checkpoint file to the ckpt directory. You can download the pretrained model from here or find it under the Releases tab. To run the evaluation, use

python test.py --mode {n}

n specifies the task to evaluate: use 0 for denoising, 1 for deraining, 2 for dehazing, and 3 for the all-in-one setting. Example Usage: To test all the degradation types at once, run the following:

python test.py --mode 3
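
Similarly, to evaluate a single task, pass the corresponding mode from the list above, e.g. for denoising only:

python test.py --mode 0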

Results

Performance results of the DyNet framework trained under the all-in-one setting

Table: Comparison results in the all-in-one restoration setting. Our DyNet-L model outperforms PromptIR [36] by 0.82 dB on average across tasks. Additionally, our DyNet-S model achieves a 0.59 dB average improvement over PromptIR [36], with reductions of 31.34% in GFlops and 56.75% in parameters.

Citation

If you use our work, please consider citing us:

@misc{dudhane2024dynamic,
      title={Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration}, 
      author={Akshay Dudhane and Omkar Thawakar and Syed Waqas Zamir and Salman Khan and Ming-Hsuan Yang and Fahad Shahbaz Khan},
      year={2024},
      eprint={2404.02154},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
