DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution

Overview framework

Image super-resolution pursuits reconstructing high-fidelity high-resolution counterpart for low-resolution (LR) image. In recent years, diffusion-based models have garnered significant attention due to their capabilities with rich prior knowledge. The success of diffusion models based on general text prompts has validated the effectiveness of textual control in the field of text2image. However, given the severe degradation commonly presented in low-resolution images, coupled with the randomness characteristics of diffusion models, current models struggle to adequately discern semantic and degradation information within severely degraded images. This often leads to obstacles such as semantic loss, visual artifacts, and visual hallucinations, which pose substantial challenges for practical use. To address these challenges, this paper proposes to leverage degradation-aligned language prompt for accurate, fine-grained, and high-fidelity image restoration. Complementary priors including semantic content descriptions and degradation prompts are explored. Specifically, on one hand, image-restoration prompt alignment decoder is proposed to automatically discern the degradation degree of LR images, thereby generating beneficial degradation priors for image restoration. On the other hand, much richly tailored descriptions from pretrained multimodal large language model elicit high-level semantic priors closely aligned with human perception, ensuring fidelity control for image restoration. Comprehensive comparisons with state-of-the-art methods have been done on several popular synthetic and real-world benchmark datasets. The quantitative and qualitative analysis have demonstrated that the proposed method achieves a new state-of-the-art perceptual quality level, especially in real-world cases based on reference-free metrics.

Visual Examples

Installation

Clone this Repo and Create Conda Environment and Install Package

## git clone this repository
git clone https://github.com/puppy210/DaLPSR.git
cd DalPSR

# create an environment with python >= 3.9
conda create -n DalPSR python=3.9
conda activate DalPSR
pip install --upgrade pip
pip install -r requirements.txt

Download Pre-trained Models
Pre-trained Models:
- stable-diffusion-2-base: stable-diffusion-2-base
- RAM: RAM-Swin-Large-14M
- LLaVA: LLaVA

Citations

If our paper helps your research or work, please consider citing our paper. The following are BibTeX references:

@article{jiang2024dalpsr,
  title={DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution},
  author={Jiang, Aiwen and Wei, Zhi and Peng, Long and Liu, Feiqiang and Li, Wenbo and Wang, Mingwen},
  journal={arXiv preprint arXiv:2406.16477},
  year={2024}
}

Acknowledgments

Some code is sourced from SUPIR and SeeSR. We appreciate their excellent work.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
basicsr		basicsr
figs		figs
models		models
utils		utils
README.md		README.md
requirements.txt		requirements.txt
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution

Overview framework

Visual Examples

Installation

Pre-trained Models:

Citations

Acknowledgments

About

Releases

Packages

Languages

puppy210/DaLPSR

Folders and files

Latest commit

History

Repository files navigation

DaLPSR: Leverage Degradation-Aligned Language Prompt for Real-World Image Super-Resolution

Overview framework

Visual Examples

Installation

Pre-trained Models:

Citations

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages