Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution

This project introduces a super-resolution algorithm using diffusion models, combining a low-resolution image with features from multiple low-quality images. It achieves high-quality outputs with minimized distortions in identity and delivers state-of-the-art results on CelebA and Quis-Campi datasets.

Overview of the proposed method.

Fig 1: The low-resolution images LR₁, ..., LRₙ are used to compute a set of features F₁, ..., Fₙ, respectively, which are then combined to generate Fₘ. The low-resolution image LR₀ is integrated with Fₘ in the diffusion model to produce a super-resolution (SR) image. The SR image is subsequently compared with a set of images from the gallery for face recognition.

Qualitative Results

Fig 2: Comparison of low-resolution (LR), super-resolution (SR) results obtained by various methods, and ground gruth (GT) images from the Quis-Campi dataset. FASR outperforms baseline methods, preserving facial symmetry and natural appearance.

This project was built using a fork of Score-SDE and SDE-SR.

Prepare conda environment

conda create -n fasr python=3.8.2

Install requirements

pip3 install -r requirements.txt

Also install jax+cuda

pip install --upgrade jax==0.2.8 jaxlib==0.1.59+cuda110 -f https://storage.googleapis.com/jax-releases/jax_releases.html

Activate conda environment

conda activate fasr

Tfrecords

The algorithm processes images in TFRecords format, which can be generated using Progressive Growing of GANs using:

python dataset_tool.py create_from_images tfrecords_path images_path --shuffle 0

In the sample_imgs/tfrecords folder there is a sample of 10 images from the CelebA dataset.

Adaface

Download the R18 CASIA-WebFace feature extractor from Adaface here and place it in the pretrained_adaface directory.

Pre-trained FASR model

Download our pre-trained model here and place it in the exps/checkpoints-meta directory.

Sample images and feature extraction

In sample_images, you will find a sample of images, with gallery images in gallery, low-resolution images used for feature extraction in LR_imgs, probe images in high resolution in probe_HR, and reference low-resolution images used for super-resolution in probe_LR.

For the calculation of the mean feature, use features_extract.py. Save the features in sample_imgs/features.

Adjust settings and path in files config/default_ve_configs.py and configs/ve/sr_ve.py.

Generate SR images

CUDA_VISIBLE_DEVICES=0 python3 main.py --config 'configs/ve/sr_ve.py' --mode 'sr' --workdir exps

Train a new model

CUDA_VISIBLE_DEVICES=0 python3 main.py --config 'configs/ve/sr_ve.py' --mode 'train' --workdir exps

Citation

DOS SANTOS, Marcelo et al. "Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution." In: 2024 37th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI). IEEE, 2024. p. 1-6. [IEEE Xplore] [arXiv]

@inproceedings{santos2024multi,
  title = {Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution},
  author = {M. {dos Santos} and R. {Laroca} and R. O. {Ribeiro} and J. {Neves} and D. {Menotti}},
  year = {2024},
  month = {Sept},
  booktitle = {Conference on Graphics, Patterns and Images (SIBGRAPI)},
  volume = {},
  number = {},
  pages = {1-6},
  doi = {10.1109/SIBGRAPI62404.2024.10716316},
  issn = {1530-1834},
}

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
aux		aux
configs		configs
exps		exps
models		models
op		op
sample_imgs		sample_imgs
.gitignore		.gitignore
README.md		README.md
accuracy.py		accuracy.py
datasets.py		datasets.py
datasets_train.py		datasets_train.py
debug.py		debug.py
fasr.png		fasr.png
fasr_results.png		fasr_results.png
features_extract.py		features_extract.py
likelihood.py		likelihood.py
losses.py		losses.py
main.py		main.py
net.py		net.py
requirements.txt		requirements.txt
run.txt		run.txt
run_lib.py		run_lib.py
sampling.py		sampling.py
sde_lib.py		sde_lib.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution

Overview of the proposed method.

Qualitative Results

Prepare conda environment

Tfrecords

Adaface

Pre-trained FASR model

Sample images and feature extraction

Generate SR images

Train a new model

Citation

About

Releases

Packages

Languages

marcelowds/fasr

Folders and files

Latest commit

History

Repository files navigation

Multi-Feature Aggregation in Diffusion Models for Enhanced Face Super-Resolution

Overview of the proposed method.

Qualitative Results

Prepare conda environment

Tfrecords

Adaface

Pre-trained FASR model

Sample images and feature extraction

Generate SR images

Train a new model

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages