[NeurIPS 2025] Ranking-based Preference Optimization
for Diffusion Models from Implicit User Feedback

This is the official implementation of the paper, Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback.

Requirements

Install the required dependencies using the following command:

pip install -r requirements.txt

Note: The results in the paper were obtained using Python 3.9.20 and torch==2.3.1 with cuda-12.1.

Datasets

Pick-a-Pic v2

The script tools/pickapic.py can automatically download and preprocess the Pick-a-Pic v2 dataset. It will select top-500 images for training.

Use PickScore to select top-500 images:

accelerate launch --multi_gpu --num_processes 8 \
    -m tools.pickapic \
        --score pickscore \
        --output ./data/pickapicv2_pickscore_500

Use HPSv2 to select top-500 images:

accelerate launch --multi_gpu --num_processes 8 \
    -m tools.pickapic \
        --score hpsv2 \
        --output ./data/pickapicv2_hpsv2_500

For testing, we use the official Pick-a-Pic v2 test set. Run the following script to download and organize the test set:

python -m tools.pickapic_test \
    --output ./data/pickapicv2_test

HPDv2

The script tools.hpdv2_benchmark.py can automatically download and organize the HPDv2 benchmark dataset for testing.

python -m tools.hpdv2_benchmark \
    --output ./data/hpdv2_benchmark

Models

Train From Scratch

accelerate launch --multi_gpu --gpu_ids 0,1,2,3 --num_processes 4 train.py \
    --train_dataset ./data/pickapicv2_hpsv2_500 \
    --logdir ./logs/sd15_diffusion-dro

For more training options, please refer to python train.py --help.

Inference

Inference with the pre-trained model from huggingface hub:

Pick-a-Pic v2 test:

accelerate launch --gpu_ids 0,1,2,3 --multi_gpu --num_processes 4 inference.py \
    --unet ylwu/diffusion-dro-sd1.5 \
    --unet_subfolder unet \
    --test_dataset ./data/pickapicv2_test \
    --output ./output/pickapicv2_test

HPDv2 Benchmark

accelerate launch --gpu_ids 0,1,2,3 --multi_gpu --num_processes 4 inference.py \
    --unet ylwu/diffusion-dro-sd1.5 \
    --unet_subfolder unet \
    --test_dataset ./data/hpdv2_benchmark \
    --output ./output/hpdv2_benchmark

It also supports inference with a local checkpoint by providing the path to --unet, e.g., --unet ./logs/sd15_diffusion-dro/ckpt-25600.

Evaluation

Calculate PickScore, HPSv2, Aesthetic Score, CLIP Score, and ImageReward for the generated images

Pick-a-Pic v2 test:

accelerate launch --gpu_ids 0,1,2,3 --multi_gpu --num_processes 4 score.py \
    --pickscore --hpsv2 --aestheticv1 --clip --imagereward \
    --dir ./output/pickapicv2_test

HPDv2 Benchmark:

accelerate launch --gpu_ids 0,1,2,3 --multi_gpu --num_processes 4 score.py \
    --pickscore --hpsv2 --aestheticv1 --clip --imagereward \
    --dir ./output/hpdv2_benchmark

Citation

@misc{wu2025rankingbasedpreferenceoptimizationdiffusion,
      title={Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback},
      author={Yi-Lun Wu and Bo-Kai Ruan and Chiang Tseng and Hong-Han Shuai},
      year={2025},
      eprint={2510.18353},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2510.18353},
}

LICENSE

This model is a fine-tuned version of Stable Diffusion, released under the CreativeML Open RAIL-M License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

[NeurIPS 2025] Ranking-based Preference Optimization
for Diffusion Models from Implicit User Feedback

Requirements

Datasets

Models

Train From Scratch

Inference

Evaluation

Citation

LICENSE

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
misc		misc
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
inference.py		inference.py
readme.md		readme.md
requirements.txt		requirements.txt
score.py		score.py
train.py		train.py

License

basiclab/DiffusionDRO

Folders and files

Latest commit

History

Repository files navigation

[NeurIPS 2025] Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback

Requirements

Datasets

Models

Train From Scratch

Inference

Evaluation

Citation

LICENSE

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

[NeurIPS 2025] Ranking-based Preference Optimization
for Diffusion Models from Implicit User Feedback

Packages