Score-Based Density Estimation from Pairwise Comparisons
Official PyTorch implementation of the ICLR 2026 paper
Score-Based Density Estimation from Pairwise Comparisons
Petrus Mikkola, Luigi Acerbi, Arto Klami
https://arxiv.org/abs/2510.09146
Abstract: We study density estimation from pairwise comparisons, motivated by expert knowledge elicitation and learning from human feedback. We relate the unobserved target density to a tempered winner density (marginal density of preferred choices), learning the winner's score via score-matching. This allows estimating the target by `de-tempering' the estimated winner density's score. We prove that the score vectors of the belief and the winner density are collinear, linked by a position-dependent tempering field. We give analytical formulas for this field and propose an estimator for it under the Bradley--Terry model. Using a diffusion model trained on tempered samples generated via score-scaled annealed Langevin dynamics, we can learn complex multivariate belief densities of simulated experts, from only hundreds to thousands of pairwise comparisons.
Install dependencies:
pip install -r requirements.txt
This project is authored by Petrus Mikkola (petrus-mikkola).
As this work incorporates and adapts code from projects licensed under CC BY-NC-SA 4.0, this entire repository is distributed under the same terms to comply with the "ShareAlike" requirement.
This repository is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This project builds upon and modifies code from the following sources:
-
NVIDIA CORPORATION & AFFILIATES © 2024
- Source: edm2
- License: CC BY-NC-SA 4.0
- Contribution: Modified logic from
generate_images.pyandtoy_example.py. Includedphema.py.
-
Google Research Authors © 2020
- Source: score_sde_pytorch
- License: Apache License, Version 2.0
- Contribution: Modified logic from
likelihood.py.
Accepted to ICLR 2026, the citation will be updated soon..
@inproceedings{mikkola2026scorepair,
author = {Mikkola, Petrus and Acerbi, Luigi and Klami, Arto},
booktitle = {The Fourteenth International Conference on Learning Representations},
title = {Preferential Normalizing Flows},
year = {2026}
}
