Skip to content

Commit

Permalink
Clarified the methodology of the experiment, and added a link to the …
Browse files Browse the repository at this point in the history
…now-released code to compute the Elo scores.

PiperOrigin-RevId: 621809358
  • Loading branch information
Martin Bruse authored and Copybara-Service committed Apr 4, 2024
1 parent 55f09bb commit 4a841ea
Showing 1 changed file with 13 additions and 8 deletions.
21 changes: 13 additions & 8 deletions mucped23/README.md
Expand Up @@ -48,15 +48,16 @@ The exact images used can be downloaded at

The evaluations use the same methodology as [CLIC](http://compression.cc/).

It consists in requiring a choice between two different distortions of the same
image, and computes an [Elo](https://en.wikipedia.org/wiki/Elo_rating_system)
ranking (an estimate of the probability of each method being considered higher
quality by the raters) of distortions based on that. Compared to traditional
Opinion Score methods, it avoids requiring test subjects to calibrate their
scores.
It consists of the rater being shown an original image and two distortions, and
asked to choose the distortion that is closest to the original. The test subject
is able to flip between the two distortions, and has the original image
available on the side for comparison at all times.

The test subject is able to flip between the two distortions, and has the
original image available on the side for comparison at all times.
We then compute an [Elo](https://en.wikipedia.org/wiki/Elo_rating_system)
ranking (an estimate of the probability of each method being considered closer
to the original by the raters) of distortions based on that. Compared to
traditional Opinion Score methods, it avoids having to calibrate scores between
questions or subjects.

The distortions used are encoding and decoding using
[MozJPEG](https://github.com/mozilla/mozjpeg),
Expand Down Expand Up @@ -97,3 +98,7 @@ compute the Elo scores based on [answers.csv](answers.csv). This code does not
contain the complete algorithm to compute the
[Elo](https://en.wikipedia.org/wiki/Elo_rating_system) including rater agreement
and golden question response.

Complete code to compute the Elo scores, including credible intervals and rater
reliability, can be found in
[elo_rater_model](https://github.com/google-research/google-research/tree/master/elo_rater_model).

0 comments on commit 4a841ea

Please sign in to comment.