The test scores identical images inconsistently #4

Closed
kornelski opened this issue Jun 19, 2021 · 1 comment

Comments

@kornelski

In oracle.csv there are these two lines:

726ade5797570bea0c3a81a81f0d62a0.png,28118abf1cf068242fe360fd51f701ac.png,28118abf1cf068242fe360fd51f701ac.png,1
30f0e4fb2fd7f39dcb582fb0fd8d209b.png,1e1f7115e11bd7a6426e74f932f017de.png,1e1f7115e11bd7a6426e74f932f017de.png,0

Note that A and B are exactly the same file, but each time a different one is preferred.
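A minimal sketch of how such rows could be found, assuming each line of oracle.csv has four comma-separated fields and that the second and third fields are A and B (the helper name `find_identical_pairs` is hypothetical, and the sample below contains only the two lines quoted above):

```python
import csv
import io

def find_identical_pairs(rows):
    """Return rows whose second and third fields (assumed to be A and B) match."""
    return [row for row in rows if len(row) >= 4 and row[1] == row[2]]

# The two lines quoted from oracle.csv; to scan the real file, pass
# csv.reader(open("oracle.csv", newline="")) instead.
sample = io.StringIO(
    "726ade5797570bea0c3a81a81f0d62a0.png,28118abf1cf068242fe360fd51f701ac.png,"
    "28118abf1cf068242fe360fd51f701ac.png,1\n"
    "30f0e4fb2fd7f39dcb582fb0fd8d209b.png,1e1f7115e11bd7a6426e74f932f017de.png,"
    "1e1f7115e11bd7a6426e74f932f017de.png,0\n"
)

identical = find_identical_pairs(csv.reader(sample))
for row in identical:
    # Print the duplicated file name and the label it received.
    print(row[1], "label:", row[3])
```

Both quoted rows are returned, one labelled 1 and the other 0.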

@gtoderici
Collaborator

Thanks for looking deeper into the data!

However, this is working as intended (i.e., it's not surprising you might have the same crop with reversed labels).

It is possible that two raters rated the same crop with differing opinions. Since in this task we cannot represent "equal", the closest we can do is have a 1 and a 0.

In the "real" task (i.e., not in validation) we do take this into account in the rank-correlation computation, though not in the accuracy task. We'll report all of these metrics on the leaderboard shortly (they're not reported yet).

For reference, what I mean by "rank" is that we use each of these answers to compute a ranking of the N compression methods participating in the challenge. We then compare the ranking obtained by running the classifier against the ranking obtained from the humans' preferences, which is the data you're looking at.
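To illustrate the idea only (this is not the challenge's actual evaluation code), here is a rough sketch that ranks methods by pairwise wins and compares two rankings with Spearman's rank correlation. The method names, the tuple layout, the convention that label 1 means the second method was preferred, and the choice of Spearman are all assumptions made for the example:

```python
from collections import Counter

def ranking_from_votes(votes):
    """Order methods by number of pairwise wins, most wins first.

    `votes` holds (method_a, method_b, label) tuples. Label 1 is assumed to
    mean method_b was preferred, label 0 method_a. Ties in win count are
    broken alphabetically, purely to keep the result deterministic.
    """
    wins = Counter()
    for method_a, method_b, label in votes:
        wins[method_a] += 0  # make sure every method appears, even with no wins
        wins[method_b] += 0
        wins[method_b if label == 1 else method_a] += 1
    return sorted(wins, key=lambda m: (-wins[m], m))

def spearman(order_x, order_y):
    """Spearman rank correlation between two orderings of the same items (no ties)."""
    n = len(order_x)
    position_y = {m: i for i, m in enumerate(order_y)}
    d2 = sum((i - position_y[m]) ** 2 for i, m in enumerate(order_x))
    return 1 - 6 * d2 / (n * (n * n - 1))

# Hypothetical votes over three hypothetical methods "x", "y", "z".
human_votes = [("x", "y", 0), ("x", "z", 0), ("y", "z", 0)]
classifier_votes = [("x", "y", 1), ("x", "z", 0), ("y", "z", 0)]

human_rank = ranking_from_votes(human_votes)
classifier_rank = ranking_from_votes(classifier_votes)
rho = spearman(human_rank, classifier_rank)
print(human_rank, classifier_rank, rho)
```

In this toy setup, a single flipped answer swaps the top two methods in the classifier's ranking, whereas per-answer accuracy would only register one wrong label out of three.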

In a future revision of the benchmark we are considering more aggressive data filtering, but given that this was released as is, and it has been used by multiple participants for testing purposes (not for competition), we don't plan to change this file at this time.
