Skip to content

Support for computing Elo ratings#25

Merged
geoalgo merged 9 commits intomainfrom
elo
Mar 30, 2026
Merged

Support for computing Elo ratings#25
geoalgo merged 9 commits intomainfrom
elo

Conversation

@geoalgo
Copy link
Copy Markdown
Collaborator

@geoalgo geoalgo commented Mar 19, 2026

Several people are using this script to compute ELO ratings, it would be better to have it supported in mainline.

We will have several improvements based on @kargibora 's work to improve the accuracy of the estimation and the confidence interval.

@ErlisLushtaku
Copy link
Copy Markdown
Collaborator

@cursor review

@geoalgo
Copy link
Copy Markdown
Collaborator Author

geoalgo commented Mar 20, 2026

Just FYI @ErlisLushtaku I reviewed this code already with Claude, I think it is ready for human review :-)

@ErlisLushtaku
Copy link
Copy Markdown
Collaborator

Just FYI @ErlisLushtaku I reviewed this code already with Claude, I think it is ready for human review :-)

Will give it the human review soon, was just testing some features of cursor if it's any good. It's not working on others' PRs apparently 😄

Copy link
Copy Markdown
Collaborator

@ErlisLushtaku ErlisLushtaku left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot for the PR David. I had mostly some questions to clarify, and a couple of comments.

Comment thread openjury/generate_and_evaluate.py Outdated
Comment thread judgearena/estimate_elo_ratings.py
Comment thread judgearena/arenas_utils.py
Comment thread openjury/estimate_elo_ratings.py Outdated
Comment thread openjury/estimate_elo_ratings.py Outdated
Comment thread openjury/estimate_elo_ratings.py Outdated
Comment thread openjury/estimate_elo_ratings.py Outdated
Comment thread openjury/estimate_elo_ratings.py Outdated
Comment thread openjury/estimate_elo_ratings.py Outdated
Comment thread judgearena/evaluate.py
@geoalgo geoalgo merged commit d2d67d6 into main Mar 30, 2026
1 check passed
@geoalgo geoalgo deleted the elo branch March 30, 2026 19:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants