Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement ELO progression, better progress tracking and archive flow #15

Merged
merged 49 commits into from
Jul 17, 2021

Conversation

codeandkey
Copy link
Owner

@codeandkey codeandkey commented Jul 17, 2021

This PR adds scripts for tracking the progression of the model strength. It adds skeleton code for tracking several other long-term changes in model performance as well. It would be good to add tracking for average loss by generation as well as delta loss for each training run.

ELO ratings are currently a tournament-like performance estimate and need to be changed to a true ELO calculation as the model should clearly not be rated 700 as a worst possible performance.

This PR also refactors the archiving system into dir.rs, removing the need for moving files and folders after every training session and eliminating the dependency on fs_extra. This will definitely require additional testing before merging.

Incidentally this PR also contains modifications to the PUCT calucation as well as changing the model optimizer to RMSprop for dynamic learning rates with an added L2 regularization.

Closes #3
Closes #4
Closes #12

Remaining tasks before merging:

  • Accurate ELO calculation
  • Tests for archive flow
  • Tests for training flow (excl. TUI, or just exclude train.rs entirely)
  • Train stdout in TUI log, archive as well
  • Shorter performance evaluation
  • Potentially shorter nodecount, faster games

@coveralls
Copy link

coveralls commented Jul 17, 2021

Coverage Status

Coverage increased (+5.7%) to 94.187% when pulling 5dc97e5 on elo-progression into 3d08dfe on master.

@codeandkey codeandkey merged commit 86e54fb into master Jul 17, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Use builder pattern with search Temperature refactoring Constants refactoring
2 participants