Implement ELO progression, better progress tracking and archive flow #15

codeandkey · 2021-07-17T13:47:51Z

This PR adds scripts for tracking the progression of the model strength. It adds skeleton code for tracking several other long-term changes in model performance as well. It would be good to add tracking for average loss by generation as well as delta loss for each training run.
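The two loss metrics mentioned above could be computed roughly as follows. This is a minimal sketch, not the code in this PR: the function names are hypothetical, and it assumes a training run yields one loss value per batch and that "delta loss" means the change from the first batch to the last.

```rust
/// Mean loss over all batches of one training run (hypothetical helper).
/// Returns `None` when no losses were recorded.
pub fn average_loss(losses: &[f64]) -> Option<f64> {
    if losses.is_empty() {
        return None;
    }
    Some(losses.iter().sum::<f64>() / losses.len() as f64)
}

/// Change in loss over one training run, assumed here to be
/// last batch loss minus first batch loss (negative means improvement).
pub fn delta_loss(losses: &[f64]) -> Option<f64> {
    let first = losses.first()?;
    let last = losses.last()?;
    Some(last - first)
}
```

Storing both values per generation would be enough to plot them alongside the ELO progression.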

ELO ratings are currently a tournament-style performance estimate and need to be replaced with a true ELO calculation: the model should clearly not be rated 700 as its worst possible performance.
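For illustration, here is a sketch of the difference between the two approaches. It is not the code in this PR: the function names, the K-factor and the opponent rating of 1100 are assumptions. The linear estimate shown is one common tournament-style formula, and it has a hard floor of 400 points below the opponent, which would be consistent with a fixed minimum such as 700.

```rust
/// Expected score (between 0 and 1) of a player rated `rating`
/// against an opponent rated `opponent`, per the standard Elo formula.
pub fn expected_score(rating: f64, opponent: f64) -> f64 {
    1.0 / (1.0 + 10f64.powf((opponent - rating) / 400.0))
}

/// One incremental Elo update. `score` is 1.0 for a win, 0.5 for a draw
/// and 0.0 for a loss; `k` is the K-factor (an assumed tuning parameter).
pub fn update_rating(rating: f64, opponent: f64, score: f64, k: f64) -> f64 {
    rating + k * (score - expected_score(rating, opponent))
}

/// Linear tournament-style performance estimate (assumed formula):
/// opponent rating plus 400 * (wins - losses) / games.
/// It can never drop below `opponent - 400`.
pub fn linear_performance(opponent: f64, wins: u32, losses: u32, games: u32) -> f64 {
    if games == 0 {
        return opponent;
    }
    opponent + 400.0 * (wins as f64 - losses as f64) / games as f64
}
```

Losing all 10 of 10 games against a 1100-rated opponent gives a linear estimate of exactly 700, whereas repeated calls to `update_rating` keep lowering the rating with every loss, by smaller steps as the expected score approaches zero.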

This PR also refactors the archiving system into dir.rs, removing the need for moving files and folders after every training session and eliminating the dependency on fs_extra. This will definitely require additional testing before merging.

Incidentally, this PR also modifies the PUCT calculation and changes the model optimizer to RMSprop for dynamic learning rates, with added L2 regularization.
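For reference, a minimal sketch of the AlphaZero-style PUCT selection rule. This is an assumption about the general form, not the exact formula in this PR; the function names and the `(q, prior, visits)` tuple layout are hypothetical.

```rust
use std::cmp::Ordering;

/// PUCT score of one child: its mean value `q` plus an exploration bonus
/// that grows with the prior and the parent visit count and shrinks
/// as the child itself is visited. `c_puct` is the exploration constant.
pub fn puct_score(q: f64, prior: f64, parent_visits: u32, child_visits: u32, c_puct: f64) -> f64 {
    let exploration =
        c_puct * prior * (parent_visits as f64).sqrt() / (1.0 + child_visits as f64);
    q + exploration
}

/// Index of the child with the highest PUCT score, or `None` if there
/// are no children. Each child is a `(q, prior, visits)` tuple.
pub fn select_child(children: &[(f64, f64, u32)], parent_visits: u32, c_puct: f64) -> Option<usize> {
    children
        .iter()
        .enumerate()
        .map(|(i, &(q, p, n))| (i, puct_score(q, p, parent_visits, n, c_puct)))
        .max_by(|a, b| a.1.partial_cmp(&b.1).unwrap_or(Ordering::Equal))
        .map(|(i, _)| i)
}
```

With this form, an unvisited child with a high prior is preferred over a heavily visited child with a slightly better mean value.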

Closes #3
Closes #4
Closes #12

Remaining tasks before merging:

Accurate ELO calculation
Tests for archive flow
Tests for training flow (excl. TUI, or just exclude train.rs entirely)
Train stdout in TUI log, archive as well
Shorter performance evaluation
Potentially shorter nodecount, faster games

coveralls · 2021-07-17T13:56:33Z

Coverage increased (+5.7%) to 94.187% when pulling 5dc97e5 on elo-progression into 3d08dfe on master.

codeandkey added 24 commits July 11, 2021 16:13

add dir module

60b8e2a

add WIP train module

0935b53

more model type to constants, add ELO constants

e3097e3

add evolution display, stockfish binaries

84dc43d

massive refactoring

2c6724b

fix some searcher tests

62e9e0e

add reset, position messages to train

6d1b8ac

ui: add reset, position sender methods

76e833f

fix input thread hanging

566ed2d

fix rest of searcher tests, allow arbitrary return type from status closure

3b2642b

main: remove unused import

002a2fd

re-add game result logging

b5679fc

add doc for do_search

5641ad8

implement ELO evaluation

ef1115c

cut learning rate in half

20d5fe2

change PUCT calculation to be more correct, actually add weighted L2 regularization to loss

c1e7602

tree: fix bad type

7f5c4c9

remove unused constants

48c9050

switch optimizer to adadelta, remove L2 norm loss

1742ea7

apply cargo formatting, auto warning fixes

12f87df

cut learning rate, switch to RMSprop optimizer, add L2 norm

3ad7521

add ELO progression to evolution display

8e0189a

change archive flow, avoid directory scanning for generation tracking

d919c9a

reduce game count in ELO evaluation

8524784

codeandkey added 5 commits July 17, 2021 09:14

implement correct ELO calculation

67baa0b

add training stdout to tui log, loss tracking

35dc1cb

add stdout to archive

e2bdcd4

reduce default maxnodes, training batch count and size

c74e87d

remove fs_extra dep

6beb368

codeandkey added 20 commits July 17, 2021 10:04

remove nodes limit during ELO evaluation

fb33553

fix syntax

8eab94a

add explicit time limits in do_search

e7effac

add no_grad guard to torch execution

ffc160f

train: remove unused import

d62dfb3

position: remove unused methods, imports

4ed8f45

dir: fix generation increment

0cf76f9

searcher: remove unused methods

77a2278

searcher: remove unnecessary members

2538189

fix loss archive format

89d0139

max node, time limits optional

6b39ddb

update train for new limit format

cfd0a03

remove default search time limit

5400a38

update ui to show time/node limits when applicable

49b98bf

refactor torch model into separate file

922e7d7

remove special model types from coverage

42b68f4

add more ui tests

53d59bd

fix test without model

ebd3330

remove dir.rs from coverage

50db9a5

The methods in dir.rs depend exclusively on the user data directories and their combinations with other paths; any test would effectively just be a test of the 'dirs' crate.

remove train from coverage

5dc97e5

codeandkey merged commit 86e54fb into master Jul 17, 2021
