WDL value head support #635

Ttl · 2018-12-30T11:41:30Z

WDL head support for all backends. This includes only the backend support and does not yet use the draw information.

Needs updated protobuf: LeelaChessZero/lczero-common#6

Training code for replacing the old value head with a WDL head in an existing old-style network: https://github.com/Ttl/lczero-training/tree/wdl_surgery

~~11248 with WDL value head: http://hforsten.com/leelaz/11248-wdl.pb.gz~~ (Obsolete due to protobuf changes)

Score of lc0_11248_wdl vs lc0_11248: 15 - 5 - 45 [0.577]
Elo difference: 53.88 +/- 46.33, LOS: 98.73 %, DrawRatio: 69.2 %

TC: 10+0.5s on GTX 1050 Ti
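For readers who want to check the reported numbers, the score, Elo difference, and draw ratio above follow from the win/loss/draw tally with the usual logistic Elo formula. The sketch below is only an illustration; `MatchStats` and `SummarizeMatch` are hypothetical names and are not part of lc0 or of the match tool that produced the output. It does not reproduce the error margin or LOS.

```cpp
#include <cassert>
#include <cmath>

// Summary of a match from one side's point of view (hypothetical helper).
struct MatchStats {
  double score;       // (wins + draws / 2) / games
  double elo_diff;    // logistic Elo difference implied by the score
  double draw_ratio;  // draws / games
};

MatchStats SummarizeMatch(int wins, int losses, int draws) {
  const int games = wins + losses + draws;
  assert(games > 0);
  const double score = (wins + 0.5 * draws) / games;
  // The logistic formula is undefined for a 0% or 100% score.
  assert(score > 0.0 && score < 1.0);
  const double elo_diff = -400.0 * std::log10(1.0 / score - 1.0);
  return {score, elo_diff, static_cast<double>(draws) / games};
}
```

Calling `SummarizeMatch(15, 5, 45)` gives a score of about 0.577, an Elo difference of about 53.88, and a draw ratio of about 69.2 %, matching the lines above.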

Ttl · 2018-12-30T11:53:27Z

The strength increase in testing probably comes from the additional training on TB-rescored test10 data rather than from the WDL head itself; I wouldn't expect the WDL head alone to make the network any stronger. All convolutional layers and the policy head were set to non-trainable, so the only difference in this network is the fully connected layers of the value head.

Tests fail because protobuf needs to be updated.

Ttl · 2019-01-28T13:48:04Z

Added the tree search part. A previous commit also included a parameter for adjusting the score of a draw, but I removed it since it didn't gain any Elo in testing. Verbose-move-stats was modified to report WDL scores for testing purposes.

~~I also added a WDL head to a test30 net, which is much better at evaluating draws than the test10 net. It can be downloaded at: http://hforsten.com/leelaz/256x20-32585-wdl-4000.pb.gz~~ (Obsolete due to protobuf changes)
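As background for the tree search change, a WDL head produces three outputs that are normalized into win, draw, and loss probabilities, from which a scalar value Q = W - L and a draw probability D can be derived. The following is a minimal sketch under stated assumptions: the output order (win, draw, loss), the softmax normalization, and the names `WdlEval` and `WdlToQD` are illustrative and are not taken from this PR's code.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cmath>

// Scalar value and draw probability derived from a WDL head (hypothetical type).
struct WdlEval {
  float q;  // W - L, in [-1, 1]
  float d;  // draw probability, in [0, 1]
};

// Assumes the three raw outputs are ordered win, draw, loss.
WdlEval WdlToQD(const std::array<float, 3>& logits) {
  // Subtract the maximum before exponentiating for numerical stability.
  const float m = std::max({logits[0], logits[1], logits[2]});
  const float w = std::exp(logits[0] - m);
  const float d = std::exp(logits[1] - m);
  const float l = std::exp(logits[2] - m);
  const float sum = w + d + l;
  assert(sum > 0.0f);
  return {(w - l) / sum, d / sum};
}
```

Because W + D + L = 1, the pair (Q, D) carries the same information as the full triple: W = (1 + Q - D) / 2 and L = (1 - Q - D) / 2.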

Tilps · 2019-02-01T23:35:59Z

src/mcts/search.cc

        << ") ";

-    oss << "(U: " << std::setw(6) << std::setprecision(5) << edge.GetU(U_coeff)


Verbose stats still needs to print U. (And Q+U is convenient.)

TFiFiE · 2019-02-02T23:27:43Z

This implements #79, right? Like I said there, it would then also be possible to shorten the average training game length by allowing draws by agreement.

Tilps · 2019-02-02T23:35:13Z

Yes, but we can do that as a follow up PR.

TFiFiE · 2019-02-03T01:18:01Z

> Previous commit also includes parameter for adjusting the score of the draw, but I removed since it didn't gain any Elo in testing.

But wouldn't the usefulness of that come from the ability to do stuff like force the engine to play for a draw or for a win or with some other contempt-like factor?

Tilps · 2019-02-03T01:20:16Z

> > Previous commit also includes parameter for adjusting the score of the draw, but I removed since it didn't gain any Elo in testing.
>
> But wouldn't the usefulness of that come from the ability to do stuff like force the engine to play for a draw or for a win or with some other contempt-like factor?

More work can be done on this aspect after the PR is submitted.
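To make the contempt-like idea in this exchange concrete, one possible way to use the draw probability is to shift the value by a configurable draw score. This is purely a sketch of the concept: `ValueWithDrawScore` and its `draw_score` parameter are hypothetical, and nothing here is claimed to match the parameter that was removed from this PR.

```cpp
#include <cassert>

// Expected outcome when a draw is worth `draw_score` instead of 0.
// q is W - L and d is the draw probability; draw_score is assumed to lie in [-1, 1].
float ValueWithDrawScore(float q, float d, float draw_score) {
  assert(d >= 0.0f && d <= 1.0f);
  assert(draw_score >= -1.0f && draw_score <= 1.0f);
  // W * 1 + D * draw_score + L * (-1) = (W - L) + draw_score * D
  return q + draw_score * d;
}
```

With `draw_score = 0` this reduces to the plain Q value; a negative draw score makes drawish positions look worse (playing for a win), and a positive one makes them look better (playing for a draw).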

Ttl · 2019-02-04T02:57:53Z

Needs modifications for protobuf changes in: LeelaChessZero/lczero-common#8

TFiFiE · 2019-02-04T20:09:33Z

Maybe change https://github.com/orgs/LeelaChessZero/projects/1#card-10518034 to point to this instead?

Ttl · 2019-02-10T03:33:35Z

Merged with master and implemented the protobuf changes.

Test network with WDL value head and convolutional policy head: http://hforsten.com/leelaz/128x10-az-pol-map-wdl-200000.pb.gz

borg323 · 2019-02-10T10:08:54Z

Just tested that 32 bit builds work.

lp200 · 2019-02-10T14:32:46Z

IIRC, move_count is currently disabled. Once it is enabled, won't it help judge draws more accurately?

WDL head support

bbfa8eb

Ttl mentioned this pull request Dec 30, 2018

WDL head support LeelaChessZero/lczero-common#6

Closed

Ttl added 3 commits January 1, 2019 07:58

Adjustable draw score

2b47bf5

Report WDL stats

59716fc

Remove drawscore param

eefb951

Tilps reviewed Feb 1, 2019

Ttl closed this Feb 4, 2019

borg323 reopened this Feb 4, 2019

borg323 added the wip Work in progress label Feb 4, 2019

Cyanogenoid mentioned this pull request Feb 9, 2019

Implement V4TrainingData #722

Merged

Ttl added 4 commits February 9, 2019 14:30

Merge branch 'master' into wdl

7bb2594

Backend D fixes, report only D in VerboseMoveStats

394271d

Merge branch 'master' into wdl

03dfe10

Write D to training data

4c7ea28

Ttl removed the wip Work in progress label Feb 10, 2019

Tilps approved these changes Feb 10, 2019

src/mcts/node.h (outdated, resolved)

src/neural/network_random.cc (outdated, resolved)

Ttl added 2 commits February 10, 2019 05:58

Fix 32-bit Node size, random backend D calculation

82e1160

Limit range of D to valid values in random backend

aa56e73

Tilps merged commit 39b85ec into LeelaChessZero:master Feb 10, 2019

QueensGambit mentioned this pull request May 18, 2021

Xiangqi: Show WDL next to the evaluation ml-research/liground#205

Open

QueensGambit mentioned this pull request May 29, 2021

WDLP value head QueensGambit/CrazyAra#123

Merged

yuzisee mentioned this pull request Aug 29, 2023

How does the graph differ from Lichess's? rooklift/nibbler#242

Closed
