Skip to content
This repository has been archived by the owner on Dec 11, 2020. It is now read-only.

Some questions about training a bot #60

Closed
infinitycr opened this issue Jun 6, 2018 · 4 comments
Closed

Some questions about training a bot #60

infinitycr opened this issue Jun 6, 2018 · 4 comments

Comments

@infinitycr
Copy link

I change the network to 64*5 and start 2 clients . One client is running start_server.sh and start_client.sh simultaneously, and another one is running start_client.sh. From the log file I find that there are over 4000 selfplay games after running 40 hours. However, .I cannot find anything about training information. I want to know when will the training start?

@jma127
Copy link
Contributor

jma127 commented Jun 6, 2018

Hi @infinitycr , thanks for your interest! The training will only start once a sufficient number of selfplay games are gathered. The default setting in the start_server script will require roughly 12k-20k games. To scale down this requirement, you can decrease q_min_size.

@jma127 jma127 closed this as completed Jun 6, 2018
@infinitycr
Copy link
Author

@jma127 I find the q_min_size in the script file is 200, so 200 means 12k-20k selfplay games? Could you please explain specificly about the relationship of q_min_size and selfplay games?

@jma127
Copy link
Contributor

jma127 commented Jun 8, 2018

There are 50 queues that store the games, and q_min_size 200 means that each queue must have 200 games before any training will start.

@infinitycr
Copy link
Author

infinitycr commented Jun 8, 2018

@jma127 I'm really appreciated for your reply! I'm confused about some logging information.

[2018-06-06 13:14:16.977] [elfgames::go::TrainCtrl-0] [info] received 3000 records from local-Workstation-1db3-cbb4-5d10-b748, with 32 state updates, 0 records, 0 valid selfplays, and 0 evals

Wed Jun 6 13:14:16 2018, last_identity: local--Workstation-1db3-cbb4-5d10-b748, #msg: 0 #client: 8, Msg count: 3000, avg msg size: 686653, failed count: 0

[2018-06-06 21:30:26.797] [elfgames::go::TrainCtrl-0] [info] received 4000 records from local-699e-247b-e4e-6760, with 32 state updates, 5 records, 5 valid selfplays, and 0 evals

Wed Jun 6 21:30:26 2018, last_identity: local-699e-247b-e4e-6760, #msg: 0 #client: 8, Msg count: 4000, avg msg size: 682586, failed count: 0

These logging information come from the start_server.sh. What's the meaning of '5 valid selfplays'? Does that mean there are only 5 valid selfplay games in 4000 games performed by the client?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants