Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Book with positions according to frequency #9

Merged
merged 1 commit into from
Apr 28, 2020

Conversation

vondele
Copy link
Member

@vondele vondele commented Apr 28, 2020

  • games played on lichess (lichess_db_standard_rated_2020-01.pgn)
  • TC > 60s and WhiteELo > 1800 and BlackElo > 1800 (8917954 games)
  • FENs in all games ordered wrt (approximate) frequency (games analyzed to the first 'novelty' only)

The book contains 200000 positions.

* games played on lichess (lichess_db_standard_rated_2020-01.pgn)
* TC > 60s and WhiteELo > 1800 and BlackElo > 1800 (8917954 games)
* FENs in all games ordered wrt (approximate) frequency (games analyzed to the first 'novelty' only)

The book contains 200000 positions.
@vondele vondele merged commit 5774856 into official-stockfish:master Apr 28, 2020
@xoto10
Copy link

xoto10 commented Apr 28, 2020

So this is 200,000 unique positions, is that right?

Do you have some rough info on the distribution of lengths from startpos? I guess startpos once, a few 1-ply positions, (a few)^2 2-ply posiitons, etc ? Perhaps I've answered that one myself :)

@vondele
Copy link
Member Author

vondele commented Apr 28, 2020

@xoto10 yes, the 200k unique most frequent positions. The epd has them in the order of frequency.

I didn't collect the plies from startpos info, but your idea is roughly right, I think, with the exception of some very long popular lines that will be there much more than the simple scheme suggests.

@vondele
Copy link
Member Author

vondele commented Apr 28, 2020

@xoto10 BTW, this is from lichess games, but it would be trivial to do that for any other public pgn database. You did post the link to lczero games, I had a quick look, but how can we pick good-quality games... some of the pgn's were clearly from the early phase (like mate in 8 from startpos).

@xoto10
Copy link

xoto10 commented Apr 28, 2020

... You did post the link to lczero games, I had a quick look, but how can we pick good-quality games... some of the pgn's were clearly from the early phase (like mate in 8 from startpos).

Ah, I didn't know, I'm not familiar with their project. I think it mentioned "matches" so I thought they might be better games.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants