Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some kind of off-by-one mistake #1

instance01 opened this issue Jul 24, 2020 · 0 comments

Some kind of off-by-one mistake #1

instance01 opened this issue Jul 24, 2020 · 0 comments


Copy link

instance01 commented Jul 24, 2020

Seen on gridworld (5x5 and 8x8). See following example on 8x8:

A good evaluation: 22222122222
A weird off-by-one evaluation: 222221222201010101010101010101010101010101010101010101010101010101010101010101010101010101 010101010101010101010101010101010101010101010101010101010101010101010101010101010101010101 0101010101010101010101010101010101010101010101010101010101010101010101010101

So the last action before the goal goes left and right indefinitely.
I remember having the same issue in the Python implementation of BootlegAlphaZero.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
None yet
None yet

No branches or pull requests

1 participant