New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Atari results #46
Comments
A follow-up question. How to normalize the score? |
@GilgameshD normalized score =100 * (score - random score)/(expert score - random score) |
Hi, I did use those seeds but I’ve realized that there is some additional stochasticity that I have not been able to locate. Really sorry about that! |
@TongZhangTHU |
Hi,
Thanks for your wonderful work. I cannot reproduce the performance reported in the paper for Atari. For example, compared to Table 1, my normalized score for Breakout is 147.738, for Seaquest is 1.875 (averaged over 3 seeds, I use the same seed as this script: https://github.com/kzl/decision-transformer/blob/master/atari/run.sh ) I wonder did you use the same seeds (123, 231, 312) as that script ? Or did I miss something?
The text was updated successfully, but these errors were encountered: