Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Atari results #46

Closed
TongZhangTHU opened this issue Aug 9, 2022 · 4 comments
Closed

Atari results #46

TongZhangTHU opened this issue Aug 9, 2022 · 4 comments

Comments

@TongZhangTHU
Copy link

TongZhangTHU commented Aug 9, 2022

Hi,

Thanks for your wonderful work. I cannot reproduce the performance reported in the paper for Atari. For example, compared to Table 1, my normalized score for Breakout is 147.738, for Seaquest is 1.875 (averaged over 3 seeds, I use the same seed as this script: https://github.com/kzl/decision-transformer/blob/master/atari/run.sh ) I wonder did you use the same seeds (123, 231, 312) as that script ? Or did I miss something?

@GilgameshD
Copy link

A follow-up question. How to normalize the score?

@TongZhangTHU
Copy link
Author

@GilgameshD normalized score =100 * (score - random score)/(expert score - random score)

@lili-chen
Copy link
Collaborator

Hi, I did use those seeds but I’ve realized that there is some additional stochasticity that I have not been able to locate. Really sorry about that!

@yiyeChen
Copy link

yiyeChen commented Sep 7, 2022

@TongZhangTHU
Another follow up question, where to find random score and expert score? I did find a table from here but not sure if everyone is using the same set of parameters.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants