the rollout accuracy in test script is lower than the test accuracy in train script. #6
Comments
Hi @albzni, rollout accuracy does indicate the success rate. I'm not sure what numbers you are getting; however, the success rate numbers over a large sample size are reported at the bottom of the README. Accounting for some randomness, these numbers match Aviv's original implementation.
Hi @kentsommer! Thank you for your comments. Thank you!
The success rate is taken over 5000 randomly generated environments, as noted in the README. The reason for increasing the number of test domains is that it gives a larger sample size and therefore a better indication of actual performance. The higher the number of samples from the full distribution of all possible random environments, the better you can estimate the true performance of the policy.
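To illustrate the sample-size point, here is a minimal sketch that treats each rollout as an independent success/failure trial and shows how the uncertainty of the estimated success rate shrinks as the number of test environments grows. The function names and the 0.99 success rate are hypothetical values chosen for illustration; they are not taken from this repository or its reported results.

```python
import math


def success_rate_std_error(p: float, n: int) -> float:
    """Standard error of a success rate estimated from n independent rollouts."""
    return math.sqrt(p * (1.0 - p) / n)


def confidence_interval_95(p: float, n: int) -> tuple[float, float]:
    """Approximate 95% interval (normal approximation), clipped to [0, 1]."""
    half_width = 1.96 * success_rate_std_error(p, n)
    return max(0.0, p - half_width), min(1.0, p + half_width)


if __name__ == "__main__":
    assumed_rate = 0.99  # hypothetical success rate, for illustration only
    for n in (100, 1000, 5000):
        low, high = confidence_interval_95(assumed_rate, n)
        print(f"n={n:5d}  approx. 95% interval: [{low:.4f}, {high:.4f}]")
```

Under these assumptions the interval narrows roughly with the square root of the number of environments, so a small test set can produce noticeably different success rates from run to run even when the policy is unchanged.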
Thank you so much!
Hello!
I have a quick question. Does the rollout accuracy indicate the success rate? If so, why is it lower than the prediction accuracy? In Aviv's implementation, the success rate on the 8x8 grid world was as high as 99.6%. Why is the success rate in your experiment relatively low?
Thanks!