the rollout accuracy in test script is lower than the test accuracy in train script. #6

albzni · 2018-04-27T14:57:29Z

Hello!

I have a little doubt.Does the rollout accuracy indicate the success rate? If so, why is it lower than the prediction accuracy? In the Aviv's implementation, the success rate of the 8x8 grid world was as high as 99.6%. Why is the success rate in your experiment relatively low?

Thanks!

kentsommer · 2018-05-01T06:32:40Z

Hi @albzni,

Rollout accuracy does indicate the success rate. I'm not sure what numbers you are getting, however, the success rate numbers over a large sample size are reported at the bottom of the README. Accounting for some randomness these numbers match Aviv's original implementation.

albzni · 2018-05-02T04:37:19Z

Hi @kentsommer ! Thank you for your comments.
After reading the README, I still have some questions.In test script,the n_domains=100, and in your results the Success Rate up to 99.69%. Did you average the results of multiple tests? If not, why would the accuracy between 99% and 100% in 100 domains? And does increasing the number of domains can reduce the randomness of the results?

Thank you!

kentsommer · 2018-05-02T04:41:40Z

@albzni

The success rate is taken over 5000 randomly generated environments as noted in the readme.

The reason for increasing the number of test domains is that it gives a larger sample size and therefore a better indication of actual performance. The higher the number of samples from the full distribution of all possible random environments, the better you can estimate the true performance of the policy.

albzni · 2018-05-02T05:44:42Z

Thank you so much!

albzni closed this as completed May 2, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the rollout accuracy in test script is lower than the test accuracy in train script. #6

the rollout accuracy in test script is lower than the test accuracy in train script. #6

albzni commented Apr 27, 2018

kentsommer commented May 1, 2018

albzni commented May 2, 2018

kentsommer commented May 2, 2018 •

edited

albzni commented May 2, 2018

the rollout accuracy in test script is lower than the test accuracy in train script. #6

the rollout accuracy in test script is lower than the test accuracy in train script. #6

Comments

albzni commented Apr 27, 2018

kentsommer commented May 1, 2018

albzni commented May 2, 2018

kentsommer commented May 2, 2018 • edited

albzni commented May 2, 2018

kentsommer commented May 2, 2018 •

edited