Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about generate_starts function for reverse curriculum #7

Closed
tldoan opened this issue Sep 27, 2018 · 1 comment
Closed

Question about generate_starts function for reverse curriculum #7

tldoan opened this issue Sep 27, 2018 · 1 comment

Comments

@tldoan
Copy link

tldoan commented Sep 27, 2018

Hi,

According to your paper you apply a brownian motion to generate new seeds states (Normal 0 variance 1 ) but according to this line
https://github.com/florensacc/rllab-curriculum/blob/master/curriculum/envs/start_env.py#L233

It seems that you are applying a random uniform action with range env.action_space.bounds for the AntMaze Environment.

Can you explain why the action is not a N(0,I) ??

Thank you very much.

@florensacc
Copy link
Owner

Hi,
you are right, we apply random uniform actions. Given that the action bounds are finite and known, the maximum entropy policy is the uniform over that interval. We also tried applying a N(0,I) action and the results were essentially the same. It is unfortunate that this published code has a random walk that is not exactly "Brownian". Anyone interested in verifying that the results are exactly the same under Brownian motion can change the line of code you point out. Thanks for spotting this detail!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants