baseline problem in the vanilla policy gradient example #2

Closed
flyers opened this issue Apr 30, 2016 · 1 comment

flyers commented Apr 30, 2016

Hi everyone,
While running the rllab/examples/vpg_2.py example, I found that the baseline prediction is always a zero vector. It seems we need to explicitly call baseline.fit before calling baseline.predict. @dementrock
Thanks.

@dementrock
Member

Ahh yes. Your interpretation is correct & thanks for reporting the issue! I will push a fix to the master branch shortly.
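
For readers following along, here is a minimal sketch of the ordering problem described above. It is not rllab's code or the actual fix: `TimeStepMeanBaseline`, `discounted_returns`, and the `rewards`/`returns` path layout are hypothetical stand-ins, and the assumption that an unfitted baseline predicts zeros is taken only from the behavior reported in this issue.

```python
# Hypothetical stand-in for a baseline with fit/predict methods.
# Assumption: like the behavior reported in this issue, predict() yields
# zeros until fit() has been called at least once.
class TimeStepMeanBaseline:
    def __init__(self):
        self._means = None  # no statistics until fit() runs

    def fit(self, paths):
        # Average the empirical returns at each time step across all paths.
        horizon = max(len(p["returns"]) for p in paths)
        sums = [0.0] * horizon
        counts = [0] * horizon
        for p in paths:
            for t, ret in enumerate(p["returns"]):
                sums[t] += ret
                counts[t] += 1
        self._means = [s / c for s, c in zip(sums, counts)]

    def predict(self, path):
        n = len(path["rewards"])
        if self._means is None:
            return [0.0] * n  # unfitted: contributes nothing
        # Time steps beyond the fitted horizon fall back to zero.
        return [self._means[t] if t < len(self._means) else 0.0
                for t in range(n)]


def discounted_returns(rewards, discount):
    # Reward-to-go: R_t = r_t + discount * R_{t+1}, computed backwards.
    returns = []
    running = 0.0
    for r in reversed(rewards):
        running = r + discount * running
        returns.append(running)
    returns.reverse()
    return returns


# Two toy trajectories (made-up numbers, for illustration only).
paths = [{"rewards": [1.0, 1.0, 1.0]}, {"rewards": [3.0, 3.0, 3.0]}]
for p in paths:
    p["returns"] = discounted_returns(p["rewards"], discount=1.0)

baseline = TimeStepMeanBaseline()

# Calling predict() before fit() reproduces the symptom: a zero vector.
unfitted = baseline.predict(paths[0])

# Fitting first gives a useful baseline to subtract from the returns.
baseline.fit(paths)
fitted = baseline.predict(paths[0])
advantages = [ret - b for ret, b in zip(paths[0]["returns"], fitted)]
```

With an unfitted baseline the advantages equal the raw returns, so the gradient estimate stays valid but loses the variance reduction the baseline is meant to provide.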

@flyers flyers closed this as completed Apr 30, 2016
dementrock added a commit that referenced this issue May 1, 2016
alexbeloi pushed a commit to alexbeloi/rllab that referenced this issue Jun 23, 2016