You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Under the heading Constructing the computational graph, a surrogate function is defined and it's gradient taken. However, isn't the surrogate function itself the gradient? This is as per the pseudo code given in the Preliminary section. Why do we need to compute the gradient again?
Kindly update the documentation or clarify. Thanks!
The text was updated successfully, but these errors were encountered:
Hello. I've just started using rllab, and went through the documentation today. The example of REINFORCE given here appears to be incorrect, or I don't fully understand the implementation.
https://rllab.readthedocs.io/en/latest/user/implement_algo_basic.html
Under the heading Constructing the computational graph, a surrogate function is defined and it's gradient taken. However, isn't the surrogate function itself the gradient? This is as per the pseudo code given in the Preliminary section. Why do we need to compute the gradient again?
Kindly update the documentation or clarify. Thanks!
The text was updated successfully, but these errors were encountered: