-
It is not difficult to implement naive policy gradient in DI-engine, but may I ask why you need it? We have not added naive PG so far because it performs poorly in most environments.
-
I have implemented REINFORCE in #544.
-
The REINFORCE algorithm is a classical policy gradient method. It has been implemented in some other RL libraries, such as RLlib and Tianshou. I wonder if we can also have it in DI-engine.
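For context, the core of REINFORCE is small: each action's log-probability is weighted by the discounted return that followed it. Below is a minimal sketch in plain Python. It is not DI-engine's API, and the names `discounted_returns` and `reinforce_loss` are hypothetical, chosen only for illustration.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * r_{t+1} + ... for each step of one episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, returns):
    """Surrogate loss -mean(log pi(a_t | s_t) * G_t).

    Differentiating this with respect to the policy parameters gives the
    REINFORCE gradient estimate. No baseline is subtracted here, which is
    one reason the naive version tends to have high variance.
    """
    assert len(log_probs) == len(returns)
    weighted = [lp * g for lp, g in zip(log_probs, returns)]
    return -sum(weighted) / len(weighted)


# Toy episode: three steps, reward 1 each, every action taken with probability 0.5.
rewards = [1.0, 1.0, 1.0]
log_probs = [math.log(0.5)] * 3
returns = discounted_returns(rewards, gamma=0.5)
loss = reinforce_loss(log_probs, returns)
```

In a real implementation the log-probabilities would come from a differentiable policy network, and the loss would be minimized with an autodiff framework.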