Skip to content

Latest commit

 

History

History
6 lines (4 loc) · 242 Bytes

File metadata and controls

6 lines (4 loc) · 242 Bytes

Reinforcement-Learning-with-Policy-Gradients

Using tensorFlow and experiment with Policy Gradient methods for a simple Reinforcement Learning problem using the OpenAI Gym framework.

Cartpole-v0 is used implemented by Rich Sutton et al