Additional Readings These are optional readings if you want to go deeper. Introduction to Policy Optimization Part 3: Intro to Policy Optimization - Spinning Up documentation Policy Gradient https://johnwlambert.github.io/policy-gradients/ RL - Policy Gradient Explained Chapter 13, Policy Gradient Methods; Reinforcement Learning, an introduction by Richard Sutton and Andrew G. Barto Implementation PyTorch Reinforce implementation Implementations from DDPG to PPO