This is a Pytorch implementation of Proximal Policy Optimization as described in this paper.
The implementation used in this repo was used as a reference for this implementation.
To run a demo, clone the repo and use the command:
python simulate.py
The training plots are shown below: