Proximal Policy Optimization

An implementation of PPO in TensorFlow, covering both the clipped surrogate objective and the KL-divergence penalty. It reproduces the results reported in the PPO paper on some MuJoCo tasks.
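
For orientation, the two objectives named above can be sketched in a few lines of framework-free Python. This is a minimal illustration, not the code in this repository: the function names, the `clip_eps` and `beta` parameters and their defaults, and the sample-based KL estimate are all assumptions made for the example.

```python
import math


def clipped_surrogate(logp_new, logp_old, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective (to be maximized).

    Inputs are per-sample log-probabilities of the taken actions under the
    new and old policies, plus advantage estimates. clip_eps is an assumed
    default, not a value taken from this repository.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(lp_new - lp_old)  # pi_new(a|s) / pi_old(a|s)
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic bound: take the smaller of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


def kl_penalized_surrogate(logp_new, logp_old, advantages, beta=1.0):
    """Mean unclipped surrogate minus beta times an estimate of KL(old || new).

    The KL term is estimated from samples drawn under the old policy as the
    mean of (log pi_old - log pi_new); an exact per-state KL could be used
    instead when the action distribution is available in closed form.
    """
    n = len(advantages)
    surrogate = sum(
        math.exp(lp_new - lp_old) * adv
        for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages)
    ) / n
    kl_estimate = sum(
        lp_old - lp_new for lp_new, lp_old in zip(logp_new, logp_old)
    ) / n
    return surrogate - beta * kl_estimate


if __name__ == "__main__":
    # Two toy samples: the old policy gave each action probability 0.5.
    logp_old = [math.log(0.5), math.log(0.5)]
    logp_new = [math.log(0.75), math.log(0.25)]
    advantages = [1.0, -1.0]
    print(clipped_surrogate(logp_new, logp_old, advantages))
    print(kl_penalized_surrogate(logp_new, logp_old, advantages))
```

In a TensorFlow implementation the same arithmetic would typically be written with tensor operations and negated to form a loss for the optimizer.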

Acknowledgements

This repository is a blend of the PyTorch repository mjrl and OpenAI Baselines.
