GitHub - Abhipanda4/PPO-PyTorch: Implementation of Proximal Policy Optimization(PPO)

This is a Pytorch implementation of Proximal Policy Optimization as described in this paper.

The implementation used in this repo was used as a reference for this implementation.

To run a demo, clone the repo and use the command: python simulate.py

The training plots are shown below:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
images		images
saved_models		saved_models
.gitignore		.gitignore
README.md		README.md
constants.py		constants.py
environment.py		environment.py
model.py		model.py
ppo.py		ppo.py
replay_memory.py		replay_memory.py
running_state.py		running_state.py
simulate.py		simulate.py
train.py		train.py
utils.py		utils.py

Provide feedback