Pytorch-DPPO

PyTorch implementation of Distributed Proximal Policy Optimization (https://arxiv.org/abs/1707.02286), using PPO with the clipped surrogate loss (from https://arxiv.org/pdf/1707.06347.pdf).
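For orientation, the clipped surrogate loss from the PPO paper can be sketched in a few lines of PyTorch. This is a minimal illustration and not necessarily how this repository computes it: the function name `ppo_clip_loss`, the default `clip_eps=0.2`, and the toy numbers are all assumptions made for the example.

```python
import torch


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Clipped surrogate loss from the PPO paper, returned as a value to minimize.

    Hypothetical helper: clip_eps=0.2 is an illustrative default, not a
    value taken from this repository.
    """
    # Probability ratio pi_new(a|s) / pi_old(a|s), computed in log space.
    ratio = torch.exp(new_log_probs - old_log_probs.detach())
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Keep the pessimistic (smaller) objective per sample, then negate it.
    return -torch.min(unclipped, clipped).mean()


# Toy batch of three transitions (made-up numbers).
old_lp = torch.log(torch.tensor([0.5, 0.5, 0.5]))
new_lp = torch.log(torch.tensor([0.5, 0.75, 0.25])).requires_grad_()
adv = torch.tensor([1.0, 1.0, -1.0])

loss = ppo_clip_loss(new_lp, old_lp, adv)
loss.backward()
```

In this toy batch the second and third samples have ratios outside the clip range in the direction the advantage favors, so the clipped branch is selected and they contribute zero gradient. Only the first sample, whose ratio is exactly 1, pushes the policy.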

Work in progress.

#req
Python 3.6, PyTorch, CMake

#install
On Windows, use cmake-gui and Visual Studio to build.
On Linux, run ./build.sh to build.

#run
On Linux: ./run_with_log.sh
On Windows: python .\main.py cppSimulator

Acknowledgments

Hyperparameters and loss computation have been taken from https://github.com/openai/baselines
