Parallel PPO-PyTorch

A parallel-agent training version of Proximal Policy Optimization (PPO) with a clipped objective.
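For reference, here is a minimal PyTorch sketch of the clipped surrogate objective that PPO optimizes, written as a loss to minimize (variable names are illustrative, not the repository's actual code):

```python
import torch

def ppo_clipped_loss(new_logprobs, old_logprobs, advantages, clip_eps=0.2):
    """Clipped surrogate objective (Schulman et al., 2017), negated into a loss."""
    # Probability ratio r_t(theta) = pi_theta(a|s) / pi_theta_old(a|s)
    ratios = torch.exp(new_logprobs - old_logprobs.detach())
    # Unclipped and clipped surrogate terms
    surr1 = ratios * advantages
    surr2 = torch.clamp(ratios, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Take the pessimistic (minimum) bound and negate to obtain a loss
    return -torch.min(surr1, surr2).mean()
```

Clipping the probability ratio to [1 − ε, 1 + ε] and taking the minimum of the two terms keeps each policy update conservative.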

Usage

  • To test a pre-trained network: run test.py
  • To train a new network: run parallel_PPO.py
  • All hyperparameters are defined in the main function of parallel_PPO.py (a sketch of typical values is shown below)
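For orientation, this is the kind of hyperparameter block a PPO training script typically exposes in its main function. The names and values here are illustrative assumptions, not the repository's actual settings:

```python
# Illustrative PPO hyperparameters (assumed names/values, not the repo's actual code).
env_name = "LunarLander-v2"   # gym environment id
n_workers = 4                 # number of parallel agents collecting experience
max_episodes = 50000          # upper bound on training episodes
update_timestep = 2000        # run a PPO update every n collected timesteps
K_epochs = 4                  # optimization epochs per PPO update
eps_clip = 0.2                # clip parameter for the surrogate objective
gamma = 0.99                  # discount factor
lr = 0.002                    # learning rate for the optimizer
```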

Results

Trained agent demos (GIFs in the repository): CartPole-v1 (cartpole) and LunarLander-v2 (lander).

Dependencies

Trained and tested on:

  • Python 3.6
  • PyTorch 1.3
  • NumPy 1.15.3
  • gym 0.10.8
  • Pillow 5.3.0

TODO

  • Implement conv-net-based training (see the illustrative sketch below)
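As a rough illustration of that TODO item, a conv-net actor-critic for image observations might look like the following. This is a sketch under assumed input sizes (stacked 84x84 frames), not code from this repository:

```python
import torch
import torch.nn as nn

class ConvActorCritic(nn.Module):
    """Illustrative conv-net actor-critic for 84x84 image observations."""

    def __init__(self, n_actions, in_channels=4):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
            nn.Linear(64 * 7 * 7, 512), nn.ReLU(),
        )
        self.actor = nn.Linear(512, n_actions)   # action logits
        self.critic = nn.Linear(512, 1)          # state value

    def forward(self, obs):
        x = self.features(obs)
        # Return an action distribution and the state-value estimate
        return torch.distributions.Categorical(logits=self.actor(x)), self.critic(x)
```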

Setting up Conda Environment

  • conda env export | grep -v "^prefix: " > environment.yml to export the current environment to environment.yml
  • conda env create -f environment.yml to create the conda environment used for training
