PyRL (pronounced "Parallel") is a PyTorch deep reinforcement learning library focusing on reproducibility and readability.
Our philosophy is to respect all the details of the original papers. We aim to keep the code highly readable by following the papers' pseudocode and implementing it in PyTorch.
Let's make deep RL easier to start!
Currently, PyRL includes implementations of:
- DQN (Nature 2015)
- A3C (ICML 2016)
- DDPG (ICLR 2016, on the list)
- PPO (arXiv 2017, ongoing)
- RAINBOW (AAAI 2018, on the list)
More methods may be included in the future.
PyRL requires:
- Python 3.6
- PyTorch 1.0.1
- OpenAI Gym 0.10.9
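A quick way to confirm these requirements is a small check script. This is a sketch only (PyRL does not ship such a script), and the exact version strings are the ones listed above:

```python
import sys

def check_requirements():
    """Return a list of problems with the current environment (empty if OK)."""
    issues = []
    if sys.version_info < (3, 6):
        issues.append("Python >= 3.6 required")
    # Expected versions from the README; adjust if you use newer releases.
    for pkg, wanted in [("torch", "1.0.1"), ("gym", "0.10.9")]:
        try:
            mod = __import__(pkg)
            found = getattr(mod, "__version__", wanted)
            if found != wanted:
                issues.append(f"{pkg} {wanted} expected, found {found}")
        except ImportError:
            issues.append(f"{pkg} not installed")
    return issues
```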
To train an agent using A3C, go to the root directory and simply run:
python -m a3c.main
which trains an agent to solve the Atari game Breakout with random seed 0.
Results will be saved in res/BreakoutNoFrameskip-v4_a3c_0/ by default.
To evaluate the trained A3C model, run:
python -m a3c.eval
More options can be found in the command-line arguments (see main.py and eval.py).
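The argument handling in main.py might look roughly like the sketch below. The flag names here (`--env-id`, `--seed`, `--save-dir`) are hypothetical illustrations, not PyRL's actual options; check main.py for the real ones:

```python
import argparse

def get_args(argv=None):
    """Parse command-line options (a sketch of what a3c/main.py might do)."""
    parser = argparse.ArgumentParser(description="A3C training (sketch)")
    parser.add_argument("--env-id", default="BreakoutNoFrameskip-v4",
                        help="Gym environment id")
    parser.add_argument("--seed", type=int, default=0,
                        help="random seed")
    parser.add_argument("--save-dir", default="res/",
                        help="directory for logs and models")
    return parser.parse_args(argv)

# e.g. `python -m a3c.main --seed 1` would yield args.seed == 1
args = get_args(["--seed", "1"])
```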
The PyRL package contains:
- method packages, e.g. a3c/
- environment packages, envs/
- common package, common/
- result directory, res/
- plot script, plot.ipynb
Method packages and environment packages are designed to work independently. Therefore, to use A3C on Atari games, you only need a3c/ and envs/atari/, together with common/, res/, and plot.ipynb.
Each method package (e.g. a3c/) contains:
- main.py, main file for training
- model.py, defines the network architecture
- eval.py, evaluates the performance of the saved model
- other files
Environment packages (envs/) contain:
- atari/, atari games environment based on OpenAI Gym
Common package (common/) contains:
- logger.py, logger to log training and evaluation data
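Such a logger can be as simple as the sketch below. This is an assumed API, not PyRL's actual one, and the record format (wall-clock time, step, reward) is illustrative:

```python
import os
import time

class Logger:
    """Append timestamped scalar records to a text file (a minimal sketch)."""

    def __init__(self, path):
        # Create the result directory if it does not exist yet.
        os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
        self.f = open(path, "a")
        self.start = time.time()

    def log(self, step, reward):
        # One record per line: elapsed seconds, env step, episode reward.
        self.f.write(f"{time.time() - self.start:.1f}\t{step}\t{reward}\n")
        self.f.flush()

    def close(self):
        self.f.close()
```

In this scheme, main.py would log training progress to a file such as res/BreakoutNoFrameskip-v4_a3c_0/test.txt.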
The result directory (res/) follows the naming rule:
[env_id]_[method]_[seed]
e.g. BreakoutNoFrameskip-v4_a3c_0, which contains:
- test.txt, logging file during training, generated by main.py
- model.dat, best saved model during training, generated by main.py
- eval.txt, logging file during evaluation, generated by eval.py
The plot script (plot.ipynb) plots learning curves from the result directory, averaged across different random seeds.
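Averaging across seeds can be sketched as below. The log format assumed here (whitespace-separated columns with the episode reward last) and the helper names are illustrative; the actual test.txt layout in PyRL may differ:

```python
import glob
import numpy as np
import matplotlib
matplotlib.use("Agg")  # render without a display
import matplotlib.pyplot as plt

def average_curves(curves):
    """Truncate all runs to the shortest one, then average point-wise."""
    n = min(len(c) for c in curves)
    return np.mean([np.asarray(c)[:n] for c in curves], axis=0)

def plot_env(env_id, method="a3c"):
    # Collect test.txt from every seed directory matching the naming rule.
    curves = []
    for path in sorted(glob.glob(f"res/{env_id}_{method}_*/test.txt")):
        data = np.loadtxt(path)
        curves.append(data[:, -1])  # assumed: last column is episode reward
    mean = average_curves(curves)
    plt.plot(np.arange(len(mean)), mean,
             label=f"{method} ({len(curves)} seeds)")
    plt.xlabel("record")
    plt.ylabel("episode reward")
    plt.legend()
    plt.savefig(f"{env_id}_{method}.png")
```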
For more details on a specific method, please refer to the README in each method package.
To cite this repository in publications:
@misc{PyRL,
  author       = {Yuezhang, Liu},
  title        = {PyRL},
  year         = {2019},
  publisher    = {GitHub},
  journal      = {GitHub repository},
  howpublished = {\url{https://github.com/liuyuezhangadam/pyrl}},
}