OpenAI Gym PyTorch

OpenAI's Gym is an open source toolkit containing several environments which can be used to compare reinforcement learning algorithms and techniques in a consistent and repeatable manner, easily allowing developers to benchmark their solutions.

This repository aims to create a simple one-stop location for testing reinforcement learning models without worrying about configuring or maintaining the environment. Featuring extensive command-line parameters, various tweaks to settings can easily be made to determine an optimal configuration for a particular environment and model.

Setting up the repository

Creating a virtual environment

After cloning the repository, it is highly recommended to install a virtual environment (such as virtualenv) or Anaconda to isolate the dependencies of this project with other system dependencies.

To install virtualenv, simply run

pip install virtualenv

Once installed, a new virtual environment can be created by running

virtualenv env

This will create a virtual environment in the env directory in the current working directory. To change the location and/or name of the environment directory, change env to the desired path in the command above.

To enter the virtual environment, run

source env/bin/activate

You should see (env) at the beginning of the terminal prompt, indicating the environment is active. Again, replace env with your desired directory name.

To get out of the environment, simply run

deactivate

Installing Dependencies

While the virtual environment is active, install the required dependencies by running

pip -r requirements.txt

This will install all of the dependencies at specific versions to ensure they are compatible with one another.

Training a model

To train a model, use the train.py script and specify any parameters that need to be changed, such as the environment or epsilon decay factors. A list of the default values for every parameters can be found by running

python train.py --help

If you desire to run with the default settings, execute the script directly with

python train.py

The script will train the default environment over a set number of episodes and display the training progress after the conclusion of every episode. The updates indicate the episode number, the reward for the current episode, the best reward the model has achieved so far, a rolling average of the previous 100 episode rewards, and the current value for epsilon.

Any time the model reaches a new best rolling average, the current model weights are saved as a .dat file with the environment's name (such as PongNoFrameskip-v4.dat). This saved model will overwrite any existing model weight files for the same environment.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.github		.github
core		core
media		media
models		models
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenAI Gym PyTorch

Setting up the repository

Creating a virtual environment

Installing Dependencies

Training a model

About

Releases

Packages

Contributors 2

Languages

License

roclark/openai-gym-pytorch

Folders and files

Latest commit

History

Repository files navigation

OpenAI Gym PyTorch

Setting up the repository

Creating a virtual environment

Installing Dependencies

Training a model

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages