Skip to content

podgorki/rl_demos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a repo for demonstrating policy proximal optimization (PPO) on a few openai gym reinforcement learning (RL) environments at The Australian Institute for Machine Learning (AIML) https://www.adelaide.edu.au/aiml/

The code can run in its own environment but so that it is reproducible it is recommended to use the docker image.

Install

There are two ways to use this repo, 1. create a venv and run it or 2. use the docker container (recommended)

Conda Env

Install swig: sudo apt-get install swig

  • Create and environment: conda env create -f environment.yml
  • Activate it: conda activate rl_demos

Docker

Usage

To run: python rl_demos/demos/rl_demo --gym_env {environment name} --demo_type {discrete/continuous}

i.e. to run the Acrobot: python rl_demos/demos/rl_demo --gym_env Acrobot-v1 --demo_type discrete

To change demos, edit the relevant config file and change the type i.e. Acrobot-v1

(Docker) To run:

  1. First clone this repo
  2. Build the dockerfile
  3. run docker-compose up

Different environments can be run by changing the command args in the docker-compose.yml

Note: To kill the demo hit 'q'

Environments

Discrete PPO

Working demos:

Continuous PPO

Working demos:

About

A repo for some rl demos for AIML

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published