What is in this repo?

A Tensorflow (r0.7) based implementation of Asynchronous Methods for Deep Reinforcement Learning.

How to run the algorithms (MacOSX for now)?

(1) Clone this repo at ~/some-path.

(2) Make sure your machine has docker installed. Follow instructions [here] (https://docs.docker.com/engine/installation/#install-docker-engine) if not.

(3) Make sure you have xquartz installed in order to visualise game play. Do the following in a separate terminal window:

$ brew cask install --force xquartz
$ open -a XQuartz
$ socat TCP-LISTEN:6000,reuseaddr,fork UNIX-CLIENT:\"$DISPLAY\"

(4) Get our docker image containing all dependencies to run the algorithms and to visualise game play.

$ docker pull restrd/tensorflow-atari-cpu

(5) Run the docker image. This will mount your home folder to /your-user-name inside the container. Be sure to give a name to the container: <container-name>

$ docker run -d -p 8888:8888 -p 6006:6006 --name <container-name> -v ~/:/root/$usr -e DISPLAY=$(ifconfig vboxnet0 | awk '$1 == "inet" {gsub(/\/.*$/, "", $2); print $2}'):0 -it docker.io/restrd/tensorflow-atari-cpu

(6) Shell into the container.

$ docker exec -it <container-name> /bin/bash

(7) Go to the algorithms folder (/your-user-name/some-path/async-deep-rl/algorithms) and choose which algorithm to run via the configuration options in main.py.

(8) Run the algorithms, e.g.:

$ python main.py beam_rider ../atari_roms/ 1 &

Running TensorBoard

You can also run TensorBoard to visualise losses and game scores.

(1) Run tensorboard from within the container:

$ tensorboard --logdir=/tmp/summary_logs/ &

(2) Get the ip address of your docker host running inside of [VirtualBox] (https://www.virtualbox.org/). Go to http://<docker-host-ip>:6006

Convergence issues (May 9, 2016)

The implementation is still in flux. We are still trying to get these algorithms to converge. We are using Python threading, which may (or may not?) be disrupting the Hogwild!ness of the algorithms.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
algorithms		algorithms
atari_roms		atari_roms
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

algorithms

algorithms

atari_roms

atari_roms

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

What is in this repo?

How to run the algorithms (MacOSX for now)?

Running TensorBoard

Convergence issues (May 9, 2016)

About

Releases

Packages

Languages

License

falcondai/async-deep-rl

Folders and files

Latest commit

History

Repository files navigation

What is in this repo?

How to run the algorithms (MacOSX for now)?

Running TensorBoard

Convergence issues (May 9, 2016)

About

Resources

License

Stars

Watchers

Forks

Languages