Introduction

This is an open-source project for experimenting with deep reinforcement learning algorithms in OpenAI Gym environments with continuous action domains (e.g. robot control tasks). It is part of my Master's thesis, which focuses on model-based deep reinforcement learning. It also includes TensorFlow implementations of the Deep Deterministic Policy Gradient (DDPG) algorithm and Prioritized Experience Replay.
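To make the idea of prioritized experience replay concrete, here is a minimal sketch of proportional prioritization in plain Python. It is not the code from this repository: the class name `PrioritizedReplay`, its methods, and the default values of `alpha`, `beta`, and `eps` are assumptions chosen for illustration, and the list-based sampling is O(N) rather than the sum-tree usually used in practice. It assumes Python 3.6+ (for `random.choices`).

```python
import random


class PrioritizedReplay:
    """Toy proportional prioritized replay (hypothetical, list-based)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha  # 0 = uniform sampling, 1 = fully proportional
        self.eps = eps      # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0        # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions get the current maximum priority (1.0 if empty),
        # so they are likely to be sampled before their TD error is known.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        # Sampling probability P(i) = p_i^alpha / sum_k p_k^alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        n = len(self.data)
        idx = rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights (N * P(i))^-beta, normalised by the
        # largest weight in the batch so that all weights lie in (0, 1].
        weights = [(n * probs[i]) ** (-beta) for i in idx]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        return idx, [self.data[i] for i in idx], weights

    def update(self, indices, td_errors):
        # After a training step, set each priority to |TD error| + eps.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

A training loop would call `sample`, scale each sample's loss by its weight, and then pass the resulting TD errors back through `update`.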

Deep Model Learning Actor-Critic

DMLAC is a novel model-based, off-policy, actor-critic deep reinforcement learning algorithm inspired by Dyna-MLAC. It is designed for deterministic environments with continuous action domains. DMLAC learns a model of the environment from the experience gathered while interacting with it. The policy is learned using the model in an actor-model-critic setting, and the model is also used for n-step temporal-difference learning of the value function. The policy, model, and value functions are approximated with fully connected neural networks and trained on minibatches sampled from a prioritized experience replay.
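As an illustration of how a learned deterministic model can supply an n-step temporal-difference target, the sketch below rolls the model forward under the current policy and bootstraps from the value function at the final imagined state. This is a generic sketch, not the implementation in dmlac.py. The function name `n_step_model_target` and the toy `policy`, `model`, and `value` functions are hypothetical, and it assumes that the model predicts both the next state and the reward and that every one of the n steps is imagined by the model.

```python
def n_step_model_target(state, policy, model, value, n, gamma=0.99):
    """Return sum_{k<n} gamma^k * r_k + gamma^n * V(s_n), where the
    trajectory s_1..s_n is imagined with a deterministic model."""
    target = 0.0
    discount = 1.0
    s = state
    for _ in range(n):
        a = policy(s)
        s, r = model(s, a)  # assumed: model returns (next_state, reward)
        target += discount * r
        discount *= gamma
    # Bootstrap from the value estimate of the last imagined state.
    return target + discount * value(s)


# Toy stand-ins for the three learned networks (hypothetical).
def policy(s):
    return -0.5 * s


def model(s, a):
    next_s = s + a
    return next_s, -next_s ** 2


def value(s):
    return -s ** 2


td_target = n_step_model_target(1.0, policy, model, value, n=2, gamma=0.5)
```

In an actual training step the value network would be regressed toward `td_target` for each state in the minibatch.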

Installation

(Tested with Python 2.7.12 + Ubuntu 16.04 + TensorFlow 1.1 + CUDA 8.0)

WARNING: To render OpenAI Gym environments inside Jupyter notebooks, you have to install the NVIDIA drivers with the --no-opengl-files option, e.g. ./NVIDIA-Linux-x86-375.39.run --no-opengl-files. If you already have NVIDIA drivers installed together with their OpenGL libraries, you have to uninstall them first.

1. Install TensorFlow with GPU support: https://www.tensorflow.org/install/install_linux
2. Install dependencies: apt-get install -y python-numpy python-dev cmake zlib1g-dev libjpeg-dev xvfb libav-tools xorg-dev python-opengl libboost-all-dev libsdl2-dev swig
3. Install OpenAI Gym: pip install gym[all] (see https://gym.openai.com/docs)
4. Install Jupyter: pip install jupyter
5. Launch the Jupyter notebook server with a virtual screen buffer: xvfb-run -s "-screen 0 1400x900x24" jupyter notebook
6. Open the Notebook Dashboard in a web browser (http://localhost:8888 by default) and run the .ipynb file of your choice (see http://jupyter.readthedocs.io/en/latest/running.html#running)
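Before opening a notebook, it can help to confirm that the Python packages from the steps above are importable. The helper below is a small sketch for that purpose. The function name `check_packages` is hypothetical and not part of this repository, and it only tests that each import succeeds, not that the GPU or rendering setup works.

```python
import importlib


def check_packages(names):
    """Return {name: True/False} for whether each package can be imported."""
    status = {}
    for name in names:
        try:
            importlib.import_module(name)
            status[name] = True
        except Exception:
            # A broken install can fail with errors other than ImportError
            # (e.g. a missing shared library), so catch broadly here.
            status[name] = False
    return status
```

For example, check_packages(["numpy", "tensorflow", "gym", "jupyter"]) returns a dictionary in which any package mapped to False still needs to be installed or repaired.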

schliffen/reinforcement-learning
