DeepQNetwork

Name		Name	Last commit message	Last commit date
parent directory ..
DQN.py		DQN.py
DQNModel.py		DQNModel.py
README.md		README.md
atari.py		atari.py
breakout.jpg		breakout.jpg
common.py		common.py
curve-breakout.png		curve-breakout.png
expreplay.py		expreplay.py

README.md

video demo

Reproduce the following reinforcement learning methods:

Nature-DQN in: Human-level Control Through Deep Reinforcement Learning
Double-DQN in: Deep Reinforcement Learning with Double Q-learning
Dueling-DQN in: Dueling Network Architectures for Deep Reinforcement Learning
A3C in Asynchronous Methods for Deep Reinforcement Learning. (I used a modified version where each batch contains transitions from different simulators, which I called "Batch-A3C".)

Claimed performance in the paper can be reproduced, on several games I've tested with.

On one TitanX, Double-DQN took 1 day of training to reach a score of 400 on breakout game. Batch-A3C implementation only took <2 hours. (Both are trained with a larger network noted in the code).

Double-DQN runs at 60 batches (3840 trained frames, 240 seen frames, 960 game frames) per second on TitanX.

How to use

Download an atari rom to $TENSORPACK_DATASET/atari_rom/ (defaults to ~/tensorpack_data/atari_rom/), e.g.:

mkdir -p ~/tensorpack_data/atari_rom
wget https://github.com/openai/atari-py/raw/master/atari_py/atari_roms/breakout.bin -O ~/tensorpack_data/atari_rom/breakout.bin

Start Training:

./DQN.py --rom breakout.bin
# use `--algo` to select other DQN algorithms. See `-h` for more options.

Watch the agent play:

./DQN.py --rom breakout.bin --task play --load trained.model

A pretrained model on breakout can be downloaded here.

A3C code and models for Atari games in OpenAI Gym are released in examples/A3C-Gym

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DeepQNetwork

DeepQNetwork

DQN.py

DQN.py

DQNModel.py

DQNModel.py

README.md

README.md

atari.py

atari.py

breakout.jpg

breakout.jpg

common.py

common.py

curve-breakout.png

curve-breakout.png

expreplay.py

expreplay.py

README.md

How to use

Files

DeepQNetwork

Directory actions

More options

Directory actions

More options

Latest commit

History

DeepQNetwork

Folders and files

parent directory

How to use