Skip to content

Latest commit

 

History

History
 
 

DeepQNetwork

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

breakout

video demo

Reproduce the following reinforcement learning methods:

Claimed performance in the paper can be reproduced, on several games I've tested with.

DQN

On one TitanX, Double-DQN took 1 day of training to reach a score of 400 on breakout game. Batch-A3C implementation only took <2 hours. (Both are trained with a larger network noted in the code).

Double-DQN runs at 60 batches (3840 trained frames, 240 seen frames, 960 game frames) per second on TitanX.

How to use

Download an atari rom to $TENSORPACK_DATASET/atari_rom/ (defaults to ~/tensorpack_data/atari_rom/), e.g.:

mkdir -p ~/tensorpack_data/atari_rom
wget https://github.com/openai/atari-py/raw/master/atari_py/atari_roms/breakout.bin -O ~/tensorpack_data/atari_rom/breakout.bin

Start Training:

./DQN.py --rom breakout.bin
# use `--algo` to select other DQN algorithms. See `-h` for more options.

Watch the agent play:

./DQN.py --rom breakout.bin --task play --load trained.model

A pretrained model on breakout can be downloaded here.

A3C code and models for Atari games in OpenAI Gym are released in examples/A3C-Gym