tensorpack/examples/Atari2600 at master · nitinh/tensorpack

DQN.py
README.md
atari.py
breakout.jpg
common.py
curve-breakout.png

README.md

video demo

Reproduce the following reinforcement learning methods:

Nature-DQN in: Human-level Control Through Deep Reinforcement Learning
Double-DQN in: Deep Reinforcement Learning with Double Q-learning
Dueling-DQN in: Dueling Network Architectures for Deep Reinforcement Learning
A3C in: Asynchronous Methods for Deep Reinforcement Learning. (I used a modified version, which I call "Batch-A3C", where each batch contains transitions from different simulators.)
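To make the difference between the three DQN variants concrete, here is a minimal sketch of the bootstrap targets and the dueling aggregation as described in the papers above. It is plain Python for illustration only and is not taken from DQN.py; the function names and the toy Q-values are assumptions.

```python
# Illustrative sketch only; not code from DQN.py. All names are hypothetical.

def nature_dqn_target(reward, done, gamma, q_target_next):
    """Nature-DQN: bootstrap from the target network's maximum Q-value."""
    if done:
        return reward
    return reward + gamma * max(q_target_next)


def double_dqn_target(reward, done, gamma, q_online_next, q_target_next):
    """Double-DQN: the online network picks the action, the target network scores it."""
    if done:
        return reward
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best_action]


def dueling_q_values(state_value, advantages):
    """Dueling-DQN: Q(s, a) = V(s) + A(s, a) - mean of A(s, a') over all a'."""
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + adv - mean_advantage for adv in advantages]


# Toy next-state Q-values on which the two networks disagree about the best action.
q_online_next = [1.0, 3.0, 2.0]
q_target_next = [4.0, 0.5, 2.5]

print(nature_dqn_target(1.0, False, 0.5, q_target_next))
print(double_dqn_target(1.0, False, 0.5, q_online_next, q_target_next))
print(dueling_q_values(2.0, [1.0, 0.0, -1.0]))
```

When the two networks disagree, the Double-DQN target is lower than the Nature-DQN target, which is the overestimation correction that paper argues for.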

The performance claimed in the papers can be reproduced on the several games I've tested.

DQN typically took 2 days of training to reach a score of 400 on Breakout. My Batch-A3C implementation took less than 2 hours. Both were trained on one GPU, with an extra GPU for simulation.

The x-axis is the number of iterations, not wall time. Iteration speed on a Tesla M40 is about 9.7 it/s for B-A3C. D-DQN is faster at the beginning but converges to 12 it/s due to exploration annealing.
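Since the curves are plotted against iterations, comparing methods by wall time means dividing by the iteration rates quoted above. A rough sketch, assuming the rate stays constant over the run (which, as noted, is not strictly true for D-DQN early in training); `iterations_to_hours` is a hypothetical helper, not part of tensorpack.

```python
def iterations_to_hours(iterations, iterations_per_second):
    """Convert an iteration count to wall-clock hours at a fixed iteration rate."""
    if iterations_per_second <= 0:
        raise ValueError("iterations_per_second must be positive")
    return iterations / iterations_per_second / 3600.0


# Rates taken from the text: about 9.7 it/s for B-A3C, about 12 it/s for D-DQN.
for name, rate in [("B-A3C", 9.7), ("D-DQN", 12.0)]:
    hours = iterations_to_hours(100_000, rate)
    print(f"{name}: 100000 iterations take about {hours:.2f} h")
```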

How to use

Download an Atari ROM to $TENSORPACK_DATASET/atari_rom/ (defaults to tensorpack/dataflow/dataset/atari_rom/).
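The lookup rule above can be sketched as follows, to check where a ROM is expected before training. This is an assumption about the behavior, not tensorpack's actual code: `atari_rom_dir` and `rom_path` are hypothetical helpers, and the default is taken to be relative to the repository root.

```python
import os


def atari_rom_dir(repo_root="."):
    """Return the ROM directory: $TENSORPACK_DATASET/atari_rom, else the in-repo default."""
    base = os.environ.get("TENSORPACK_DATASET")
    if not base:
        # Assumed default location, relative to the repository root.
        base = os.path.join(repo_root, "tensorpack", "dataflow", "dataset")
    return os.path.join(base, "atari_rom")


def rom_path(rom_name, repo_root="."):
    """Return the full path at which a ROM file such as breakout.bin is expected."""
    return os.path.join(atari_rom_dir(repo_root), rom_name)


path = rom_path("breakout.bin")
print(path, "(found)" if os.path.isfile(path) else "(missing)")
```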

To train:

./DQN.py --rom breakout.bin
# use `--algo` to select other DQN algorithms

To visualize the agent:

./DQN.py --rom breakout.bin --task play --load trained.model

A3C code and models for Atari games in OpenAI Gym are released in examples/OpenAIGym.
