This is an implementation of DQN (based on Mnih et al., 2015) in Keras + TensorFlow + OpenAI Gym.
- gym (Atari environment)
- scikit-image
- keras
- tensorflow
This is the result of training of DQN for about 28 hours (12K episodes, 4.7 millions frames) on AWS EC2 g2.2xlarge instance.
Statistics of average loss, average max q value, duration, and total reward / episode.
For DQN, run:
python dqn.py
For Double DQN, run:
python ddqn.py
Run the following:
tensorboard --logdir=summary/
I built an AMI for this experiment. All of requirements + CUDA + cuDNN are pre-installed in the AMI.
The AMI name is DQN-AMI
, the ID is ami-c4a969a9
, and the region is N. Virginia. Feel free to use it.