Double Q-learning with experience replay on CartPole-v0 using Keras. See algorithm on CartPole-v0.
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
dense_1 (Dense) (None, 128) 640
_________________________________________________________________
dense_2 (Dense) (None, 2) 258
=================================================================
Total params: 898.0
Trainable params: 898
Non-trainable params: 0.0
- Jaromír Janisch, Let’s make a DQN: Implementation
- Jaromír Janisch, Let’s make a DQN: Debugging
- Jaromír Janisch, Let’s make a DQN: Full DQN
- Keon Kim, Deep Q Learning with Keras and Gym
- H Van Hasselt, A Guez, D Silver, Deep Reinforcement Learning with Double Q-learning
- HV Hasselt, Double Q-learning
- V Mnih, K Kavukcuoglu, D Silver, A Graves, Playing atari with deep reinforcement learning