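Misfortune often builds up from overlooked trifles, and the wise and brave are often undone by what they indulge in. - Ouyang Xiu, "Preface to the Biographies of Court Musicians" (伶官传序)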
/^ ^\
/ 0 0 \
V\ Y /V
/ - \
/ |
V__) ||
ASCII art is from ascii-code.com.
Here is my implementation of the following Q-learning algorithms in TensorFlow:
- DQN
- Double DQN
- Distributional DQN
- Dueling Network
- Combined dueling network and distributional DQN, as in Rainbow
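
For readers new to these variants, here is a minimal sketch of how three of them differ for a single transition. It is plain Python for illustration only, not the TensorFlow code in this repository, and the function names (`dqn_target`, `double_dqn_target`, `dueling_q_values`) are hypothetical.

```python
# Illustrative sketch only: these helpers are hypothetical and are not
# taken from this repository's TensorFlow implementation.

def dqn_target(reward, done, next_q_target, gamma=0.99):
    """DQN: the target network both selects and evaluates the next action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, done, next_q_online, next_q_target, gamma=0.99):
    """Double DQN: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


def dueling_q_values(state_value, advantages):
    """Dueling head: Q(s, a) = V(s) + A(s, a) - mean over a of A(s, a)."""
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + adv - mean_adv for adv in advantages]


# Q-values of the next state under the online and target networks.
next_q_online = [1.0, 3.0, 2.0]
next_q_target = [2.0, 0.5, 4.0]

print(dqn_target(1.0, False, next_q_target, gamma=0.5))                        # 3.0
print(double_dqn_target(1.0, False, next_q_online, next_q_target, gamma=0.5))  # 1.25
print(dueling_q_values(2.0, [1.0, 0.0, -1.0]))                                 # [3.0, 2.0, 1.0]
```

In this example the online and target networks disagree about the best next action, so the Double DQN target (1.25) is lower than the DQN target (3.0). Reducing that kind of overestimation is the motivation for Double DQN.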
My thanks to Denny Britz.
The Atari game wrappers are from baselines.
python train.py --game_name PongNoFrameskip-v4