Snake Deep Q AI

Architecture

Architecture similar to 'Playing Atari with Deep Reinforcement Learning' (2013) V Mnih, K Kavukcuoglu, D Silver et al.

The game's pixels are used directly as network input. 400 x 400 RGB game screen greyscaled and downsampled to 84 x 84. 4 frames stacked and fed into a 3d convolution

Three convolutional layers, one fully connected, and batch norm/dropout to help against overfitting

Conv [16 1x5x5 filters, stride 1x2x2, relu] - features in each layer
Conv [32 2x3x3 filters, stride 2x2x2, relu] - features between layers
Conv [64 1x3x3 filters, stride 1x2x2, relu] - conv pooling
Batch Normalisation
Dropout [0.2]
Dense [128, relu]
Dense [4, softmax]

Replay Training

Long term memory training

End of each game round, randomly select a mini batch of state transition memories (old_state, action, new_state, reward) to train the network

Short term memory training

Train the last state transition

Config

See config.ini

Dependencies

Keras, CV2, Numpy, matplotlib, configparser

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
img		img
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
brain.py		brain.py
config.ini		config.ini
main.py		main.py
save.h5		save.h5
snake.py		snake.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Snake Deep Q AI

Architecture

Replay Training

Config

Dependencies

About

Releases

Packages

Languages

JeremyVun/Deep-Q-Snake

Folders and files

Latest commit

History

Repository files navigation

Snake Deep Q AI

Architecture

Replay Training

Config

Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages