Q-learning

Using reinforcement learning to play snake and similar games.

An environment is created that simulates a game . This environment takes actions and returns the resulting screen of those actions, plus the resulting reward (-1 if the player dies, 0 if nothing happens and 1 if the player scores).

A player with a neural-network provides actions and learns from the environment responses using Q-learning per advantage learning. The network should learn what actions provide the best value.

Techniques used

Advantage learning, particularized to Q-learning.
Double Q-learning
(Prioritized) memory replay
Progressive discount rate growth
Progressive exploration rate growth

Results

For such a simple game the player should be able to learn to play for much longer, but it is clearly working:

More info on Q-learning

Demystifying Deep Reinforcement Learning

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
games		games
#README.md#		#README.md#
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Ubuntu-L.ttf		Ubuntu-L.ttf
bestcatchplayer.h5		bestcatchplayer.h5
catchgame.gif		catchgame.gif
main.py		main.py
model.h5		model.h5
player.py		player.py
snakegame.gif		snakegame.gif
snakeplayer.h5		snakeplayer.h5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Q-learning

Techniques used

Results

More info on Q-learning

About

Releases

Packages

Languages

License

carllacan/qlearning

Folders and files

Latest commit

History

Repository files navigation

Q-learning

Techniques used

Results

More info on Q-learning

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages