An implementation of Q-Learning for training a machine to play Swingy Monkey (a Flappy Bird clone).
Both the game (SwingyMonkey.py) and the learning models (qlearn_mX.py) are provided. The three models are as follows:
- qlearn_m1.py - World state is a function of ( horizontal_dist(monkey, tree), vertical_dist(monkey, tree), velocity(monkey) ).
- qlearn_m2.py - Same state variables as model 1, but with 2x the resolution of horizontal_dist and vertical_dist.
- qlearn_m3.py - World state is ( horizontal_dist(monkey, tree), vertical_dist(monkey, tree), position(monkey), velocity(monkey) ). The resolution is the same as in model 2.
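All three models follow the standard tabular Q-learning recipe: discretize the raw game state into a small tuple, then update a Q-table with an epsilon-greedy policy. The sketch below illustrates the idea for the model-1 state space; the bin sizes, state-dict keys, and hyperparameters are illustrative assumptions and may differ from what qlearn_mX.py actually uses.

```python
import random
from collections import defaultdict

# Hypothetical bin sizes for discretizing the model-1 state space;
# the actual resolution in qlearn_m1.py may differ.
H_BIN, V_BIN, VEL_BIN = 50, 50, 10

def discretize(state):
    """Map the raw game state to a coarse (h_dist, v_dist, velocity) tuple.

    Assumes the game exposes a state dict with tree distance/top and
    monkey top/velocity; adjust the keys to match SwingyMonkey's API.
    """
    h = state['tree']['dist'] // H_BIN
    v = (state['tree']['top'] - state['monkey']['top']) // V_BIN
    vel = state['monkey']['vel'] // VEL_BIN
    return (h, v, vel)

# Q-table keyed by (discrete_state, action); action 0 = glide, 1 = jump.
Q = defaultdict(float)
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.05  # illustrative hyperparameters

def choose_action(s):
    """Epsilon-greedy action selection over the two available actions."""
    if random.random() < EPSILON:
        return random.choice([0, 1])
    return max([0, 1], key=lambda a: Q[(s, a)])

def update(s, a, reward, s_next):
    """One-step Q-learning update: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(Q[(s_next, 0)], Q[(s_next, 1)])
    Q[(s, a)] += ALPHA * (reward + GAMMA * best_next - Q[(s, a)])
```

Models 2 and 3 differ only in the discretization: finer bins and, for model 3, an extra position(monkey) component in the state tuple, which enlarges the Q-table and slows convergence.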
- Make sure you have pygame installed.
- Use `python qlearn_mX.py` to start training.
NOTE: qlearn_m1.py learns a reasonably good policy after ~20 minutes (on a standard MacBook Pro). qlearn_m2.py takes several hours to converge to its optimal policy, and qlearn_m3.py was still improving after ~18h of runtime, but by that time it was already better than the first two models (and it kept improving, although very slowly). See the analysis folder for charts (generated after running all three models for ~18 hours).