batokio / RL_text-flappy-bird Public

Notifications You must be signed in to change notification settings
Fork 0
Star 1

Implementation of Q-Learning and Expected SARSA algorithms to solve the Text-Flappy-Bird game

1 star 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
img		img
README.md		README.md
RL_Text Flappy Bird.ipynb		RL_Text Flappy Bird.ipynb

Repository files navigation

RL_text-flappy-bird

The notebook presents the implementation of the Q-Learning and Expected SARSA algorithms to solve the Text-Flappy-Bird game

Text flappy bird game

The implementation of the environment can be found here: https://gitlab-research.centralesupelec.fr/stergios.christodoulidis/text-flappy-bird-gym

Final model

Please find below the final model used to measure the performance of the agents:

Hyperparameters	Q-Learning	Expected SARSA
Step-size	0.5	0.5
Step-size decay	1.0	0.99999
Epsilon	0.05	0.05
Epsilon decay	0.99999	0.99999
Discount	1.0	0.9

Performance

The sum of rewards achieved by both agents:

Q-Learning: 8,041,130
Expected SARSA : 36,660

About

Implementation of Q-Learning and Expected SARSA algorithms to solve the Text-Flappy-Bird game

reinforcement-learning hyperparameter-tuning expected-sarsa qlearning-algorithm reinforcement-learning-agent reinforcement-learning-environments

Report repository

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook 100.0%