Cliff Walking

This project demonstrates a number of common reinforcement learning (RL) algorithms, applied on Sutton & Barto's cliff walking problem. The aim is to aid understanding of RL mechanisms in a comprehensive environment. For this purpose, the code is relatively integrated and hard-coded. I intermittedly add new algorithms and refactor the code.

The project currently contains the following algorithms:

Q-learning
SARSA
Deep Q-learning
Discrete policy gradient
Deep policy gradient

Neural network approaches are incorporated using TensorFlow.

My series of blog posts at Towards Data Science provides descriptions and interpretations of the implemented algorithms and their results:
Q-learning and SARSA
Monte Carlo learning
Discrete Policy Gradient
Deep Q-Learning
Deep Policy Gradient

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.idea		.idea
__pycache__		__pycache__
main		main
venv		venv
LICENSE		LICENSE
README.md		README.md
actions.py		actions.py
environment.py		environment.py
learning_algorithms.py		learning_algorithms.py
main.py		main.py
plot.py		plot.py
qtable.py		qtable.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

pycache

pycache

main

main

venv

venv

LICENSE

LICENSE

README.md

README.md

actions.py

actions.py

environment.py

environment.py

learning_algorithms.py

learning_algorithms.py

main.py

main.py

plot.py

plot.py

qtable.py

qtable.py

requirements.txt

requirements.txt

Repository files navigation

Cliff Walking

About

Releases

Packages

Languages

License

woutervanheeswijk/cliff_walking_public

Folders and files

Latest commit

History

Repository files navigation

Cliff Walking

About

Resources

License

Stars

Watchers

Forks

Languages