Windy World

An Example of Reinforcement Learning for Making Decisions Under Uncertainty

Windy World is a classic test problem for reinforcement learning and dynamic programming methods. An agent moves in a grid world (7x10 in this example) that contains a starting cell, a target (or home) cell, and a few failing cells (lakes). The agent starts from the starting cell and must find its way to the home cell. In a calm world (no wind), an agent using a simple RL algorithm (SARSA in this case) can find the optimal path.
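As a rough illustration of the SARSA update mentioned above, the sketch below shows the on-policy temporal-difference rule together with epsilon-greedy action selection. It is not taken from the notebook: the names `ACTIONS`, `epsilon_greedy`, and `sarsa_update`, as well as the default values of `alpha` and `gamma`, are assumptions made for this example.

```python
import random
from collections import defaultdict

# Hypothetical action set for a grid world (not taken from the notebook).
ACTIONS = ["up", "down", "left", "right"]

def epsilon_greedy(Q, state, epsilon, rng=random):
    """Pick a random action with probability epsilon, otherwise the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(state, a)])

def sarsa_update(Q, s, a, reward, s_next, a_next,
                 alpha=0.5, gamma=0.9, done=False):
    """One SARSA step: Q(s,a) += alpha * (r + gamma * Q(s',a') - Q(s,a)).

    The target uses the action a_next that the agent actually selected in
    s_next, which is what makes SARSA on-policy. alpha and gamma are
    illustrative defaults, not the notebook's settings.
    """
    target = reward if done else reward + gamma * Q[(s_next, a_next)]
    Q[(s, a)] += alpha * (target - Q[(s, a)])
    return Q[(s, a)]

# Q-table keyed by (state, action); unseen pairs default to 0.0.
Q = defaultdict(float)
```

In a training loop, the agent would pick `a_next` with `epsilon_greedy` in the new state, call `sarsa_update`, and then carry `s_next, a_next` forward as the next `s, a`.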

In this example, the optimal path is the shortest path that is also safe. To add uncertainty, however, the world is windy rather than calm: the agent chooses an action, but because of the strong wind, the action actually carried out is not necessarily the one it chose. The agent might therefore fall into a lake (a failing cell). In this situation, the shortest path is no longer optimal; the agent must find a path to the home cell that balances safety and length.
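One way to model this kind of wind is a transition function in which the chosen action is sometimes replaced by a random one. The sketch below is only an assumption about how such a step could look: the function name `windy_step`, the wind probability, the reward values, and the lake and home coordinates are all hypothetical and may differ from the notebook.

```python
import random

ROWS, COLS = 7, 10  # grid size stated in the text
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}

def windy_step(state, action, lakes, home, wind_prob=0.2, rng=random):
    """Move one cell; with probability wind_prob the wind overrides the choice.

    Returns (next_state, reward, done). The reward values are illustrative:
    a small step cost, a large penalty for a lake, a bonus for reaching home.
    """
    if rng.random() < wind_prob:
        # The wind pushes the agent in a uniformly random direction.
        action = rng.choice(list(MOVES))
    d_row, d_col = MOVES[action]
    # Clamp to the grid so the agent cannot leave the world.
    row = min(max(state[0] + d_row, 0), ROWS - 1)
    col = min(max(state[1] + d_col, 0), COLS - 1)
    next_state = (row, col)
    if next_state in lakes:
        return next_state, -100.0, True
    if next_state == home:
        return next_state, 10.0, True
    return next_state, -1.0, False
```

With `wind_prob=0.0` this reduces to the calm world. As the wind gets stronger, paths that pass next to a lake become riskier, which is why a learned policy may prefer a longer detour.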

Running the Code

To run this code, open "Windy_World_stochastic.ipynb" in Jupyter Notebook and run it.

tamiminaser/Windy_World_Reinforcement_Learning
