Using Monte Carlo learning and policy evaluation/iteration methods to chart a path for an agent to reach a goal across icy terrain, avoiding intermittent holes and coping with a stochastic wind that makes the agent's actions unreliable: the observed state transition may not correspond to the action the agent chose.
| Original Policy | Policy after 30 episodes |
|---|---|
The policy converges within ~30 episodes, while the state values take ~1000 episodes to converge under the Monte Carlo method.
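The first-visit Monte Carlo evaluation used to estimate these state values can be sketched as below. This is a minimal illustration only: the 4x4 layout, hole positions, 0.2 slip ("wind") probability, discount factor, and hand-made policy are assumptions for the sketch, not the repository's actual environment or hyperparameters.

```python
import random

N = 4                                  # 4x4 icy grid, states 0..15
GOAL, HOLES = 15, {5, 7, 11, 12}       # assumed layout for illustration
ACTIONS = ['U', 'D', 'L', 'R']

def step(state, action, rng):
    """Apply an action; with probability 0.2 the 'wind' replaces it with a
    random action, so the observed transition may not match the choice."""
    if rng.random() < 0.2:
        action = rng.choice(ACTIONS)
    r, c = divmod(state, N)
    if action == 'U' and r > 0:
        state -= N
    elif action == 'D' and r < N - 1:
        state += N
    elif action == 'L' and c > 0:
        state -= 1
    elif action == 'R' and c < N - 1:
        state += 1
    if state == GOAL:
        return state, 1.0, True
    return state, 0.0, state in HOLES

def run_episode(policy, rng, max_steps=100):
    """Roll out one episode under a fixed policy; record (state, reward)."""
    state, trajectory = 0, []
    for _ in range(max_steps):
        nxt, reward, done = step(state, policy[state], rng)
        trajectory.append((state, reward))
        state = nxt
        if done:
            break
    return trajectory

def mc_evaluate(policy, episodes=1000, gamma=0.95, seed=0):
    """First-visit Monte Carlo estimate of the state values V."""
    rng = random.Random(seed)
    V = {s: 0.0 for s in range(N * N)}
    counts = {s: 0 for s in range(N * N)}
    for _ in range(episodes):
        G, first_visit_return = 0.0, {}
        for state, reward in reversed(run_episode(policy, rng)):
            G = gamma * G + reward
            first_visit_return[state] = G  # overwriting keeps the FIRST visit
        for state, G in first_visit_return.items():
            counts[state] += 1
            V[state] += (G - V[state]) / counts[state]  # incremental mean
    return V

# A hand-made policy that walks 0 -> 1 -> 2 -> 6 -> 10 -> 14 -> 15,
# skirting the assumed holes; every other state defaults to 'D'.
policy = {s: 'D' for s in range(N * N)}
policy.update({0: 'R', 1: 'R', 2: 'D', 14: 'R'})
values = mc_evaluate(policy)
```

Iterating the trajectory backwards and overwriting the per-state return keeps only the return from each state's first visit, and the incremental-mean update avoids storing every return, which is why the value estimates need many more episodes to settle than the (greedy) policy does.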
Final state values: