Skip to content

Using Monte Carlo Learning and Policy Evaluation-Iteration methods to chart a path for an agent to reach a goal through an icy terrain with intermittent holes and stochastic wind

Notifications You must be signed in to change notification settings

Riddhiman-M/RL1-Frozen-Lake-with-Stochastic-Wind

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RL1-Frozen-Lake-with-Stochastic-Wind

Using Monte Carlo Learning and Policy Evaluation-Iteration methods to chart a path for an agent to reach a goal through an icy terrain while also avoiding intermittent holes and dealing with stochastic wind which make the agents' actions non-correspondent with it's observed state transition

Original Policy Policy after 30 episodes

The policy is found to converge within 30 episodes while the state values take ~1000 episodes to converge by Monte-Carlo method

Final state values:

About

Using Monte Carlo Learning and Policy Evaluation-Iteration methods to chart a path for an agent to reach a goal through an icy terrain with intermittent holes and stochastic wind

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages