RL-dynamic programming

in this notebook we are going to implement 2 dynamic programming algorithms : policy iteration and value iteration.

Thes algorithms are useful when we have the markov decision process of the environment (model-based algorithms)

The environment used in this notebook is FrozenLake8x8 from openai gym environments

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
notebook.ipynb		notebook.ipynb

Provide feedback