Skip to content

Creating two interesting Markov Decision Processes, solving them using value iteration , policy iteration as well as Q-learning, comparing them and analysing how each method converges, how it performs and how their parameters affects the solution.

Notifications You must be signed in to change notification settings

Younes43/Reinforcement-Learning-Markov-Decision-Processes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Markov Decision Processes

Author: Younes EL BOUZEKRAOUI

Git Repository :

This Google drive https://drive.google.com/drive/folders/1eBCz1aysS-T2zLGcNMg-syHbuCPO2jJU?usp=sharing contains one jupyter notebook (.ipynb) and a folder named 'archive' containing 4 datasets (.csv)

To reproduce the results and the figures, please make sure that the file mdp.py is in the same repository as the notebook hmw4.ipynb and then run all the cells of the notebook.

Note that it might take few minutes to run all the cells and get the results.

Repository Structure:

|--hmw4.ipynb
|--hmw4.html
|--README.md
|--README.txt
|--mdp.py
|--ybouzekraoui3_analysis.pdf


Programming language and Libraries

All the code related to this assignement, the results and the ploted figures are in the jupyter notebook file.

The programming language used is Python (Python 3) and the libraries used are :

  • Numpy
  • Matplotlib
  • Random
  • Seaborn
  • gym
  • pymdptoolbox

Please make sure to have Jupter notebook and all the libraries above installed, the first cell of the notebook installes the required packages if the are not already installed .

About

Creating two interesting Markov Decision Processes, solving them using value iteration , policy iteration as well as Q-learning, comparing them and analysing how each method converges, how it performs and how their parameters affects the solution.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published