Skip to content

Elbarmo/Reinforcement_Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning

Motivation

My exercises while learning Reiforcement Learning. I use as a guide the lectures from David Silver. While viewing the lectures I came up with execises to practice the teachings in the lecture, so I decided to implement them.

Contents

This repository contains the following examples:

  1. Labyrinth_Problem.py: requires labyrinth1.json. The goal of this script is that taking a labyrinth schema, the script will estimate the value of each posible state (position in the labyrinth). Once it has converged, given any state, the way out of it will be moving into states with smaller value. (Need code cleaning)
  2. Random_Walk_Predictions.py: This script will evaluate the value of the steps in a random walk (only one degree of freedom) using a MonteCarlo method, the TD(0) and the TD(lambda). (Need code cleaning)
  3. Wind_Grid_Control.py: This script will extract the optimal policy for a Grid World walk. The policy will be optained using MonteCarlo (need debugging), SARSA(0), SARSA(lambda) and SARSAmax (Need code cleaning)

About

My exercises while learning Reinforcement Learning

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages