Reinforcement Learning

Motivation

My exercises while learning Reiforcement Learning. I use as a guide the lectures from David Silver. While viewing the lectures I came up with execises to practice the teachings in the lecture, so I decided to implement them.

Labyrinth_Problem.py: requires labyrinth1.json. The goal of this script is that taking a labyrinth schema, the script will estimate the value of each posible state (position in the labyrinth). Once it has converged, given any state, the way out of it will be moving into states with smaller value. (Need code cleaning)
Random_Walk_Predictions.py: This script will evaluate the value of the steps in a random walk (only one degree of freedom) using a MonteCarlo method, the TD(0) and the TD(lambda). (Need code cleaning)
Wind_Grid_Control.py: This script will extract the optimal policy for a Grid World walk. The policy will be optained using MonteCarlo (need debugging), SARSA(0), SARSA(lambda) and SARSAmax (Need code cleaning)

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
David Silver Course Exercises		David Silver Course Exercises
Open AI		Open AI
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning

Motivation

Contents

About

Releases

Packages

Languages

Elbarmo/Reinforcement_Learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning

Motivation

Contents

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages