hollygrimm/markov-decision-processes

OpenAI Scholars: Reinforcement Learning Self-Study

Week 1: Markov Decision Processes

Resources

  1. Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition (Jan 1, 2018 draft), Chapter 3: Markov Decision Processes and Chapter 4: Dynamic Programming

  2. Deep RL Bootcamp Core Lecture 1 Intro to MDPs and Exact Solution Methods -- Pieter Abbeel Video | Slides

  3. Deep RL Bootcamp Core Lecture 2 Sample-based Approximations and Fitted Learning -- Rocky Duan Video | Slides

  4. Deep RL Bootcamp Lab 1: Markov Decision Processes. The lab has you implement value iteration, policy iteration, and tabular Q-learning, then apply them to simple environments, including tabular maze navigation (FrozenLake) and control of a simple crawler robot.

  5. CS294 Reinforcement learning introduction -- Levine Video | Slides

  6. CS294 Value functions introduction -- Levine Video | Slides

  7. Introduction to Reinforcement Learning by Joshua Achiam, OpenAI Slides

Notes

RL Algorithms Diagram

Interaction in Markov decision process

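The interaction loop can be sketched in a few lines of Python. This is a minimal illustration, not code from the lab: `CorridorEnv`, its `reset`/`step` methods, and `run_episode` are hypothetical names made up for this note, and the environment is a toy four-cell corridor rather than FrozenLake or the Crawler.

```python
import random


class CorridorEnv:
    """Hypothetical toy MDP: cells 0..length-1, reward 1.0 on reaching the last cell."""

    def __init__(self, length=4):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 1 moves right, any other action moves left; walls clamp the move.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def run_episode(env, policy, gamma=0.9, max_steps=100):
    """Roll out one episode and return the discounted return sum_t gamma^t * r_t."""
    state = env.reset()
    discounted_return, discount = 0.0, 1.0
    for _ in range(max_steps):
        action = policy(state)                  # agent picks an action from the state
        state, reward, done = env.step(action)  # environment returns next state and reward
        discounted_return += discount * reward
        discount *= gamma
        if done:
            break
    return discounted_return


rng = random.Random(0)
random_return = run_episode(CorridorEnv(), lambda s: rng.choice([0, 1]))
always_right_return = run_episode(CorridorEnv(), lambda s: 1)
```

With the always-right policy the goal is reached on the third step, so the discounted return is 0.9 squared, or 0.81. The random policy usually takes longer and therefore earns a smaller return.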

Value Iteration in an MDP

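As a rough sketch of value iteration, the code below repeatedly applies the Bellman optimality backup, V(s) = max over a of the sum over s' of P(s'|s,a) [r + gamma V(s')], until the values stop changing. The transition table layout `P[s][a] = [(prob, next_state, reward, done), ...]` and the helper `build_corridor` are assumptions chosen for this illustration; the toy corridor stands in for the lab's environments.

```python
def build_corridor(length=4):
    """Hypothetical deterministic corridor; the last cell is terminal."""
    P = {}
    goal = length - 1
    for s in range(length):
        P[s] = {}
        for a in (0, 1):  # 0 = left, 1 = right
            if s == goal:
                P[s][a] = [(1.0, s, 0.0, True)]  # terminal state absorbs with no reward
                continue
            s2 = min(max(s + (1 if a == 1 else -1), 0), goal)
            P[s][a] = [(1.0, s2, 1.0 if s2 == goal else 0.0, s2 == goal)]
    return P


def q_value(P, V, s, a, gamma):
    """Expected one-step return of taking action a in state s, then following V."""
    return sum(p * (r + gamma * (0.0 if done else V[s2]))
               for p, s2, r, done in P[s][a])


def value_iteration(P, gamma=0.9, tol=1e-8, max_iters=1000):
    V = {s: 0.0 for s in P}
    for _ in range(max_iters):
        # Synchronous backup: every new value is computed from the previous sweep.
        new_V = {s: max(q_value(P, V, s, a, gamma) for a in P[s]) for s in P}
        delta = max(abs(new_V[s] - V[s]) for s in P)
        V = new_V
        if delta < tol:
            break
    # Greedy policy with respect to the final value estimates.
    policy = {s: max(P[s], key=lambda a: q_value(P, V, s, a, gamma)) for s in P}
    return V, policy


V, policy = value_iteration(build_corridor())
```

On this corridor the values settle at 0.81, 0.9, and 1.0 for cells 0, 1, and 2, and the greedy policy moves right in each of them.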

OpenAI’s Crawler robot attempting to walk with random actions (video).

The same Crawler robot after it has been trained for 30,000 steps with a Q-learning algorithm (video).
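For reference, a tabular Q-learning update can be sketched as follows. This is not the Crawler setup from the lab: the four-cell corridor, the `step` function, and the hyperparameters (`alpha`, `epsilon`, the episode count) are all assumptions picked to keep the example small and self-contained.

```python
import random

N_STATES, GOAL, ACTIONS = 4, 3, (0, 1)  # hypothetical corridor; 0 = left, 1 = right


def step(state, action):
    """Deterministic toy dynamics: reward 1.0 only when the goal cell is reached."""
    s2 = min(max(state + (1 if action == 1 else -1), 0), GOAL)
    done = s2 == GOAL
    return s2, (1.0 if done else 0.0), done


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2,
               max_steps=100, seed=0):
    rng = random.Random(seed)
    Q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(Q[state])          # exploit, breaking ties at random
                action = rng.choice([a for a in ACTIONS if Q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning target: bootstrap from the best action in the next state.
            target = reward if done else reward + gamma * max(Q[next_state])
            Q[state][action] += alpha * (target - Q[state][action])
            state = next_state
            if done:
                break
    return Q


Q = q_learning()
greedy_policy = [max(ACTIONS, key=lambda a: Q[s][a]) for s in range(GOAL)]
```

Unlike value iteration, this update never reads a transition table; it learns only from sampled `(state, action, reward, next_state)` tuples, which is what makes it usable on environments whose dynamics are unknown.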
