Skip to content

Collection of Deep Reinforcement Learning Jupyter Notebooks. Each notebook is self-contained and presents single algorithm. These include DP, MC, TD, SARSA, Q-Learning and DQNs.

Notifications You must be signed in to change notification settings

marcinbogdanski/rl-sketchpad

Repository files navigation

RL Sketchpad

Implementations of various RL Algorithms

Contents:


Reinforcement Learning: An Introduction (2nd ed, 2018) by Sutton and Barto

Implementation of selected algorithms from the book. I tried to make code snippets minimal and faithful to the book.

Part I: Tabular Solution Methods
Chapter 2: Multi-armed Bandits
fig_0201.png
Chapter 4: Dynamic Programming
0401_model_diagram.png
Chapter 5: Monte Carlo Methods
fig_0503.png
Chapter 6: Temporal-Difference Learning
fig_0604a.png
Part II: Approximate Solution Methods
Chapter 9: On-Policy Prediction with Approximation
fig_0901.png
Chapter 10: On-Policy Control with Approximation
fig_1001.png

UCL Course on RL (2016) Youtube lectures by David Silver

A bit more in-depth explanation of selected concepts from David Sivler lectures and Sutton and Barto book.


Mini Posts

About

Collection of Deep Reinforcement Learning Jupyter Notebooks. Each notebook is self-contained and presents single algorithm. These include DP, MC, TD, SARSA, Q-Learning and DQNs.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published