Reinforcement Learning: An Introduction

Python replication for Sutton & Barto's book Reinforcement Learning: An Introduction (2nd Edition)

If you have any confusion about the code or want to report a bug, please open an issue instead of emailing me directly, and unfortunately I do not have exercise answers for the book.

Figure 9.1: Gradient Monte Carlo algorithm on the 1000-state random walk task
Figure 9.2: Semi-gradient n-steps TD algorithm on the 1000-state random walk task
Figure 9.5: Fourier basis vs polynomials on the 1000-state random walk task
Figure 9.8: Example of feature width’s effect on initial generalization and asymptotic accuracy
Figure 9.10: Single tiling and multiple tilings on the 1000-state random walk task

Chapter 10

Chapter 11

Chapter 12

Chapter 13

Environment

python 3.6
numpy
matplotlib
seaborn
tqdm

Usage

All files are self-contained

python any_file_you_want.py

Contribution

If you want to contribute some missing examples or fix some bugs, feel free to open an issue or make a pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 315 Commits
chapter01		chapter01
chapter02		chapter02
chapter03		chapter03
chapter04		chapter04
chapter05		chapter05
chapter06		chapter06
chapter07		chapter07
chapter08		chapter08
chapter09		chapter09
chapter10		chapter10
chapter11		chapter11
chapter12		chapter12
chapter13		chapter13
images		images
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

ShangtongZhang/reinforcement-learning-an-introduction

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning: An Introduction

Contents

Chapter 1

Chapter 2

Chapter 3

Chapter 4

Chapter 5

Chapter 6

Chapter 7

Chapter 8

Chapter 9

Chapter 10

Chapter 11

Chapter 12

Chapter 13

Environment

Usage

Contribution

About

Topics

Resources

License

Stars

Watchers

Forks

Languages