Implementation for Reinforcement Learning: An Introduction

This project provides a python implementation for all the figures and examples in the book - Reinforcement Learning: An Introduction (2nd Edition).

In the implementation process, some parameters are not mentioned in the book. For the consistency of the figure, these parameters refer to the code from ShangtongZhang.

Figure: the original (left) and implementation (right)

All codes are well organized and easy to understand, and can be easily read with formulas and algorithms:

# direct reinforcement learning
Q[S][A] += α * (R + γ * max(Q[S_]) - Q[S][A])

# model learning
self.t += 1
# Actions that had never been tried were allowed to be considered in the planning step
if κ != 0 and (S, A) not in M:
    for a in range(Maze.ACT_NUM):
        M[S, a] = (S, 0, 1)
M[S, A] = (S_, R, self.t)

# planning
for _ in range(self.n):
    S, A = random.choice(list(M.keys()))
    S_, R, t = M[S, A]
    if κ:
        τ = self.t - t
        R += κ * np.sqrt(τ)
    Q[S][A] += α * (R + γ * max(Q[S_]) - Q[S][A])

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
chap2		chap2
chap3		chap3
chap4		chap4
chap5		chap5
chap6		chap6
chap7		chap7
chap8		chap8
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Implementation for Reinforcement Learning: An Introduction

About

Uh oh!

Releases

Packages

Languages

Urinx/RL_intro_code

Folders and files

Latest commit

History

Repository files navigation

Implementation for Reinforcement Learning: An Introduction

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages