Skip to content

In this repo I explore the sarsa, sarsa max, and expected sarsa methods to solve RL tasks.

License

Notifications You must be signed in to change notification settings

Victor-Martinez-Pozos/DRL-TD-methods

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DRL-TD-methods

Description

In this repo I explore the Sarsa, Sarsa max, and Expected sarsa methods to solve the RL task CliffWalking-v0 from OpenA-GYM.

Usage

All of the three RL algorithms are implemented in the jupyter notebook Temporal_Difference.ipynb and running all cell in it you can train an agent to solve the enviroment in a different way(Sarsa, Sarsa max, Expected sarsa).

Installation

To use this code you need to install the following packages:

License

GNU General Public License v3.0

About

In this repo I explore the sarsa, sarsa max, and expected sarsa methods to solve RL tasks.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published