Skip to content

Latest commit

 

History

History
23 lines (14 loc) · 617 Bytes

README.md

File metadata and controls

23 lines (14 loc) · 617 Bytes

DRL-TD-methods

Description

In this repo I explore the Sarsa, Sarsa max, and Expected sarsa methods to solve the RL task CliffWalking-v0 from OpenA-GYM.

Usage

All of the three RL algorithms are implemented in the jupyter notebook Temporal_Difference.ipynb and running all cell in it you can train an agent to solve the enviroment in a different way(Sarsa, Sarsa max, Expected sarsa).

Installation

To use this code you need to install the following packages:

License

GNU General Public License v3.0