DRL-TD-methods

Description

In this repo I explore the Sarsa, Sarsa max (Q-learning), and Expected Sarsa methods to solve the CliffWalking-v0 reinforcement learning task from OpenAI Gym.
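As a rough illustration of how the three methods differ, here is a minimal sketch of their TD targets. It is not taken from the notebook: the function names (`epsilon_greedy_probs`, `td_target`, `td_update`) and the array-based layout of `Q` are assumptions made for this example, and Expected Sarsa is assumed to average over an epsilon-greedy policy.

```python
import numpy as np

def epsilon_greedy_probs(q_row, epsilon):
    """Action probabilities of an epsilon-greedy policy for one state."""
    n_actions = len(q_row)
    probs = np.full(n_actions, epsilon / n_actions)
    probs[np.argmax(q_row)] += 1.0 - epsilon
    return probs

def td_target(method, Q, reward, next_state, next_action, gamma, epsilon):
    """Return the TD target for one transition under the chosen method."""
    if method == "sarsa":
        # Sarsa: bootstrap from the action actually selected in the next state.
        return reward + gamma * Q[next_state][next_action]
    if method == "sarsamax":
        # Sarsa max (Q-learning): bootstrap from the greedy action.
        return reward + gamma * np.max(Q[next_state])
    if method == "expected_sarsa":
        # Expected Sarsa: average over the (assumed) epsilon-greedy policy.
        probs = epsilon_greedy_probs(Q[next_state], epsilon)
        return reward + gamma * np.dot(probs, Q[next_state])
    raise ValueError(f"unknown method: {method}")

def td_update(Q, state, action, target, alpha):
    """Move Q[state][action] a fraction alpha toward the target, in place."""
    Q[state][action] += alpha * (target - Q[state][action])
```

All three methods share the same update step; only the target changes. Sarsa is on-policy, Sarsa max always looks at the best next action, and Expected Sarsa replaces the sampled next action with its expectation.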

Usage

All three RL algorithms are implemented in the Jupyter notebook Temporal_Difference.ipynb. Running all the cells in it trains an agent to solve the environment with each method (Sarsa, Sarsa max, Expected Sarsa).
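To give an idea of what training looks like outside the notebook, here is a minimal Sarsa sketch. It is not the notebook's code. To stay independent of the installed Gym version, it uses a small hypothetical stand-in class, `CliffGrid`, that mimics the CliffWalking-v0 layout (4x12 grid, -1 per step, -100 and a return to the start when stepping into the cliff). The names `choose_action` and `train_sarsa`, the `1 / episode` epsilon schedule, and the `max_steps` cap are likewise assumptions.

```python
import numpy as np

class CliffGrid:
    """Hypothetical stand-in for CliffWalking-v0 (not the Gym class)."""
    n_rows, n_cols, n_actions = 4, 12, 4
    moves = {0: (-1, 0), 1: (0, 1), 2: (1, 0), 3: (0, -1)}  # up, right, down, left

    def reset(self):
        self.pos = (3, 0)  # start in the bottom-left corner
        return self._state()

    def _state(self):
        return self.pos[0] * self.n_cols + self.pos[1]

    def step(self, action):
        """Return (next_state, reward, done) for one move."""
        dr, dc = self.moves[action]
        row = min(max(self.pos[0] + dr, 0), self.n_rows - 1)
        col = min(max(self.pos[1] + dc, 0), self.n_cols - 1)
        self.pos = (row, col)
        if row == 3 and 0 < col < 11:  # stepped into the cliff
            self.pos = (3, 0)
            return self._state(), -100, False
        return self._state(), -1, self.pos == (3, 11)

def choose_action(Q, state, epsilon, rng):
    """Epsilon-greedy action selection."""
    if rng.random() < epsilon:
        return int(rng.integers(Q.shape[1]))
    return int(np.argmax(Q[state]))

def train_sarsa(env, n_episodes=500, alpha=0.1, gamma=1.0, max_steps=10_000, seed=0):
    """Train a tabular Sarsa agent; returns the Q-table and per-episode returns."""
    rng = np.random.default_rng(seed)
    Q = np.zeros((env.n_rows * env.n_cols, env.n_actions))
    returns = []
    for episode in range(1, n_episodes + 1):
        epsilon = 1.0 / episode  # assumed decay schedule
        state = env.reset()
        action = choose_action(Q, state, epsilon, rng)
        total, done, steps = 0, False, 0
        while not done and steps < max_steps:
            next_state, reward, done = env.step(action)
            next_action = choose_action(Q, next_state, epsilon, rng)
            # Terminal states contribute no bootstrapped value.
            target = reward + gamma * Q[next_state, next_action] * (not done)
            Q[state, action] += alpha * (target - Q[state, action])
            state, action = next_state, next_action
            total += reward
            steps += 1
        returns.append(total)
    return Q, returns
```

Calling `Q, returns = train_sarsa(CliffGrid())` gives a 48x4 Q-table and one return per episode. The shortest safe path takes 13 steps, so no episode can score better than -13.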

Installation

To use this code you need to install the following packages:

numpy
jupyter
matplotlib
seaborn
OpenAI Gym

License

GNU General Public License v3.0


Victor-Martinez-Pozos/DRL-TD-methods
