Taxi-v3 Environment

My attempt to solve the Taxi-v3 environment from OpenAI Gym using the following reinforcement learning algorithms:

Q-Learning
SARSA
Expected-SARSA
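
The three algorithms differ only in how they build the bootstrapped target for a Q-table update. The following is a minimal sketch of that difference, not the code from this repo: the function names (`epsilon_greedy_probs`, `td_target`, `apply_update`), the list-of-lists Q-table, and the epsilon-greedy policy used for Expected-SARSA are all assumptions made for illustration.

```python
def epsilon_greedy_probs(q_row, epsilon):
    """Action probabilities of an epsilon-greedy policy for one state.

    Assumption: ties are broken in favour of the lowest action index.
    """
    n_actions = len(q_row)
    probs = [epsilon / n_actions] * n_actions
    best = max(range(n_actions), key=lambda a: q_row[a])
    probs[best] += 1.0 - epsilon
    return probs


def td_target(algorithm, Q, reward, next_state, next_action, gamma, epsilon):
    """Return the update target for a non-terminal transition."""
    q_next = Q[next_state]
    if algorithm == "Q-Learning":
        # Off-policy: bootstrap from the best action in the next state.
        return reward + gamma * max(q_next)
    if algorithm == "SARSA":
        # On-policy: bootstrap from the action that was actually chosen.
        return reward + gamma * q_next[next_action]
    if algorithm == "Expected-SARSA":
        # On-policy: bootstrap from the expectation under the policy.
        probs = epsilon_greedy_probs(q_next, epsilon)
        return reward + gamma * sum(p * q for p, q in zip(probs, q_next))
    raise ValueError(f"unknown algorithm: {algorithm}")


def apply_update(Q, state, action, target, alpha):
    """Move Q[state][action] a fraction alpha of the way toward the target."""
    Q[state][action] += alpha * (target - Q[state][action])
```

For a transition that ends the episode, the target is just the reward, since there is no next state to bootstrap from.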

Install

To install dependencies for this repo, run the following command:

pip install -r requirements.txt

Description

The workspace contains the following files (in alphabetical order):

agent.py: A factory for creating RL agents (follows Factory design pattern).
conf.yaml: A YAML file containing the hyper-parameters for agents.
main.py: The entry point for this repo. In this script, you can create agents, run them, and compare their performance.
monitor.py: Contains the train() function, which measures how well your agent learns from interaction with the environment.
q_learning.py: An agent implemented using the Q-Learning algorithm.
sarsa.py: Two agents implemented using SARSA and Expected-SARSA, respectively.
utils.py: Helpful functions.
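
To illustrate the Factory pattern mentioned for agent.py, here is a rough sketch of how a factory could map an algorithm name to an agent class. It mirrors the `AgentFactory.create_agent(...)` call used in the Usage section, but the class names (`BaseAgent`, `QLearningAgent`, `SarsaAgent`, `ExpectedSarsaAgent`) and the registry layout are hypothetical and may not match the actual file.

```python
class BaseAgent:
    """Hypothetical base class: stores the sizes and the config."""

    def __init__(self, n_states, n_actions, conf):
        self.n_states = n_states
        self.n_actions = n_actions
        self.conf = conf


class QLearningAgent(BaseAgent):
    pass  # placeholder for the Q-Learning update logic


class SarsaAgent(BaseAgent):
    pass  # placeholder for the SARSA update logic


class ExpectedSarsaAgent(BaseAgent):
    pass  # placeholder for the Expected-SARSA update logic


class AgentFactory:
    # Assumed registry; the keys follow the names used in the Usage section.
    _registry = {
        "Q-learning": QLearningAgent,
        "SARSA": SarsaAgent,
        "Expected-SARSA": ExpectedSarsaAgent,
    }

    @staticmethod
    def create_agent(algorithm):
        """Return the agent class (not an instance) for the given name."""
        try:
            return AgentFactory._registry[algorithm]
        except KeyError:
            raise ValueError(f"unknown algorithm: {algorithm}") from None
```

Returning the class rather than an instance lets the caller decide when to construct the agent. Taxi-v3 has 500 discrete states and 6 actions, so a call such as `AgentFactory.create_agent("SARSA")(500, 6, conf)` would build an agent sized for it.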

Usage

You can use the agents found in this repository in two main ways. Before getting into that, let's first import all needed resources and define the environment:

import gym
from utils import *

env = gym.make('Taxi-v3')

Now, let's get into the two main things that can be done with the agents:

Train an agent and see it interact with the environment:

# parse the config file where all hyper-parameters are set
conf = load_conf("conf.yaml")

# define the agent
algorithms = "Expected-SARSA"
AgentModule = AgentFactory.create_agent(algorithms)
agent = AgentModule(env.observation_space.n, env.action_space.n, conf)

# train the agent
avg_rewards, best_avg_reward = train(env, agent, conf)

# see it interact with the environment.
interact(env, agent)
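
For readers curious about what a function like train() does on each episode, the sketch below shows one plausible agent-environment loop. It is not the repo's implementation. To stay runnable without Gym it uses a toy `CorridorEnv` with the classic Gym-style interface (`reset()` returns a state, `step()` returns a 4-tuple); newer Gym releases return extra values from both calls. The agent methods `select_action` and `step` are hypothetical names.

```python
import random


class CorridorEnv:
    """Toy stand-in for an environment: walk right to reach the goal."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 1 moves right; any other action moves left.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 10 if done else -1
        return self.state, reward, done, {}


class RandomAgent:
    """Hypothetical agent interface: select_action() and step()."""

    def __init__(self, n_actions, seed=0):
        self.n_actions = n_actions
        self.rng = random.Random(seed)

    def select_action(self, state):
        return self.rng.randrange(self.n_actions)

    def step(self, state, action, reward, next_state, done):
        pass  # a learning agent would update its Q-table here


def run_episode(env, agent, max_steps=200):
    """Run one episode and return the undiscounted sum of rewards."""
    state = env.reset()
    total_reward = 0
    for _ in range(max_steps):
        action = agent.select_action(state)
        next_state, reward, done, _ = env.step(action)
        agent.step(state, action, reward, next_state, done)
        total_reward += reward
        state = next_state
        if done:
            break
    return total_reward
```

A training routine would call `run_episode` many times and record each return, which is the kind of data the benchmark below is computed from.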

Compare the performance of different agents:

compare(env, ["Q-learning", "SARSA", "Expected-SARSA"])

Both of these workflows are already implemented in the main.py file.

Environment Benchmark

OpenAI Gym defines "solving" this task as achieving an average return of 9.7 over 100 consecutive trials.
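
The criterion above can be checked from a list of per-episode returns. The helper names below (`best_average_return`, `is_solved`) are assumptions for illustration and are not taken from monitor.py; the sketch only shows the sliding-window arithmetic.

```python
from collections import deque


def best_average_return(episode_returns, window=100):
    """Best mean return over any `window` consecutive episodes.

    Returns negative infinity if fewer than `window` episodes were played.
    """
    recent = deque(maxlen=window)
    best = float("-inf")
    for episode_return in episode_returns:
        recent.append(episode_return)
        if len(recent) == window:
            best = max(best, sum(recent) / window)
    return best


def is_solved(episode_returns, threshold=9.7, window=100):
    """True if some run of `window` consecutive episodes meets the threshold."""
    return best_average_return(episode_returns, window) >= threshold
```

For example, `is_solved(returns)` on the list of returns collected during training reports whether any 100-episode stretch averaged at least 9.7.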

The following figure shows the performance of the different algorithms in this repo:

Anwarvic/taxi-v3-RL
