Skip to content

NijatZeynalov/Taxi-v3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Taxi-v3

This is a solution for Gym Taxi problem. This task was introduced to illustrate some issues in hierarchical reinforcement learning. There are 4 locations (labeled by different letters) and your job is to pick up the passenger at one location and drop him off in another. You receive +20 points for a successful dropoff, and lose 1 point for every timestep it takes. There is also a 10 point penalty for illegal pick-up and drop-off actions.

You are able to see the number of reward and dropouts over the episodes in the following graph: Image of Yaktocat