Hierarchical Online Planning and Reinforcement Learning on Taxi
Updated Oct 23, 2017 - C++
Solution for the Taxi environment using hierarchical reinforcement learning (HRL) (2018)
In this paper we redefine MAXQ and the Taxi environment and implement them in R. We then apply Q-learning to the same problem. Our conclusion is that MAXQ works as well as Q-learning for this problem. Our aim is to illustrate the advantages of using hierarchical reinforcement learning methods.
Inducing hierarchy via models and trajectories