Skip to content

deloroy/Reinforcement-Learning

About

Project : Subgoal Discovery on the Taxi Domain with Macro Q-Learning. Practicals : Q-Leaning, Bandits (MAB, LinUCB), Approximate RL. For the RL course (Master MVA).

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors