Reinforcement Learning Algorithms for Episodic MDPs With Finite SA-Spaces
Julia Jupyter Notebook
Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
notebooks
LICENSE
README.md
delayedql.jl
fhagents.jl
fhmdp.jl
mbie.jl
medianpac.jl
mormax.jl
oim.jl
psrl.jl
ubev.jl
ucfh.jl
ucrl2.jl

README.md

FiniteEpisodicRL.jl

Reinforcement Learning Algorithms for Episodic MDPs With Finite SA-Spaces.

Source code for the experiments in:

UBEV - A More Practical Algorithm for Episodic RL with Near-Optimal PAC and Regret Guarantees
Christoph Dann, Tor Lattimore, Emma Brunskill
https://arxiv.org/abs/1703.07710

For Python implementations of some of the algorithms, see https://github.com/iosband/TabulaRL/
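
For readers new to the setting, here is a minimal sketch of what an episodic MDP with finite state and action spaces can look like in Julia, together with optimal planning by backward induction. It is an illustration only: the type `TabularEpisodicMDP`, the function `backward_induction`, and the array layouts are assumptions made for this example and are not taken from `fhmdp.jl` or any other file in this repository.

```julia
# Hypothetical sketch; all names here are illustrative, not this repository's API.

# A tabular episodic MDP with S states, A actions and horizon H.
struct TabularEpisodicMDP
    P::Array{Float64,3}   # P[s2, s, a]: probability of moving from s to s2 under action a
    R::Matrix{Float64}    # R[s, a]: expected immediate reward
    H::Int                # number of time steps per episode
end

# Backward induction: returns optimal values V[s, t] and a greedy policy[s, t].
function backward_induction(m::TabularEpisodicMDP)
    S, A = size(m.R)
    V = zeros(S, m.H + 1)            # V[:, H + 1] stays 0 (value after the last step)
    policy = ones(Int, S, m.H)
    for t in m.H:-1:1
        for s in 1:S
            best_q, best_a = -Inf, 1
            for a in 1:A
                # Q-value: immediate reward plus expected value of the next state
                q = m.R[s, a]
                for s2 in 1:S
                    q += m.P[s2, s, a] * V[s2, t + 1]
                end
                if q > best_q
                    best_q, best_a = q, a
                end
            end
            V[s, t] = best_q
            policy[s, t] = best_a
        end
    end
    return V, policy
end

# Two-state, two-action example: action 1 stays put, action 2 switches state.
P = zeros(2, 2, 2)
P[1, 1, 1] = 1.0; P[2, 2, 1] = 1.0   # action 1: stay
P[2, 1, 2] = 1.0; P[1, 2, 2] = 1.0   # action 2: switch
R = [0.0 0.0; 1.0 1.0]               # reward 1 for any action taken in state 2
mdp = TabularEpisodicMDP(P, R, 3)
V, policy = backward_induction(mdp)
```

The learning algorithms in this repository work without knowing `P` and `R` in advance; the sketch assumes both are known, so it covers only the planning step for a fully specified model.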
