Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees

Official implementation of the OPSRL algorithm and baselines from the paper D. Tiapkin et al., "Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees". The algorithms are implemented in the folder algorithms/, and their parameters are stored in the folder config/.

Requirements:

Python 3.8
rlberry 0.2.1
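
Before running the experiments, it can help to confirm that the local environment matches the versions listed above. The following is a minimal sketch, not part of the repository: the helper name `check_requirements` is hypothetical, and it assumes rlberry was installed under the distribution name `rlberry`. It only reports mismatches and does not claim that other versions fail.

```python
import sys
from importlib import metadata

# Versions listed in the Requirements section of this README.
REQUIRED_PYTHON = (3, 8)
REQUIRED_RLBERRY = "0.2.1"


def check_requirements():
    """Return a list of human-readable mismatches (empty if everything matches).

    Hypothetical helper for illustration; it is not shipped with the repository.
    """
    problems = []

    # Compare only the major.minor part of the interpreter version.
    found_python = sys.version_info[:2]
    if found_python != REQUIRED_PYTHON:
        problems.append(
            "Python {}.{} found, README lists {}.{}".format(
                found_python[0], found_python[1], *REQUIRED_PYTHON
            )
        )

    # Look up the installed rlberry version, if any (assumed distribution name).
    try:
        installed = metadata.version("rlberry")
    except metadata.PackageNotFoundError:
        problems.append("rlberry is not installed")
    else:
        if installed != REQUIRED_RLBERRY:
            problems.append(
                "rlberry {} found, README lists {}".format(installed, REQUIRED_RLBERRY)
            )

    return problems


for problem in check_requirements():
    print("Warning:", problem)
```

An empty result means the interpreter and rlberry versions match the ones listed in this README.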

Run the opsrl_vs_baselines experiment and generate the plots

    python run.py config/experiments/opsrl_vs_baselines.yaml
    python plot_opsrl_vs_baselines.py

Run the opsrl_samples experiment and generate the plots

    python run.py config/experiments/opsrl_samples.yaml
    python plot_opsrl_samples.py

Run the opsrl_prior experiment and generate the plots

    python run.py config/experiments/opsrl_prior.yaml
    python plot_opsrl_prior.py
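
The three experiments above follow the same two-step pattern: run.py with the experiment's YAML file, then the matching plot script. As a convenience, they could be chained from a single script. This is a sketch under stated assumptions, not part of the repository: `build_commands` and `run_all` are hypothetical helpers, and the script is assumed to be launched from the repository root so that the relative paths resolve.

```python
import subprocess
import sys

# Experiment names taken from the commands listed in this README.
EXPERIMENTS = ["opsrl_vs_baselines", "opsrl_samples", "opsrl_prior"]


def build_commands(name, python=sys.executable):
    """Return the two commands the README lists for one experiment.

    Hypothetical helper: first the run step, then the plotting step.
    """
    return [
        [python, "run.py", "config/experiments/{}.yaml".format(name)],
        [python, "plot_{}.py".format(name)],
    ]


def run_all(experiments=EXPERIMENTS):
    """Run every experiment and its plot script in order.

    check=True stops at the first command that exits with an error,
    so a plot script never runs on a failed experiment.
    """
    for name in experiments:
        for cmd in build_commands(name):
            print("Running:", " ".join(cmd))
            subprocess.run(cmd, check=True)
```

Calling `run_all()` from the repository root launches the experiments one after another; passing a shorter list, for example `run_all(["opsrl_prior"])`, restricts the run to a single experiment.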

Repository: d-tiapkin/optimistic-psrl-experiments
