Python Implementation of 'A Tutorial on Thompson Sampling'

Python replication for 'A Tutorial on Thompson Sampling', written by by Daniel J. Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband and Zheng Wen in 2018.

If you have any confusion about the code or want to report a bug, please open an issue instead of emailing me directly.

The article,'A Tutorial on Thompson Sampling',also shown as 'TS_tutorial' in file, covers the algorithm and its application, illustrating concepts through a range of examples, including Bernoulli bandit problems, shortest path problems, productrecommendation, assortment, active learning with neuralnetworks, and reinforcement learning in Markov decisionprocesses. Most of these problems involve complex information structures, where information revealed by taking anaction informs beliefs about other actions. We will also discuss when and why Thompson sampling is or is not effectiveand relations to alternative algorithms.

All of the coding aim to promote the study of TS. Hopefully, it helps all of TS learner, including me, to grap the knowledge faster.

2019.7.18

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
material		material
sample_code		sample_code
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TS_Tutorial.pdf		TS_Tutorial.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

material

material

sample_code

sample_code

.gitattributes

.gitattributes

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

TS_Tutorial.pdf

TS_Tutorial.pdf

Repository files navigation

Python Implementation of 'A Tutorial on Thompson Sampling'

About

Releases

Packages

Languages

License

sou350121/A-Tutorial-on-Thompson-Sampling

Folders and files

Latest commit

History

Repository files navigation

Python Implementation of 'A Tutorial on Thompson Sampling'

About

Resources

License

Stars

Watchers

Forks

Languages