Written by Shu Ishida
This project was developed as part of a coursework assignment comparing different bandit algorithms.
It implements the explore-exploit, optimal explore-exploit, epsilon-greedy, successive elimination, UCB1 and UCB2 algorithms.
Completed experiments are stored as pickle files. Make a directory called data
to store them:
git clone https://github.com/c16192/Multi-Armed-Bandit.git
cd Multi-Armed-Bandit
mkdir data
python main.py --exp <experiment number> --bandit <type of bandit>
main.py takes other optional arguments, which can be checked by running the following:
python main.py -h
Experiment numbers are as follows:

0. Explore-exploit algorithm
1. Optimal explore-exploit algorithm
2. Epsilon-greedy algorithm
3. Successive elimination algorithm
4. UCB1 algorithm
5. UCB2 algorithm
6. Comparing all of the above
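To illustrate the kind of algorithm being compared, here is a minimal epsilon-greedy sketch. This is hypothetical code, not taken from the repository; the function name, arguments and arm representation are all assumptions.

```python
import random

def epsilon_greedy(arms, pulls, epsilon=0.1):
    """Minimal epsilon-greedy bandit loop (illustrative sketch).

    arms: list of callables, each returning a sampled reward.
    pulls: total number of arm pulls.
    epsilon: probability of exploring a random arm.
    """
    counts = [0] * len(arms)      # pulls per arm
    sums = [0.0] * len(arms)      # cumulative reward per arm
    total = 0.0
    for _ in range(pulls):
        if random.random() < epsilon:
            i = random.randrange(len(arms))   # explore: random arm
        else:
            # exploit: arm with the highest empirical mean
            # (unpulled arms get +inf so each is tried at least once)
            means = [s / c if c else float("inf")
                     for s, c in zip(sums, counts)]
            i = means.index(max(means))
        reward = arms[i]()
        counts[i] += 1
        sums[i] += reward
        total += reward
    return total, counts
```

With a small epsilon, most pulls concentrate on the arm with the best empirical mean, while the remaining fraction keeps exploring the alternatives.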
Types of bandits are:
- bernoulli (default): bandit arms have Bernoulli-distributed rewards
- normal: bandit arms have Gaussian-distributed rewards
- bernoulli periodic: the success probability of the Bernoulli distribution oscillates as a sinusoid
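The three reward models could be sketched as follows. This is a hypothetical illustration; the repository's actual class names and parameters may differ.

```python
import math
import random

def bernoulli_arm(p):
    """Reward 1 with probability p, else 0."""
    return lambda: 1.0 if random.random() < p else 0.0

def normal_arm(mu, sigma):
    """Gaussian-distributed reward with mean mu and std sigma."""
    return lambda: random.gauss(mu, sigma)

def periodic_bernoulli_arm(p0, amplitude, period):
    """Bernoulli arm whose success probability oscillates sinusoidally
    around p0 with the given amplitude and period (in pulls)."""
    t = [0]  # pull counter held in a mutable cell
    def pull():
        t[0] += 1
        p = p0 + amplitude * math.sin(2 * math.pi * t[0] / period)
        return 1.0 if random.random() < p else 0.0
    return pull
```

The periodic variant makes the best arm change over time, which is what distinguishes it from the stationary Bernoulli and Gaussian bandits.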
Once the experiments have been run, the results are stored as pickle files under the data
directory. While running an experiment can take some time, plotting the saved results is quick:
python main.py --plot .\data\<path to pickle file>.p
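For reference, saving and reloading results with pickle amounts to the following sketch. The function names, file path and result structure here are assumptions, not the repository's actual code.

```python
import pickle

def save_results(results, path):
    """Serialise an experiment's results to a pickle file."""
    with open(path, "wb") as f:
        pickle.dump(results, f)

def load_results(path):
    """Load previously saved results back into Python objects."""
    with open(path, "rb") as f:
        return pickle.load(f)
```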