multi_player_multi_armed_bandit_algorithms

This repository lets you benchmark state-of-the-art Multi-Player Multi-Armed Bandit algorithms. Implemented algorithms:

Cooperative and Stochastic Multi-Player Multi-Armed Bandit: Optimal Regret With Neither Communication Nor Collisions, Sébastien Bubeck, Thomas Budzinski, Mark Sellke
SIC-MMAB, SIC-MMAB2 and DYN-MMAB algorithms from SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits, Etienne Boursier, Vianney Perchet
EC-SIC from Decentralized Multi-player Multi-armed Bandits with No Collision Information, Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
First and second algorithms from Multiplayer bandits without observing collision information, Gabor Lugosi, Abbas Mehrabian
MCTopM, SelfishUCB from Multi-Player Bandits Revisited, Lilian Besson, Emilie Kaufmann
Musical Chairs from Multi-Player Bandits -- a Musical Chairs Approach, Jonathan Rosenski, Ohad Shamir, Liran Szlak
Randomized SelfishUCB from A High Performance, Low Complexity Algorithm for Multi-Player Bandits Without Collision Sensing Information, Cindy Trinh, Richard Combes
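
To make the setting behind these algorithms concrete, here is a minimal sketch of a multi-player bandit simulation without collision sensing. It is not the code from this repository. The function name `simulate_selfish_ucb`, its parameters, the Bernoulli reward model and the UCB1 index are assumptions chosen for illustration, in the spirit of a selfish strategy where each player runs a single-player index policy on the rewards it observes. A player that collides receives reward 0 and cannot tell a collision from an unlucky draw.

```python
import numpy as np


def simulate_selfish_ucb(mu, n_players, horizon, seed=0):
    """Illustrative simulation (hypothetical helper, not the repository's API).

    mu: mean rewards of the arms (Bernoulli rewards are assumed).
    n_players: number of players, assumed to be at most len(mu).
    Returns the cumulative pseudo-regret after `horizon` rounds.
    """
    rng = np.random.default_rng(seed)
    mu = np.asarray(mu, dtype=float)
    n_arms = mu.size
    players = np.arange(n_players)
    pulls = np.zeros((n_players, n_arms))  # per-player pull counts
    sums = np.zeros((n_players, n_arms))   # per-player observed reward sums
    expected_reward = 0.0

    for t in range(1, horizon + 1):
        # UCB1 index computed by each player from its own observations only.
        with np.errstate(divide="ignore", invalid="ignore"):
            index = sums / pulls + np.sqrt(2.0 * np.log(t) / pulls)
        index[pulls == 0] = np.inf  # force one pull of every arm

        # Ties are broken at random so players do not move in lockstep.
        choices = np.array(
            [rng.choice(np.flatnonzero(row == row.max())) for row in index]
        )

        # A player earns the arm's reward only if it is alone on that arm.
        counts = np.bincount(choices, minlength=n_arms)
        alone = counts[choices] == 1
        draws = rng.random(n_arms) < mu
        rewards = draws[choices] & alone

        pulls[players, choices] += 1
        sums[players, choices] += rewards
        expected_reward += mu[choices][alone].sum()

    # Benchmark: the players occupy the n_players best arms without colliding.
    best = np.sort(mu)[-n_players:].sum() * horizon
    return best - expected_reward
```

Calling `simulate_selfish_ucb([0.9, 0.8, 0.7, 0.2, 0.1], n_players=3, horizon=2000)` returns the pseudo-regret of one run. Averaging over several seeds gives a smoother estimate.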

This is the code attached to the following paper: A High Performance, Low Complexity Algorithm for Multi-Player Bandits Without Collision Sensing Information, Cindy Trinh, Richard Combes. (https://arxiv.org/abs/2102.10200)

Requirements

Python3
numpy, matplotlib
(Optional) To improve speed, compile the Cython version of the KL-UCB index computation:

cd multi_player_multi_armed_bandits/algorithms/cklucb
python setup.py build_ext --inplace
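
For readers unfamiliar with the quantity being accelerated, the following is a minimal pure-Python sketch of a KL-UCB index for Bernoulli rewards, computed by bisection. It is not the repository's `cklucb` code. The names `bernoulli_kl` and `klucb_index`, the exploration level `log(t) / pulls` and the `precision` parameter are assumptions made for illustration.

```python
import math


def bernoulli_kl(p, q, eps=1e-15):
    """KL divergence between Bernoulli(p) and Bernoulli(q), clipped away from 0 and 1."""
    p = min(max(p, eps), 1 - eps)
    q = min(max(q, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))


def klucb_index(mean, pulls, t, precision=1e-6):
    """Largest q in [mean, 1] such that pulls * kl(mean, q) <= log(t).

    Assumption: the exploration function is log(t); other choices exist.
    """
    if pulls == 0:
        return 1.0  # an arm never pulled gets the maximal index
    level = math.log(max(t, 1)) / pulls
    lo, hi = mean, 1.0
    # kl(mean, q) increases with q on [mean, 1], so bisection applies.
    while hi - lo > precision:
        mid = (lo + hi) / 2
        if bernoulli_kl(mean, mid) > level:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2
```

The bisection loop runs once per arm and per round, which is why a compiled version can noticeably speed up long simulations.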

How to run

To reproduce the results of any section of the paper, run the corresponding script. For example, to reproduce the figures for linearly spaced mu (script paper_3_1_linearly_spaced_mu.py in this repository), run:

cd multi_player_multi_armed_bandits
python paper_3_1_linearly_spaced_mu.py

This saves the results into the folders code/results and code/results_plots.
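
To reproduce several experiments in one go, a small driver script can help. The sketch below is an assumption about how one might do this, not part of the repository. The helpers `find_paper_scripts` and `run_scripts` are hypothetical, and the sketch assumes that every script matching `paper_*.py` can be run on its own from the folder that contains it.

```python
import subprocess
import sys
from pathlib import Path


def find_paper_scripts(folder="."):
    """Return the paper_*.py scripts found in `folder`, sorted by name."""
    return sorted(Path(folder).glob("paper_*.py"))


def run_scripts(scripts):
    """Run each script with the current interpreter, stopping at the first failure."""
    for script in scripts:
        print(f"Running {script.name}")
        # check=True raises CalledProcessError if the script exits with an error.
        subprocess.run([sys.executable, script.name], cwd=script.parent, check=True)
```

Calling `run_scripts(find_paper_scripts())` from the folder that holds the scripts runs them one after another. Some experiments may take a long time, so running a single script first is a reasonable check.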

Folders and files

algorithms
environment
simulations
README.md
paper_2_2_rndsucb_sucb.py
paper_3_1_bubeck.py
paper_3_1_linearly_spaced_mu.py
paper_3_2_cumregret_wrt_M.py
paper_3_2_cumregret_wrt_delta.py
paper_3_2_cumregret_wrt_mu_K.py
paper_3_3_corner_case.py
paper_4_comparison_with_collision.py
paper_5_1_dynamic_quasi_asynchronicity.py
paper_5_2_dynamic_leaving.py
