Code for Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms

This repository contains the Python code used for the runtime and regret experiments in the paper.

Code Structure

➤ The code in multimodalbandits.py is organized in the following order:

1. Helpful auxiliary functions
2. Main DP algorithm
3. Improved DP algorithm (see Appendix E in the paper)
4. Subgradient descent procedure
5. OSSB implementation
6. Experiments from the paper

➤ Set the flags RUNTIME_EXPERIMENT, RUNTIME_IMPROVED_DP_EXPERIMENT, and REGRET_EXPERIMENT to True to run the experiments of Appendix A.2, Appendix E.8, and Section 6, respectively.
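
As a rough illustration of how such flags select experiments, here is a minimal sketch. Only the three flag names and their paper sections come from this README; the helper `selected_experiments` and the dictionary layout are hypothetical and are not taken from the repository's code.

```python
# Flag names and paper sections come from the README above.
# Everything else in this sketch is hypothetical.
RUNTIME_EXPERIMENT = False              # Appendix A.2
RUNTIME_IMPROVED_DP_EXPERIMENT = False  # Appendix E.8
REGRET_EXPERIMENT = True                # Section 6

# Maps each flag name to the part of the paper it reproduces.
SECTIONS = {
    "RUNTIME_EXPERIMENT": "Appendix A.2",
    "RUNTIME_IMPROVED_DP_EXPERIMENT": "Appendix E.8",
    "REGRET_EXPERIMENT": "Section 6",
}


def selected_experiments(flags):
    """Return the paper sections whose flag is set to True (hypothetical helper)."""
    return [section for name, section in SECTIONS.items() if flags.get(name, False)]


if __name__ == "__main__":
    flags = {
        "RUNTIME_EXPERIMENT": RUNTIME_EXPERIMENT,
        "RUNTIME_IMPROVED_DP_EXPERIMENT": RUNTIME_IMPROVED_DP_EXPERIMENT,
        "REGRET_EXPERIMENT": REGRET_EXPERIMENT,
    }
    for section in selected_experiments(flags):
        print(f"Would run the experiments of {section}")
```

In the actual script, the flags are presumably edited directly in the source file before running it; check the file itself for where they are defined.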

Every function has a docstring, and further documentation is available in the "docs" folder.
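
Docstrings can also be browsed from Python without opening the source. The sketch below is one possible way to do this with the standard `inspect` module; the helper `docstring_summaries` is hypothetical, and the standard-library `json` module is used as a stand-in so the example runs without the repository.

```python
import inspect
import json  # stand-in module so the sketch runs without the repository


def docstring_summaries(module):
    """Map each function defined in `module` to the first line of its docstring."""
    summaries = {}
    for name, func in inspect.getmembers(module, inspect.isfunction):
        # Skip functions that were only imported into the module.
        if func.__module__ != module.__name__:
            continue
        doc = inspect.getdoc(func) or ""
        summaries[name] = doc.splitlines()[0] if doc else "(no docstring)"
    return summaries


if __name__ == "__main__":
    for name, summary in docstring_summaries(json).items():
        print(f"{name}: {summary}")
```

The same helper should work on the repository's module once it is importable. Note that importing a script may execute any experiments whose flags are set to True, so it is safer to leave the flags at False first.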

License

MIT License.

Contact

Feel free to contact the authors:

William Réveillard wilrev@kth.se

Richard Combes richard.combes@centralesupelec.fr
