Contextual Combinatorial Multi-output GP Bandits with Group Constraints

This repository is the official implementation of Contextual Combinatorial Multi-output GP Bandits with Group Constraints, published at TMLR.

Requirements

To install requirements:

pip install -r requirements.txt

We use the gpflow library for all GP-related computations and gpflow uses tensorflow.

Running the simulations

We ran a total of two simulations, one presented in the main paper and one presented in the supplemental. Moreover, none of the algorithms that we implement and test do offline-learning, thus there is no 'training' to be done. However, to be able to repeat the simulations and also improve speed, we first generate the arm contexts, rewards, and other setup-related information and save them as HDF5. We provide the links to the generated datasets that we used at the bottom of this README file. By default, when you run the script (main.py), it re-generates new datasets and runs the simulations on them.

Main paper simulations (privacy aware federated learning and caching-aware movie recommendation)

To run the main paper simulations, provide the argument main to the main.py script.

python main.py main

and to run the main paper simulations using pre-generated datasets , which must be in the root directory, use

python main.py main --use_saved_dataset

Supplemental simulations I (Different zeta)

To run the first supplemental simulation, use the argument supp_1.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Contextual Combinatorial Multi-output GP Bandits with Group Constraints

Requirements

Running the simulations

Main paper simulations (privacy aware federated learning and caching-aware movie recommendation)

Supplemental simulations I (Different zeta)

Files

README.md

Latest commit

History

README.md

File metadata and controls

Contextual Combinatorial Multi-output GP Bandits with Group Constraints

Requirements

Running the simulations

Main paper simulations (privacy aware federated learning and caching-aware movie recommendation)

Supplemental simulations I (Different zeta)