This is the numerical simulation code for SquareCBwK, the main algorithm in our paper Optimal Contextual Bandits with Knapsacks under Realizability
via Regression Oracles, which was accepted at AISTATS 2023. We present simulation results for linear CBwK to demonstrate the dependence on the time horizon (