EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits

In this repository, we provide one implementation of EE-Net, where the linear decision-maker (e.g., f1 + f2 ) and neural decision-maker (f3) are provided respectively. For the exploration network, one-layer CNN is used to reduce the dimensionality of gradient of exploitation network.

Run:

Run EE-Net on Mnist:

python EENet_run.py

Run baselines on Mnist:

python baselines/baselines_run.py

Prerequisites:

python 3.8.8, CUDA 11.2, torch 1.9.0, torchvision 0.10.0, sklearn 0.24.1, numpy 1.20.1, scipy 1.6.2, pandas 1.2.4

Hyper-parameters

dim: dimensionality of arm context vector

n_arm: number of arms.

pooling_step_size: aggregation size for the gradient, and the aggregated gradient will be the input of f2

hidden: width of all neural networks

If you use the codes of this repository, please kindly cite the following papers:

@inproceedings{ban2022eenet,
title={{EE}-Net: Exploitation-Exploration Neural Networks in Contextual Bandits},
author={Yikun Ban and Yuchen Yan and Arindam Banerjee and Jingrui He},
booktitle={International Conference on Learning Representations},
year={2022},
url={https://openreview.net/forum?id=X_ch3VrNSRg}
}

@article{ban2023neural,
  title={Neural exploitation and exploration of contextual bandits},
  author={Ban, Yikun and Yan, Yuchen and Banerjee, Arindam and He, Jingrui},
  journal={arXiv preprint arXiv:2305.03784},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
baselines		baselines
data		data
results		results
EENet.py		EENet.py
EENetClass.py		EENetClass.py
EENet_run.py		EENet_run.py
README.md		README.md
__init__.py		__init__.py
plot_results.py		plot_results.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits

Run:

Prerequisites:

Hyper-parameters

About

Releases

Packages

Languages

banyikun/EE-Net-ICLR-2022

Folders and files

Latest commit

History

Repository files navigation

EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits

Run:

Prerequisites:

Hyper-parameters

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages