Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs (ϵ-MATS)

[AAAI 2024 Oral]

Tianyuan Jin^* · Hao-Lun Hsu^† · William Chang^‡ · Pan Xu^†

^* National University of Singapore · ^† Duke University · ^‡ University of California, Los Angeles

Official implementation of the paper "Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs (ϵ-MATS)". At each step, ϵ-MATS performs MATS exploration with probability ε and greedy exploitation with probability 1 − ε.
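The mixing rule described above can be illustrated with a minimal sketch. It is not the code in this repository: the function names `local_estimates` and `select_joint_action`, the Beta(1, 1) prior for Bernoulli rewards, the single coin flip per call, the 0.5 default for unpulled local arms, and the brute-force search over joint actions are all assumptions made for illustration.

```python
import itertools

import numpy as np


def local_estimates(successes, failures, epsilon, rng):
    """Estimate the mean reward of every local arm of one group.

    With probability epsilon the estimates are Beta posterior samples
    (Thompson-style exploration); otherwise they are empirical means
    (greedy exploitation). Flipping the coin once per call is an assumption.
    """
    successes = np.asarray(successes, dtype=float)
    failures = np.asarray(failures, dtype=float)
    if rng.random() < epsilon:
        # Explore: sample from Beta(1 + successes, 1 + failures).
        return rng.beta(1.0 + successes, 1.0 + failures)
    pulls = successes + failures
    # Exploit: empirical mean; unpulled local arms default to 0.5 (assumption).
    return np.where(pulls > 0, successes / np.maximum(pulls, 1.0), 0.5)


def select_joint_action(groups, estimates, n_agents, n_actions):
    """Pick the joint action that maximizes the sum of local estimates.

    groups[k] is a tuple of agent indices and estimates[k] is an array indexed
    by the actions of those agents. Brute force is only feasible for tiny
    problems and stands in for a proper graph-based maximization.
    """
    best_action, best_value = None, -np.inf
    for joint in itertools.product(range(n_actions), repeat=n_agents):
        value = sum(
            est[tuple(joint[i] for i in group)]
            for group, est in zip(groups, estimates)
        )
        if value > best_value:
            best_action, best_value = joint, value
    return best_action
```

Setting `epsilon=1.0` in this sketch samples from the posterior every round, while `epsilon=0.0` gives a purely greedy rule.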

Installation instructions

Dependencies

python==3.6
scipy >= 1.2.1
matplotlib >= 3.0.2
pandas >= 0.25.3
numpy >= 1.17.0

Example

# Activate the Anaconda virtual environment
source activate epsilon_mats
# Train on Bernoulli0101 using random exploration on 10 agents
python main.py --algo rd --env_name bernoulli --iter 2000 --seed 0 --n_agents 10

# Train on Poisson0101 using MATS (including different values of epsilon) with 20 agents
python main.py --algo all --env_name poisson --iter 2000 --seed 0 --n_agents 20

Citation

@inproceedings{Jin2024MATS,
  title={Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs},
  author={Jin, Tianyuan and Hsu, Hao-Lun and Chang, William and Xu, Pan},
  booktitle={Annual AAAI Conference on Artificial Intelligence (AAAI)},
  volume={38},
  number={11},
  pages={12956--12964},
  year={2024}
}

