Good-Action Identification

Code for the ICML 2021 paper "Lenient Regret and Good-Action Identification in Gaussian Process Bandits".

Copyright by the authors: Xu Cai, Selwyn Gomes and Jonathan Scarlett

Dependencies

Python 3
NumPy
SciPy
Scikit-Learn
Matplotlib
GPy
PyTorch (for quasi-random sequences)
pybox2d & pygame (for robot pushing)
xgboost (for XGBoost)

The noisy/noiseless experiment on synthetic/real-world functions

Input arguments for main.py:

function: Name of the test function; see good_action/utils.py for the available names
noisy: Whether observations are noisy or noiseless (the example below passes noiseless)
epsilon: The good-action threshold; float value
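To make the argument order concrete, here is a minimal sketch of how three positional arguments like these could be parsed. It is not taken from main.py: the `build_parser` helper is hypothetical, and the accepted value `noisy` for the second argument is an assumption (only `noiseless` appears in this README).

```python
import argparse


def build_parser():
    # Hypothetical parser for illustration; the real main.py may read
    # its arguments differently (for example directly from sys.argv).
    parser = argparse.ArgumentParser(
        description="Good-action identification experiment (sketch)"
    )
    parser.add_argument("function", type=str,
                        help="test function name, e.g. robot3")
    # Assumption: the two accepted values are 'noisy' and 'noiseless'.
    parser.add_argument("noisy", type=str, choices=["noisy", "noiseless"],
                        help="observation model")
    parser.add_argument("epsilon", type=float,
                        help="good-action threshold")
    return parser


# Parse the same arguments as the robot pushing example in this README.
args = build_parser().parse_args(["robot3", "noiseless", "4.5"])
print(args.function, args.noisy, args.epsilon)
```

Positional arguments keep the command line identical in shape to the example invocation, at the cost of requiring all three values in a fixed order.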

Output:

log file: Log of the run
query histories: .npy file storing the queried points and their observed values
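A saved query history can be inspected with NumPy. The sketch below is an illustration under stated assumptions, not the repository's own code: the file name `history_demo.npy` is hypothetical, the array layout (one row per query, point coordinates followed by the observed value in the last column) is assumed, and a "good" query is taken to be one whose value is at least epsilon. Check the actual output files before relying on any of this.

```python
import os
import tempfile

import numpy as np


def first_good_query(values, epsilon):
    """Return the 0-based index of the first value >= epsilon, or None."""
    hits = np.flatnonzero(np.asarray(values) >= epsilon)
    return int(hits[0]) if hits.size else None


# Synthetic stand-in for a saved history (assumed layout):
# columns 0-2 are a 3D query point, the last column is the observed value.
history = np.array([
    [0.1, 0.2, 0.3, 1.0],
    [0.4, 0.5, 0.6, 3.2],
    [0.7, 0.8, 0.9, 4.7],
    [0.2, 0.9, 0.1, 4.1],
])

# Round-trip through a .npy file, as the experiment output would be read.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "history_demo.npy")  # hypothetical file name
    np.save(path, history)
    loaded = np.load(path)

points, values = loaded[:, :-1], loaded[:, -1]
best_so_far = np.maximum.accumulate(values)  # running best observed value
print(best_so_far)
print(first_good_query(values, epsilon=4.5))
```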

For example:

Testing on the noiseless 3D robot pushing function with threshold 4.5:

python main.py robot3 noiseless 4.5

Visualization: Run plot.ipynb

The lenient regret experiment on a synthetic GP function

Input arguments for lenient.py:

epsilon: The good-action threshold; Float value; Default=0.9

Output:

lenient and standard regrets: saved as a .npy file
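To illustrate the difference between the two quantities, the sketch below computes cumulative standard regret alongside one simple lenient variant. This is an assumption-laden illustration rather than what lenient.py computes: the `cumulative_regrets` helper is hypothetical, the lenient penalty is an indicator that charges 1 for every query whose value falls below epsilon, and the function values and maximum are made up.

```python
import numpy as np


def cumulative_regrets(values, f_max, epsilon):
    """Cumulative standard regret and an indicator-style lenient regret.

    Assumption: a query is "good" when its function value is >= epsilon,
    and the lenient penalty is 1 for each query that is not good.
    """
    values = np.asarray(values, dtype=float)
    standard = np.cumsum(f_max - values)                   # sum of gaps to the maximum
    lenient = np.cumsum((values < epsilon).astype(float))  # count of bad queries
    return standard, lenient


# Made-up function values at five queried points, with maximum 1.0.
values = [0.2, 0.5, 0.95, 0.85, 1.0]
standard, lenient = cumulative_regrets(values, f_max=1.0, epsilon=0.9)
print(standard)
print(lenient)
```

Under this penalty, the lenient regret stops growing once every query is good, whereas the standard regret keeps growing until the exact maximizer is queried.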

For example:

python lenient.py 0.9

Visualization: Run plot_lenient.ipynb
