Implementing SLOPE++ for estimator selection #148

usaito · 2021-12-20T03:06:19Z

Overview

Implement SLOPE++, a method for tuning the hyperparameters of OPE estimators proposed by Tucker and Lee (2021), which improves on the vanilla SLOPE proposed by Su et al. (2020). This tuning method can be used by setting tuning_method="slope" when using the estimators implemented in obp.ope.estimators_tuning.
Implement several high-probability lower bound estimators for the mean of independent random variables. The implemented methods are Hoeffding, empirical Bernstein, and a bound based on Student's t distribution. Please see Thomas et al. (2015) for details.
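
To make the two items above concrete, here is a minimal, standard-library-only sketch. It is not the code added in this PR. The function names are hypothetical, samples are assumed to lie in [0, 1], and the selection rule shown is the basic interval-overlap rule of vanilla SLOPE (Su et al., 2020). The refined constants of SLOPE++ are not reproduced here. The Student's t bound is left out because it needs a t-distribution quantile, which the standard library does not provide.

```python
import math
import statistics


def hoeffding_lower_bound(x, delta=0.05):
    """Hoeffding lower bound on the mean of independent samples in [0, 1].

    Holds with probability at least 1 - delta.
    """
    n = len(x)
    return statistics.fmean(x) - math.sqrt(math.log(1.0 / delta) / (2.0 * n))


def empirical_bernstein_lower_bound(x, delta=0.05):
    """Empirical Bernstein (Maurer and Pontil) lower bound for samples in [0, 1].

    Uses the unbiased sample variance, so it needs at least two samples.
    """
    n = len(x)
    log_term = math.log(2.0 / delta)
    var = statistics.variance(x)  # sample variance (ddof = 1)
    return (
        statistics.fmean(x)
        - math.sqrt(2.0 * var * log_term / n)
        - 7.0 * log_term / (3.0 * (n - 1))
    )


def select_by_interval_overlap(estimates, widths):
    """Interval-overlap selection in the style of vanilla SLOPE (an assumption).

    `estimates[i]` and `widths[i]` describe candidate i, and candidates are
    assumed to be ordered from highest variance (least bias) to lowest variance.
    Returns the largest index i such that the intervals
    [estimates[j] - widths[j], estimates[j] + widths[j]] for all j <= i
    share at least one common point.
    """
    lo, hi = -math.inf, math.inf
    best = 0
    for i, (theta, width) in enumerate(zip(estimates, widths)):
        lo = max(lo, theta - width)
        hi = min(hi, theta + width)
        if lo > hi:  # the running intersection became empty
            break
        best = i
    return best


if __name__ == "__main__":
    rewards = [0.2, 0.4, 0.6, 0.8]
    print(hoeffding_lower_bound(rewards))
    print(empirical_bernstein_lower_bound(rewards))
    # Third interval [1.9, 2.1] does not overlap the first two.
    print(select_by_interval_overlap([1.0, 1.1, 2.0], [0.5, 0.2, 0.1]))
```

In this sketch the confidence widths are passed in directly. In practice they would come from a concentration bound such as the ones above, computed per candidate hyperparameter.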

Minor

Fix docstrings and error messages

References

George Tucker and Jonathan Lee.
"Improved Estimator Selection for Off-Policy Evaluation.", 2021.
https://lyang36.github.io/icml2021_rltheory/camera_ready/79.pdf
Yi Su, Pavithra Srinath, and Akshay Krishnamurthy.
"Adaptive Estimator Selection for Off-Policy Evaluation.", ICML2020.
http://proceedings.mlr.press/v119/su20d.html
Philip S. Thomas, Georgios Theocharous, and Mohammad Ghavamzadeh.
"High Confidence Off-Policy Evaluation.", AAAI2015.
https://www.cs.utexas.edu/~sniekum/classes/RLFD-F17/papers/Thomas2015.pdf
Philip S. Thomas, Georgios Theocharous, and Mohammad Ghavamzadeh.
"High Confidence Policy Improvement.", ICML2015.
http://proceedings.mlr.press/v37/thomas15.html

Co-authored-by: fullflu <9534465+fullflu@users.noreply.github.com>

Implementing Sub-Gaussian IPW and DR

Feature: implementing the Cascade-DR estimator for SlateOPE

aiueola and others added 30 commits June 17, 2021 18:48

rm conflict

d862f01

rm conflict

ec4ed68

Merge branch 'master' of github.com:aiueola/zr-obp

0c7c0aa

Merge branch 'master' of github.com:aiueola/zr-obp

4cd3c72

Merge remote-tracking branch 'upstream/master'

86a0999

implement cascade-dr

92f9099

implement cascade-dr

3d8914f

black

5edddd2

minor fix

4bd3ad8

minor fix

6e12418

minor fix

afcba42

fix docstrings

f4e9daa

fix flake8 err

5f9ba2e

fix flake8 err

cd6e788

fix docstrings based on review

f651d4d

fix docstrings based on review

7756759

fix docstrings based on review

5eacbd1

fix docstrings based on review

6680266

fix docstrings based on review

aee7f47

fix docstrings based on review

9543635

minor fix based on review

3e36337

Co-authored-by: fullflu <9534465+fullflu@users.noreply.github.com>

minor fix based on review

a225d7d

Co-authored-by: fullflu <9534465+fullflu@users.noreply.github.com>

minor fix based on review

3eb9c50

minor fix based on review

f611f64

fix test

1c63ece

fix docstrings

a54f729

refactor and test

ec9e62b

cascade-dr test

bef2087

black

f66fdad

fix test

03bd76b

aiueola and others added 23 commits November 20, 2021 17:03

fix flake8

dd6c933

Merge branch 'master' of https://github.com/st-tech/zr-obp

322cde3

fix test

26ab399

Merge branch 'master' into cascade-dr

be02c0e

fix flake8

190cd97

typo

269b2a2

fix test and docstrings

d19ecd3

impelement probability lower bounds

34cfa7a

implement SLOPE for hyperparam tuning of ope

dffcafb

add some tests

4ec3c8e

implement sub-gaussian estimators

3c63ee4

add corresponding tests

3000822

fix docs

ab6b477

fix tests

0659552

fix the weight of SGIPW/SGDR

e7cac29

fix conflicts

7fa343b

Merge pull request #149 from st-tech/feature/subgauss-ipw

8d869e8

Implementing Sub-Gaussian IPW and DR

Merge branch 'master' into cascade-dr

cc81663

Merge branch 'feature/estimator-selection' into cascade-dr

ee27faa

Merge pull request #142 from aiueola/cascade-dr

14af35c

Feature: implementing the Cascade-DR estimator for SlateOPE

fix lint

2dd970f

fix conflicts

e979df6

fix docs and error messages

4ff109d

usaito changed the title ~~[WIP] Implementing SLOPE++ for estimator selection~~ Implementing SLOPE++ for estimator selection Jan 12, 2022

usaito merged commit 4f075e9 into master Jan 12, 2022
