Feature: Implementing Continuous OPE Estimators #113

usaito · 2021-07-06T14:10:13Z

new feature

estimators_continuous.py

implement the following Continuous OPE estimators
implement some kernel functions
- triangular_kernel
- gaussian_kernel
- epanechnikov_kernel
- cosine_kernel

reference
Nathan Kallus and Angela Zhou.
"Policy Evaluation and Optimization with Continuous Treatments", AISTATS, 2018.

meta_continuous.py

implement ContinuousOffPolicyEvaluation that streamlines OPE with continuous actions. This works as follows.

# (1) Synthetic Data Generation
dataset = SyntheticContinuousBanditDataset(dim_context=5)
bandit_feedback = dataset.obtain_batch_bandit_feedback(
    n_rounds=10000, min_action_value=-10, max_action_value=10,
)

# (2) Off-Policy Evaluation
ope = ContinuousOffPolicyEvaluation(
    bandit_feedback=bandit_feedback,
    ope_estimators=[KernelizedIPW(kernel="epanechnikov", bandwidth=0.02)]
)
estimated_policy_value = ope.estimate_policy_values(
     action_by_evaluation_policy=action_by_evaluation_policy,
)

tests

add the following tests wrt the new features

1 and 2 include some performance tests of the continuous OPE estimators using synthetic data as well as input checks.
4 checks whether the kernel functions satisfy the conditions described on page 9 of the following lecture slides: http://ibis.t.u-tokyo.ac.jp/suzuki/lecture/2015/dataanalysis/L9.pdf

fix descriptions (mostly English expressions) of docstrings

…ontinuous-estimators

nomuramasahir0 · 2021-08-14T04:30:50Z

I'm just letting you know:
IDE (PyCharm) generates a non-default argument error, though I do not find any specific runtime error.

If you would like to avoid this error, it might be better to inherit BaseContinuousOffPolicyEstimator, rather than KernelizedInverseProbabilityWeighting.

nomuramasahir0 · 2021-08-14T06:27:51Z

Other than the above one point, LGTM!

usaito · 2021-08-14T07:56:54Z

@nmasahiro Thanks!

usaito added 3 commits July 6, 2021 23:08

implement continuous ope estimators

c0874d4

add tests of continuous ope estimators

bf2eaf9

add some check funcs for continuous ope

ea3b2f9

usaito changed the base branch from master to continuous-dataset July 6, 2021 14:12

usaito added 7 commits July 7, 2021 10:18

add tests of meta_continuous

ce63f55

implemente meta continuous

8d8cd53

black and flake8

66b8b80

add synthetic_continuous_bandit_feedback

6ee2fde

fix a bug

bee833c

add example code

786eeca

flake8

efab71d

usaito changed the title ~~[WIP] feature: Continuous OPE Estimators~~ Feature: Continuous OPE Estimators Jul 7, 2021

usaito added 2 commits July 8, 2021 10:56

Merge branch 'continuous-dataset' of github.com:st-tech/zr-obp into c…

82c5f30

…ontinuous-estimators

fix some tests to adjust the changes of SyntheticContinuousBanditDataset

0582f84

usaito changed the title ~~Feature: Continuous OPE Estimators~~ Feature: Implementing Continuous OPE Estimators Jul 8, 2021

usaito added 2 commits July 8, 2021 20:06

fix docstrings

e33345f

fix docs

acb0076

update based on review

4c3f8cc

usaito merged commit 1c8233a into continuous-dataset Aug 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature: Implementing Continuous OPE Estimators #113

Feature: Implementing Continuous OPE Estimators #113

usaito commented Jul 6, 2021 •

edited

Loading

nomuramasahir0 commented Aug 14, 2021 •

edited

Loading

nomuramasahir0 commented Aug 14, 2021

usaito commented Aug 14, 2021

Feature: Implementing Continuous OPE Estimators #113

Feature: Implementing Continuous OPE Estimators #113

Conversation

usaito commented Jul 6, 2021 • edited Loading

new feature

estimators_continuous.py

meta_continuous.py

tests

nomuramasahir0 commented Aug 14, 2021 • edited Loading

nomuramasahir0 commented Aug 14, 2021

usaito commented Aug 14, 2021

usaito commented Jul 6, 2021 •

edited

Loading

nomuramasahir0 commented Aug 14, 2021 •

edited

Loading