
[WIP] GBTRegressor does not provide uncertainty estimates #9

Merged

merged 3 commits into master from trees on Mar 28, 2016

Conversation

@betatim (Member) commented Mar 23, 2016

Started a simple wrapper for regressors like GBTRegressor to provide uncertainty estimates as well
as central predictions.

Any ideas on how to test that this indeed gives the central prediction and the 68% confidence interval (right terminology?)? The best idea so far is to use a very large number of samples on a simple problem and check that the predicted standard deviation is approximately equal to the std used when sampling; a sketch of that is below.
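
A minimal sketch of that test idea (names, sizes, and the tolerance are placeholders; it assumes a plain `GradientBoostingRegressor` with `loss="quantile"` as the underlying quantile estimator):

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.RandomState(0)
true_std = 0.2

# simple problem: constant target plus Gaussian noise with known std
X = rng.uniform(0, 5, size=(10000, 1))
y = 1.0 + true_std * rng.randn(len(X))

# the 16th/84th percentiles of a Gaussian sit one std either side of the mean
low = GradientBoostingRegressor(loss="quantile", alpha=0.16).fit(X, y)
high = GradientBoostingRegressor(loss="quantile", alpha=0.84).fit(X, y)

predicted_std = (high.predict(X) - low.predict(X)) / 2.0
assert abs(predicted_std.mean() - true_std) < 0.05
```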

Todo:

  • documentation
  • check that GP uses same definition of uncertainty
  • smarter tests

    # noise level depends on value of `X`
    return np.abs(2.5 - X) / 2.5 + 0.1

def sample_noise(X, std=0.2, noise=constant_noise):
@betatim (Member, Author) commented on these lines:

should pass in an rng / use check_random_state

Wrap a GBTRegressor to provide uncertainty estimates as well
as central predictions

def constant_noise(X):
    return np.ones_like(X)

def sample_noise(X, std=0.2, noise=constant_noise):
@betatim (Member, Author) commented on this line:

use the random_state pattern instead of relying on the global rng
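
That pattern, sketched (only the signature of `sample_noise` appears in the diff above; the body here is illustrative):

```python
import numpy as np
from sklearn.utils import check_random_state

def constant_noise(X):
    return np.ones_like(X)

def sample_noise(X, std=0.2, noise=constant_noise, random_state=None):
    # check_random_state accepts None, an int seed, or a RandomState
    # instance, so callers control reproducibility explicitly
    rng = check_random_state(random_state)
    return std * noise(X) * rng.randn(*X.shape)
```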

@glouppe (Member) commented Mar 23, 2016

I am not fond of the nomenclature, std != uncertainty != quantile.

@betatim (Member, Author) commented Mar 23, 2016

I'm flexible, but out of ideas at the moment for a good (short) name.

GBTQuantileRegressor?

GBTQuantiles(quantiles=list_of_quantiles) with per_quantile_prediction = predict()? You set all the quantiles you want estimated and we return each one separately. For the 68% interval that would be GBTQuantiles([0.16, 0.5, 0.84]).
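
A rough sketch of what that interface might look like (hypothetical names; it fits one `GradientBoostingRegressor` with `loss="quantile"` per requested quantile, which is also what the diff further down does):

```python
import numpy as np
from sklearn.base import BaseEstimator, RegressorMixin
from sklearn.ensemble import GradientBoostingRegressor

class GBTQuantiles(BaseEstimator, RegressorMixin):
    def __init__(self, quantiles=(0.16, 0.5, 0.84), random_state=None):
        self.quantiles = quantiles
        self.random_state = random_state

    def fit(self, X, y):
        # one quantile-loss GBT per requested quantile
        self.regressors_ = [
            GradientBoostingRegressor(loss="quantile", alpha=a,
                                      random_state=self.random_state)
            for a in self.quantiles]
        for rgr in self.regressors_:
            rgr.fit(X, y)
        return self

    def predict(self, X):
        # one column per quantile, in the order they were requested
        return np.column_stack([rgr.predict(X)
                                for rgr in self.regressors_])
```

`GBTQuantiles((0.16, 0.5, 0.84)).fit(X, y).predict(X)` would then return an (n_samples, 3) array: lower, central, and upper predictions.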

@MechCoder (Member) commented

I'm planning to review my GradientBoosting know-how this weekend. I can have a look after that.

@betatim (Member, Author) commented Mar 24, 2016

If you need this, merge it; I'm away till Monday night without internet.

                random_state=rng)
            for a in self.quantiles]
        for rgr in self.regressors_:
            rgr.fit(X, y)
A reviewer (Member) commented on these lines:

return self
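
That is, the standard scikit-learn convention of ending `fit` with `return self`, which is what lets calls chain (using the hypothetical `GBTQuantiles` sketched earlier; `X_train` etc. are placeholders):

```python
est = GBTQuantiles().fit(X_train, y_train)  # fit(...) hands back the estimator
y_quantiles = est.predict(X_test)
```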

@MechCoder (Member) commented

@glouppe @betatim
How do we compute the standard deviation of the predictions given the quantiles? There is no assumption about the conditional distribution of Y given X, right?
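
(For what it's worth: under a Gaussian assumption on Y given X, which the PR indeed does not make, the 16th and 84th percentiles sit one standard deviation either side of the mean, so a sketch would be:)

```python
# valid only if Y | X is assumed Gaussian; pred_16 and pred_84 are
# hypothetical arrays of 16th/84th-percentile predictions
std_estimate = (pred_84 - pred_16) / 2.0
```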

@betatim (Member, Author) commented Mar 28, 2016

The last item on the todo list ("check definition of uncertainty") is something I'd like to punt to #23 instead of waiting to merge this until we converge.

@MechCoder (Member) commented

This looks ok to me.

@MechCoder merged commit fc07663 into scikit-optimize:master on Mar 28, 2016
@betatim deleted the trees branch on March 28, 2016 16:46
@betatim (Member, Author) commented Mar 28, 2016

📈

@glouppe (Member) commented Mar 29, 2016

It would be good to expose the base_estimator in this newly introduced class. The default params of GradientBoostingRegressor are very likely to give poor results.
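
A sketch of how that could look (hypothetical; it assumes the supplied estimator exposes its quantile as an `alpha` parameter, as `GradientBoostingRegressor` does):

```python
from sklearn.base import BaseEstimator, RegressorMixin, clone
from sklearn.ensemble import GradientBoostingRegressor

class GBTQuantiles(BaseEstimator, RegressorMixin):
    def __init__(self, base_estimator=None, quantiles=(0.16, 0.5, 0.84)):
        self.base_estimator = base_estimator
        self.quantiles = quantiles

    def fit(self, X, y):
        base = self.base_estimator
        if base is None:
            # bare defaults are rarely well tuned; callers can pass
            # a configured GradientBoostingRegressor instead
            base = GradientBoostingRegressor(loss="quantile")
        # clone so each quantile gets an independent copy of the settings
        self.regressors_ = [clone(base).set_params(alpha=a)
                            for a in self.quantiles]
        for rgr in self.regressors_:
            rgr.fit(X, y)
        return self
```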

holgern added a commit that referenced this pull request Jan 29, 2020