baseline reduction: separate learning of additive regression baseline #1336

albietz · 2017-10-05T20:45:56Z

This reduction allows a regression learner to separately learn an additive baseline prediction from only "constant" features (taken from the constant_namespace), and the residual on top of that. This seems to make it faster to learn a possibly large constant offset in practice.

cc @JohnLangford

JohnLangford · 2017-10-18T20:29:30Z

The windows barf here: https://ci.appveyor.com/project/JohnLangford/vowpal-wabbit/build/1.0.2255#L2307 is presumably because the windows build doesn't include the new file.

JohnLangford · 2017-10-18T20:39:48Z

vowpalwabbit/baseline.cc

+void predict_or_learn(baseline& data, base_learner& base, example& ec)
+{ if (is_learn)
+  { // do a full prediction, for safety in accurate predictive validation
+    base.predict(ec);


You can factor base.predict() out of the if/else for simplicity.

JohnLangford · 2017-10-18T20:42:21Z

This looks good to go other than the minor refactoring and fixing the windows build about here: https://github.com/JohnLangford/vowpal_wabbit/blob/master/vowpalwabbit/vw_dynamic.vcxproj#L436 . Can you tweak?

…t only

albietz · 2017-10-20T15:56:32Z

Some comments:

I added a learning rate multiplier based on the largest label magnitude seen so far. It seems like occasionally these get really large (e.g. I was getting values extremely large values at some point when using doubly robust estimates, even though labels were smaller than 10), hence the cap at 1000. It might be useful to allow to explicitly add the multiplier as a flag instead.
I added an option for using a separate example with a single global feature for the baseline (assuming the examples don't have that feature), which seems easier than fiddling with feature values if an example has other constant features other than the global. Perhaps I can use a separate namespace instead to avoid conflicts?

JohnLangford · 2017-11-12T19:22:40Z

Merged in, thanks.

albietz and others added 3 commits October 5, 2017 16:35

baseline: reduction for regression baseline from constant features

86e35fc

Merge branch 'master' into baseline

2880219

Merge branch 'master' into baseline

cdd08db

JohnLangford reviewed Oct 18, 2017

View reviewed changes

albietz added 4 commits October 19, 2017 14:54

baseline: fix windows build

7dbd310

cbify: options for loss values of success/failure

3ab0c44

baseline: scale baseline learning rate based on labels

104e33b

baseline: option for separate baseline prediction with global constan…

9ffdacd

…t only

JohnLangford added 2 commits November 12, 2017 12:45

Merge branch 'master' into baseline

0de84f1

Merge branch 'master' into baseline

9cb9002

JohnLangford merged commit fd259cd into VowpalWabbit:master Nov 12, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

baseline reduction: separate learning of additive regression baseline #1336

baseline reduction: separate learning of additive regression baseline #1336

albietz commented Oct 5, 2017

JohnLangford commented Oct 18, 2017

JohnLangford Oct 18, 2017

JohnLangford commented Oct 18, 2017

albietz commented Oct 20, 2017

JohnLangford commented Nov 12, 2017

baseline reduction: separate learning of additive regression baseline #1336

baseline reduction: separate learning of additive regression baseline #1336

Conversation

albietz commented Oct 5, 2017

JohnLangford commented Oct 18, 2017

JohnLangford Oct 18, 2017

Choose a reason for hiding this comment

JohnLangford commented Oct 18, 2017

albietz commented Oct 20, 2017

JohnLangford commented Nov 12, 2017