
Add Vowpal Wabbit estimators #2846

Merged · angela97lin merged 42 commits into main from wabbit_integration on Oct 25, 2021
Conversation

@angela97lin (Contributor) commented Sep 27, 2021

Reviving #770 as part of inno days experimentation.

(Closes #770)

Perf test results here on adding to AutoML: https://alteryx.atlassian.net/wiki/spaces/PS/pages/1059620252/Adding+Vowpal+Wabbit+to+AutoML

I don't think we should add this to AutoML quite yet, but these estimators are still worth having as components!

IIRC, we probably need to update the feedstock requirements to get the conda package tests to pass. I'll do that, but I wanted to put this up for review first 😁

Edit: it looks like vowpal wabbit 8.8.1 (the latest conda version) has a very different API from the latest release (8.11.0), which I based my code on. I filed VowpalWabbit/vowpal_wabbit#3406; otherwise, we may consider using the older API or holding off entirely.
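For readers unfamiliar with what these components wrap, here is a minimal sketch of the kind of wrapper the PR adds, assuming the 8.11-era vowpalwabbit.sklearn_vw scikit-learn interface and the constructor defaults shown later in this review (loss_function, learning_rate, decay_learning_rate). The module path, class name, and keyword arguments below are assumptions for illustration, not the PR's actual code:

    # A minimal sketch (not the PR's code) of wrapping vowpal wabbit's
    # scikit-learn interface. The module path `vowpalwabbit.sklearn_vw` and
    # the VWClassifier keyword arguments are assumptions based on the
    # 8.11-era API.
    import numpy as np
    import pandas as pd
    from vowpalwabbit.sklearn_vw import VWClassifier


    class VowpalWabbitBinaryClassifierSketch:
        """Illustrative wrapper; evalml's real component does much more."""

        def __init__(self, loss_function="logistic", learning_rate=0.5, decay_learning_rate=1.0):
            self.parameters = {
                "loss_function": loss_function,
                "learning_rate": learning_rate,
                "decay_learning_rate": decay_learning_rate,
            }
            self._vw = VWClassifier(**self.parameters)

        def fit(self, X, y):
            # VW expects numeric features; label encoding (e.g. to {-1, 1})
            # is assumed to happen elsewhere in the component.
            self._vw.fit(np.asarray(X, dtype=float), np.asarray(y))
            return self

        def predict(self, X):
            return pd.Series(self._vw.predict(np.asarray(X, dtype=float)))

Since the conda-available 8.8.1 release reportedly exposes a different Python API, an import like the one above would presumably fail there, which is the versioning issue the edit describes.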

@angela97lin angela97lin self-assigned this Sep 27, 2021

codecov bot commented Sep 30, 2021

Codecov Report

Merging #2846 (cf3195d) into main (8bc7823) will increase coverage by 0.1%.
The diff coverage is 98.4%.


@@           Coverage Diff           @@
##            main   #2846     +/-   ##
=======================================
+ Coverage   99.7%   99.7%   +0.1%     
=======================================
  Files        302     307      +5     
  Lines      28872   29049    +177     
=======================================
+ Hits       28781   28958    +177     
  Misses        91      91             
Impacted Files Coverage Δ
evalml/pipelines/__init__.py 100.0% <ø> (ø)
evalml/pipelines/components/__init__.py 100.0% <ø> (ø)
evalml/pipelines/components/estimators/__init__.py 100.0% <ø> (ø)
evalml/tests/conftest.py 98.3% <ø> (ø)
...alml/tests/model_family_tests/test_model_family.py 100.0% <ø> (ø)
...s/prediction_explanations_tests/test_explainers.py 100.0% <ø> (ø)
evalml/tests/utils_tests/test_dependencies.py 85.2% <ø> (ø)
evalml/utils/gen_utils.py 98.1% <ø> (ø)
evalml/tests/component_tests/test_utils.py 95.3% <25.0%> (ø)
evalml/model_family/model_family.py 100.0% <100.0%> (ø)
... and 9 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@chukarsten (Contributor) left a comment
This all looks well and good to me! I have to say, I'm a little out of the loop on this classifier/regressor; mostly I just thought "VowpalWabbit" looked odd sitting on the PR list. But this looks like a textbook new component addition. I think it would be very interesting to use this one as a guinea pig for @christopherbunn's modifications to the perf test report.

Comment on lines +63 to +65
raise NotImplementedError(
"Feature importance is not implemented for the Vowpal Wabbit classifiers."
)
Contributor:

I think what we normally do is return an array of 0s. I think this is fine for now, and ultimately the right thing, but we might have to revisit it when we get to AutoML integration.
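For illustration only (not code from the PR): a minimal sketch of the zeros-based fallback described above. The class and attribute names here are hypothetical.

    import numpy as np


    class _VWEstimatorSketch:
        """Hypothetical stand-in for the PR's VW components (illustration only)."""

        def __init__(self, num_features):
            # Assumed to be recorded during fit(); the name is made up for this sketch.
            self._num_features = num_features

        @property
        def feature_importance(self):
            # Alternative to raising NotImplementedError: return one zero per
            # input feature, matching the convention described above.
            return np.zeros(self._num_features)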

@angela97lin (Contributor, Author) commented:

Update: it looks like vowpal wabbit 8.8.1 (the latest conda version) has a very different API from the latest release (8.11.0), which I based my code on. I filed VowpalWabbit/vowpal_wabbit#3406; otherwise, we may consider using the older API or holding off entirely.

@angela97lin (Contributor, Author) commented:

Given the discrepancy above, we could either:

  • Ask and hope that VW releases a newer version on conda (seems unlikely), or
  • Make VW a pip-only package.

Prophet is already a pip-only package, so this isn't an entirely foreign concept. I like this idea until we find it important to add VW to AutoML or better integrate it with EvalML.
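For illustration only: one common way to keep a dependency pip-only is an optional extra, so the core conda package is unaffected. The setup.py snippet below is a hypothetical sketch, not EvalML's actual packaging:

    # setup.py (hypothetical sketch, not EvalML's real configuration)
    from setuptools import find_packages, setup

    setup(
        name="evalml",
        packages=find_packages(),
        # Core requirements remain conda-installable; vowpal wabbit is opt-in:
        #   pip install "evalml[vowpalwabbit]"
        extras_require={
            "vowpalwabbit": ["vowpalwabbit>=8.11.0"],
        },
    )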

@freddyaboulton (Contributor) left a comment

@angela97lin Thank you for reviving this! I think it's OK to keep it pip-only. I'm curious what the perf tests will look like for this estimator. I just went down a documentation rabbit hole, and there are a lot of options we can configure here.

self,
loss_function="logistic",
learning_rate=0.5,
decay_learning_rate=1.0,
Contributor:

Great question. It's hard to tell from the docs how this parameter is used, but my guess is that if the decay learning rate is 1, the learning rate won't decay. That seems fine as a default, especially if we let AutoML tune it.

However, since the number of passes defaults to 1, I don't think this parameter will take effect. @angela97lin Maybe we should also expose/tune the number-of-passes parameter?
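For illustration (not the PR's code): a sketch of exposing passes alongside the defaults shown in the snippet above, with tuning ranges declared via a hyperparameter_ranges class attribute as evalml components typically do. The range values are made-up placeholders:

    from skopt.space import Integer, Real


    class VowpalWabbitClassifierSketch:
        """Hypothetical component; the ranges below are illustrative guesses."""

        hyperparameter_ranges = {
            "learning_rate": Real(1e-7, 10.0),
            "decay_learning_rate": Real(1e-7, 1.0),
            "passes": Integer(1, 10),
        }

        def __init__(
            self,
            loss_function="logistic",
            learning_rate=0.5,
            decay_learning_rate=1.0,
            passes=1,
        ):
            self.parameters = {
                "loss_function": loss_function,
                "learning_rate": learning_rate,
                "decay_learning_rate": decay_learning_rate,
                # Exposing passes lets decay_learning_rate actually matter,
                # since decay only applies across multiple passes.
                "passes": passes,
            }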

@angela97lin (Contributor, Author) commented:

@freddyaboulton I had done really preliminary perf testing a while back here: https://alteryx.atlassian.net/wiki/spaces/PS/pages/1059620252/Adding+Vowpal+Wabbit+to+AutoML

I ran into a lot of weird behavior with the tuner and vowpal wabbit; that will need to be resolved and will be pretty interesting to dig into once we want to add VW to AutoML. But you're totally right: there are so many knobs to turn with these estimators 😂

@angela97lin angela97lin merged commit c11809c into main Oct 25, 2021
@angela97lin angela97lin deleted the wabbit_integration branch October 25, 2021 05:59
@chukarsten chukarsten mentioned this pull request Oct 27, 2021
Linked issue: Add vowpal wabbit component (#770) · 6 participants