Improve SVM hyperparameters #2651

eccabay · 2021-08-18T13:02:29Z

Closes #2615 by removing "linear" as a kernel option from SVMClassifier and SVMRegressor, and by making "auto" the default gamma for both, which improves first-guess performance (as discussed in the performance test results here).
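
For context on what the gamma swap means numerically: as far as scikit-learn's documentation describes it, `gamma="auto"` resolves to `1 / n_features`, while `gamma="scale"` resolves to `1 / (n_features * X.var())` for dense input. The sketch below reproduces that arithmetic in plain Python. The helper name `svm_gamma` and the toy data are hypothetical and are not part of evalml or scikit-learn.

```python
from statistics import pvariance


def svm_gamma(X, mode):
    """Return the numeric gamma implied by a gamma mode for dense rows X.

    Hypothetical helper: it mirrors the documented formulas only.
    """
    n_features = len(X[0])
    if mode == "auto":
        # Depends only on the number of features.
        return 1.0 / n_features
    if mode == "scale":
        # Variance is taken over every entry of X, not per column.
        x_var = pvariance([value for row in X for value in row])
        return 1.0 / (n_features * x_var) if x_var != 0 else 1.0
    raise ValueError(f"unsupported gamma mode: {mode!r}")


# Toy data (an assumption for illustration): 3 samples, 2 unscaled features.
X = [[0.0, 10.0], [2.0, 12.0], [4.0, 14.0]]
gamma_auto = svm_gamma(X, "auto")
gamma_scale = svm_gamma(X, "scale")
```

When every feature is standardized to zero mean and unit variance, the overall variance is 1 and the two modes give the same value, so the choice of default matters most on unscaled data.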

codecov · 2021-08-18T13:08:04Z

Codecov Report

Merging #2651 (1256989) into main (ed74174) will not change coverage.
The diff coverage is 100.0%.

@@          Coverage Diff          @@
##            main   #2651   +/-   ##
=====================================
  Coverage   99.9%   99.9%           
=====================================
  Files        298     298           
  Lines      27305   27305           
=====================================
  Hits       27261   27261           
  Misses        44      44

Impacted Files	Coverage Δ
...omponents/estimators/classifiers/svm_classifier.py	`100.0% <ø> (ø)`
evalml/tests/component_tests/test_components.py	`100.0% <ø> (ø)`
.../components/estimators/regressors/svm_regressor.py	`100.0% <100.0%> (ø)`
...valml/tests/component_tests/test_svm_classifier.py	`100.0% <100.0%> (ø)`
evalml/tests/component_tests/test_svm_regressor.py	`100.0% <100.0%> (ø)`

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

chukarsten

Looks good! Just two hyper-nits about the docstrings!

evalml/pipelines/components/estimators/classifiers/svm_classifier.py

evalml/pipelines/components/estimators/regressors/svm_regressor.py

freddyaboulton

@eccabay Thank you for this! I agree with removing the linear kernel for both regression and classification, but I'm not sure we should tweak the other hyperparameters for regression, since you only ran the perf tests on binary classification.

I left a comment on your results as well about how much better SVM is than the next best estimator. The fit time is only slightly slower for most datasets, but for some it's about 4x slower. If the SVM is more than 4x better, there is an argument for including it in AutoMLSearch.

I'd like to continue the discussion on your perf test doc before approving!

freddyaboulton · 2021-08-18T16:02:06Z

evalml/pipelines/components/estimators/regressors/svm_regressor.py

@@ -42,7 +42,7 @@ class SVMRegressor(Estimator):
        ProblemTypes.TIME_SERIES_REGRESSION,
    ]"""

-    def __init__(self, C=1.0, kernel="rbf", gamma="scale", random_seed=0, **kwargs):
+    def __init__(self, C=1.0, kernel="rbf", gamma="auto", random_seed=0, **kwargs):


Should we make this change? The perf tests only considered binary classification problems.

This is a fair point! It's hard to say since we don't have very many regression datasets in looking glass. I'll run a few tests with what we have and see if the results are consistent or not!

@freddyaboulton Results from the regression testing are now in the performance test doc! On the very small number of datasets we have, "auto" outperforms "scale" significantly more often than the reverse, so I think this change should happen.

eccabay · 2021-08-18T18:20:16Z

One thing I forgot to implement/mention before publishing this PR is that kernel="precomputed" also needs to be removed. I had it removed in my testing branch and forgot to carry it over into this one. With this parameter, scikit-learn's SVC expects a precomputed kernel of shape (n_samples_test, n_samples_train), which we don't give it, so it errors out and doesn't run in the first place.
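
To make the shape requirement concrete, here is a minimal sketch of what a caller would have to build by hand for a precomputed kernel. As I understand scikit-learn's docs, fitting needs the square train-by-train Gram matrix and predicting needs the test-by-train one. The helper `linear_gram` and the toy data are hypothetical, and a linear kernel is used only because it is the simplest to compute.

```python
def linear_gram(A, B):
    """Return the linear-kernel Gram matrix K with K[i][j] = <A[i], B[j]>."""
    return [
        [sum(a * b for a, b in zip(row_a, row_b)) for row_b in B]
        for row_a in A
    ]


# Toy data (an assumption for illustration): 3 training rows, 2 test rows.
X_train = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
X_test = [[2.0, 3.0], [0.5, 0.5]]

# Shape (n_samples_train, n_samples_train): what fitting would need.
K_fit = linear_gram(X_train, X_train)

# Shape (n_samples_test, n_samples_train): what predicting would need.
K_predict = linear_gram(X_test, X_train)
```

A component that passes the raw feature matrix straight through never produces either matrix, which is consistent with the option failing before it can run.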

bchen1116

LGTM! Also curious about the change of gamma from scale to auto, but will wait for the results on that!

freddyaboulton

Thank you @eccabay !

eccabay added 2 commits August 18, 2021 08:53

Adjust kernel and gamma SVM hyperparameters

3842c6d

Update release notes

3942948

eccabay self-assigned this Aug 18, 2021

fix failing test

71678e8

eccabay marked this pull request as ready for review August 18, 2021 14:56

eccabay requested review from angela97lin, bchen1116, dsherry, chukarsten and freddyaboulton August 18, 2021 14:57

chukarsten approved these changes Aug 18, 2021

evalml/pipelines/components/estimators/classifiers/svm_classifier.py

evalml/pipelines/components/estimators/regressors/svm_regressor.py

freddyaboulton reviewed Aug 18, 2021

eccabay added 2 commits August 18, 2021 14:03

Docstring nitpicks

acca285

Remove precomputed kernel

71269cb

bchen1116 approved these changes Aug 19, 2021

freddyaboulton approved these changes Aug 19, 2021

eccabay added 3 commits August 19, 2021 15:39

Merge branch 'main' into svm_hyperparameters

b779ddd

Merge branch 'main' into svm_hyperparameters

21eaee7

Fix release notes

1256989

eccabay merged commit 406dd6b into main Aug 20, 2021

eccabay deleted the svm_hyperparameters branch August 23, 2021 12:17

chukarsten mentioned this pull request Sep 1, 2021

Release v0.32.0 #2729

Merged
