
Give feedback when svm.SVC is configured with kernel hyperparameters for a different kernel #19614

Open
PGijsbers opened this issue Mar 4, 2021 · 18 comments
Labels: Easy, help wanted, module:svm, New Feature

@PGijsbers (Contributor)

During our class we noticed that some students incorrectly configure hyperparameters that are irrelevant to the kernel used, for example setting gamma when using a linear kernel. We think it could make sense for scikit-learn to give feedback to the user when such ineffective settings are configured.

Describe the workflow you want to enable

from sklearn.datasets import load_iris
from sklearn.svm import SVC

x, y = load_iris(return_X_y=True)
clf = SVC(kernel='linear', gamma=1e-6)
clf.fit(x, y)
print(clf.score(x, y))

current output:

0.9933333333333333

proposed output, something similar to:

UserWarning: Gamma is set but not used because a linear kernel is configured.
0.9933333333333333
@NicolasHug (Member) commented Mar 4, 2021

Thanks for the report. I agree we should even error in this case (instead of a warning), as a bug fix. The same goes for coef0 and for SVR.

PR welcome @PGijsbers

@thomasjpfan (Member)

Currently, SVC ignores parameters depending on the kernel. For invalid combinations, I agree with @NicolasHug that we should error instead.

I think we should error when the parameter is not the default value and not compatible with the kernel. For example, if gamma!='scale' and kernel not in {'rbf', 'poly', 'sigmoid'}, we raise an error. This logic can extend to degree and coef0.
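A minimal sketch of that rule as a standalone check (the helper name and messages are hypothetical; the defaults match SVC's current signature):

def check_kernel_params(kernel, gamma="scale", degree=3, coef0=0.0):
    # Hypothetical fit-time check: reject parameters that were changed from
    # their defaults but are ignored by the chosen kernel.
    if gamma != "scale" and kernel not in ("rbf", "poly", "sigmoid"):
        raise ValueError(f"gamma={gamma!r} is ignored by kernel={kernel!r}.")
    if degree != 3 and kernel != "poly":
        raise ValueError(f"degree={degree!r} is only used by the 'poly' kernel.")
    if coef0 != 0.0 and kernel not in ("poly", "sigmoid"):
        raise ValueError(f"coef0={coef0!r} is only used by 'poly' and 'sigmoid'.")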

@PGijsbers (Contributor, Author)

I'll try to make the changes this weekend 👍

@PGijsbers (Contributor, Author)

Should the error be raised in fit, similar to incompatible configurations for LogisticRegression, or in __init__?

@NicolasHug (Member)

the validation should happen in fit (more details here if you're interested https://scikit-learn.org/stable/developers/develop.html#instantiation)

Also please make sure to write a non-regression test for the cases that are worth testing. The test should make sure that the error is now properly raised. Your snippet above is a great candidate.
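For instance, a sketch of such a test, assuming the fix raises a ValueError that mentions the parameter (the test name is made up):

import pytest
from sklearn.datasets import load_iris
from sklearn.svm import SVC

def test_svc_rejects_gamma_with_linear_kernel():
    # The snippet from the issue: gamma is set but the linear kernel ignores it.
    X, y = load_iris(return_X_y=True)
    clf = SVC(kernel="linear", gamma=1e-6)
    with pytest.raises(ValueError, match="gamma"):
        clf.fit(X, y)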

Thanks!

@PGijsbers (Contributor, Author)

This change breaks some tests, e.g. sparse_svm. Shall I go ahead and update those? Should I raise a DeprecationWarning first, or immediately set up a PR with ValueErrors?

@thomasjpfan (Member)

I would prefer to deprecate first to be on the safe side. For the first PR, let's do this for gamma first to see how other reviewers feel about it. Then we can have follow-up PRs for the other parameters.

@NicolasHug (Member)

I don't think there's anything to deprecate here: there's no feature, just a silent (minor) bug.

We could raise a temporary warning instead of an error, but for such bugfixes we tend to error directly. On top of that, the upcoming release is 1.0, so it'd be nice to include such changes directly.

Also, the failing tests mentioned above would have to be fixed whether we raise a warning or an error.

@PGijsbers (Contributor, Author)

I'll update the PR when there is a new consensus (the PR has a FutureWarning instead of a DeprecationWarning right now) :) just let me know

@NicolasHug (Member)

@thomasjpfan what made you change your mind from error to warning? The offending test sparse_svm is clearly wrong as it passes clf = svm.OneClassSVM(gamma=1, kernel=kernel) for all kernels.
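One way to repair that test, sketched (the helper name is hypothetical): only pass gamma to kernels that actually consume it.

from sklearn import svm

def make_one_class_svm(kernel):
    # Only set gamma explicitly for kernels that use it.
    if kernel in ("rbf", "poly", "sigmoid"):
        return svm.OneClassSVM(gamma=1, kernel=kernel)
    return svm.OneClassSVM(kernel=kernel)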

@jnothman (Member) commented Mar 7, 2021 via email

@thomasjpfan (Member)

The ignoring behavior is explicitly documented which makes me think it was intentional.

I do want to get to raising an error at some point and would be +1 on raising an error for 1.0.

@cmarmo removed the help wanted label Mar 7, 2021
@ogrisel (Member) commented Mar 15, 2021

> I do want to get to raising an error at some point and would be +1 on raising an error for 1.0.

Why not just go through the usual deprecation cycle? This is more user friendly than a breaking change.

@ogrisel (Member) commented Mar 15, 2021

The fact that the degree parameter is explicitly documented as ignored if kernel is not poly is a marker that the current behavior was intentional and should therefore not be considered a bug. Also, the current behavior can be (ab-)used to do simple yet efficient hyper-parameter search for all the kernels and their parametrizations at once using RandomizedSearchCV with distributions. Maybe that's an edge feature that few people use, and the educational value of the warning (then error in the future) is more important.
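For reference, that pattern looks roughly like this; a sketch with illustrative distributions, relying on the fact that parameters ignored by the sampled kernel are silently accepted:

from scipy.stats import loguniform, randint
from sklearn.datasets import load_iris
from sklearn.model_selection import RandomizedSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
param_distributions = {
    "kernel": ["linear", "poly", "rbf", "sigmoid"],
    "gamma": loguniform(1e-4, 1e1),  # silently ignored by the linear kernel
    "degree": randint(2, 6),         # only used by the poly kernel
    "coef0": loguniform(1e-3, 1e1),  # only used by poly and sigmoid
}
search = RandomizedSearchCV(SVC(), param_distributions, n_iter=20, random_state=0)
search.fit(X, y)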

Also, to be consistent, we should do the same for all 3 parameters (gamma, degree, coef0).

We should also issue the FutureWarning for SVC(gamma="auto", kernel="linear"), because the default is gamma="scale".

Since gamma="scale" is the default, it means that calling SVC(gamma="scale", kernel="linear") explicitly will not raise, which is a bit weird/surprising. We could use None as a default marker for gamma, degree and coef0, but it's a bit sad because then it is no longer possible to see the meaning of the default value just by reading the prototype of the function. One would have to read the parameters section of the docstring instead. No strong opinion on that last point.
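A sketch of the None-as-marker idea for gamma (the resolution helper is hypothetical): the effective default moves out of the signature and is resolved at fit time, which also makes an explicit gamma="scale" with a linear kernel detectable.

def resolve_gamma(gamma=None, kernel="rbf"):
    # Hypothetical fit-time resolution: None means "use the effective default".
    if gamma is None:
        return "scale"  # today's documented default
    if kernel not in ("rbf", "poly", "sigmoid"):
        # With a None marker, even an explicit gamma="scale" can be flagged
        # when the kernel ignores gamma, removing the surprise mentioned above.
        raise ValueError(f"gamma={gamma!r} is ignored by kernel={kernel!r}.")
    return gamma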

@glemaitre (Member)

Raising an error will not be an issue with the *SearchCV estimators thanks to the error_score parameter.

> Since gamma="scale" is the default, it means that calling SVC(gamma="scale", kernel="linear") explicitly will not raise, which is a bit weird/surprising. We could use None as a default marker for gamma, degree and coef0, but it's a bit sad because then it is no longer possible to see the meaning of the default value just by reading the prototype of the function.

I assume that we want to be consistent. But indeed getting gamma=None that defaults to gamma="scale" if kernel="rbf" is semantically weird. I would expect the default to be "auto", meaning that it would default to a value (that is never None). However, for gamma, "auto" already means something else.

So I assume that we are left with:

  • leave the code as it is, at the cost of users trying a non-meaningful set of hyperparameters if they are not experts and do not look at the documentation
  • change the code to use obscure default values, where you need to read the documentation to know what the real default will be

I am +0 on this. @ogrisel @NicolasHug could you give it a bit more thought and say what you think is best?
Depending on the direction, my review process will be different in the PR :)

@thomasjpfan (Member)

> Why not just go through the usual deprecation cycle? This is more user friendly than a breaking change.

Looking at this again, if we were to change behavior, I am +1 on deprecation.

> We could use None as a default marker for gamma, degree and coef0, but it's a bit sad because then it is no longer possible to see the meaning of the default value just by reading the prototype of the function.

Maybe that is a good thing? Currently, one needs to read the docs to know if the parameter is even active.

> Also, the current behavior can be (ab-)used to do simple yet efficient hyper-parameter search for all the kernels and their parametrizations at once using RandomizedSearchCV with distributions.

I think the current implementation is less efficient, because one can have search spaces with parameters that are ignored.
For example, if we had a search space kernel=['linear', 'poly', 'rbf', 'sigmoid'] x degree=[2, 3, 4, 5, 6], all the non-poly kernels would train the same model repeatedly. With error_score, the invalid combinations will exit early.
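A quick way to see that redundancy, as a sketch counting the distinct effective models in the grid above:

from itertools import product

kernels = ["linear", "poly", "rbf", "sigmoid"]
degrees = [2, 3, 4, 5, 6]

grid = list(product(kernels, degrees))  # 20 candidates a grid search would fit
# degree only matters for the poly kernel; collapse it everywhere else
effective = {(k, d if k == "poly" else None) for k, d in grid}
print(len(grid), len(effective))  # 20 candidates, only 8 distinct models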

I am +0.5 on deprecating the current behavior.

@glemaitre (Member)

So the deprecation could be a good option.

@thomasjpfan what are your thoughts regarding the gamma parameter:

> I assume that we want to be consistent. But indeed getting gamma=None that defaults to gamma="scale" if kernel="rbf" is semantically weird. I would expect the default to be "auto", meaning that it would default to a value (that is never None). However, for gamma, "auto" already means something else.

@thomasjpfan (Member)

Using 'auto' to mean 1/n_features is kind of weird in itself. To move forward, we could also rename 'auto' to 'reciprocal_n_features'?

@adrinjalali added the Easy and help wanted labels Mar 7, 2024
Issac-Kondreddy added a commit to Issac-Kondreddy/scikit-learn that referenced this issue Mar 17, 2024