
Add common tests for consistent decision_function behavior in binary case #10175

Closed

amueller opened this issue Nov 20, 2017 · 12 comments

@amueller
Member

I think there is no common test right now to check that the decision_function behavior is consistent across all estimators. We check that it is consistent with predict_proba and predict if the classes are [0, 1], but I don't think there is a test for [-1, 1] or arbitrary strings. I vaguely recall hard-coded cases for this, so I think adding a test would be really good.
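
For concreteness, a minimal sketch of what such a common check could assert, assuming a binary classifier that exposes decision_function (the helper name check_decision_function_predict is made up for illustration):

import numpy as np

def check_decision_function_predict(classifier, X, y):
    # Sketch of a common check: for a fitted binary classifier, the sign
    # of decision_function should select the same label as predict,
    # whatever the class labels are ([0, 1], [-1, 1], strings, ...).
    classifier.fit(X, y)
    # binary decision_function should be 1-D (or a single column)
    decision = np.ravel(classifier.decision_function(X))
    # positive scores map to classes_[1], non-positive to classes_[0]
    expected = classifier.classes_[(decision > 0).astype(int)]
    np.testing.assert_array_equal(expected, classifier.predict(X))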

@NarineK
Contributor

NarineK commented Nov 26, 2017

Do you still need help with this issue? May I work on it, @amueller?

@jnothman
Member

Sure you may.

@NarineK
Contributor

NarineK commented Dec 15, 2017

@amueller
Member Author

That test already tests something like this, but uses integers for classes.
You could check out https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/utils/estimator_checks.py#L1394, which is multi-class, and maybe add a binary case and also check the decision function there?
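
A hedged sketch of such an extension, reusing the pattern of the existing multi-class check but looping over several binary label encodings (the function name is hypothetical):

import numpy as np
from sklearn.datasets import make_blobs

def check_classifiers_classes_binary(classifier):
    # Sketch: fit on the same binary blobs under several label
    # encodings and assert decision_function agrees with predict.
    X, y = make_blobs(n_samples=30, centers=2, random_state=0)
    for labels in ([0, 1], [-1, 1], ["neg", "pos"]):
        y_mapped = np.asarray(labels)[y]
        classifier.fit(X, y_mapped)
        decision = np.ravel(classifier.decision_function(X))
        # classes_ is sorted, so classes_[1] is the "positive" class
        predicted = classifier.classes_[(decision > 0).astype(int)]
        np.testing.assert_array_equal(predicted, classifier.predict(X))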

@NarineK
Contributor

NarineK commented Dec 15, 2017

I see. Let me check that one.

@NarineK
Contributor

NarineK commented Dec 28, 2017

As I was writing the tests, I noticed that the output of decision_function for NuSVC and SVC is inconsistent with predict when decision_function_shape='ovo'. It works fine with 'ovr'.
Here is an example:

import numpy as np
from sklearn.datasets import make_blobs
from sklearn.preprocessing import StandardScaler
from sklearn.svm import NuSVC
from sklearn.utils import shuffle
from sklearn.utils.estimator_checks import pairwise_estimator_convert_X
from sklearn.utils.testing import set_random_state

classifier = NuSVC(decision_function_shape='ovo')
X, y = make_blobs(n_samples=30, random_state=0, cluster_std=0.1)
X, y = shuffle(X, y, random_state=7)
X = StandardScaler().fit_transform(X)
X -= X.min() - .1
X = pairwise_estimator_convert_X(X, classifier)
y_names = np.array(["one", "two", "three"])[y]
set_random_state(classifier)
classifier.fit(X, y_names)
y_pred = classifier.predict(X)

# take the argmax over the decision_function columns and map back to labels
decision = classifier.decision_function(X)
decision_y = np.argmax(decision, axis=1).astype(int)
dec_func = classifier.classes_[decision_y]

Output:

dec_func:

array(['two', 'three', 'two', 'two', 'two', 'one', 'two', 'one', 'one',
       'three', 'two', 'one', 'one', 'three', 'two', 'one', 'one', 'one',
       'two', 'one', 'one', 'three', 'two', 'three', 'three', 'two',
       'three', 'one', 'one', 'one'],
      dtype='|S5')

y_pred:

array(['three', 'one', 'three', 'three', 'three', 'one', 'three', 'one',
       'two', 'one', 'three', 'two', 'two', 'one', 'three', 'two', 'two',
       'two', 'three', 'two', 'two', 'one', 'three', 'one', 'one', 'three',
       'one', 'two', 'two', 'one'],
      dtype='|S5')

Are you aware of this, @amueller @jnothman?
Should I exclude it from the tests?
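
For what it's worth, with decision_function_shape='ovo' the columns of the decision matrix are pairwise comparisons, not per-class scores; with 3 classes both shapes happen to be (n_samples, 3), which hides the mismatch. A hedged sketch of the vote counting that predict effectively performs, assuming libsvm's convention that a positive value favours the first class of a pair:

import numpy as np

def ovo_decision_to_predictions(classifier, decision):
    # Columns of an 'ovo' decision matrix correspond to class pairs
    # (i, j) with i < j, in the order (0,1), (0,2), ..., (1,2), ...
    n_classes = len(classifier.classes_)
    votes = np.zeros((decision.shape[0], n_classes))
    k = 0
    for i in range(n_classes):
        for j in range(i + 1, n_classes):
            # a positive value votes for class i, otherwise class j
            votes[decision[:, k] > 0, i] += 1
            votes[decision[:, k] <= 0, j] += 1
            k += 1
    return classifier.classes_[np.argmax(votes, axis=1)]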

@jnothman
Member

jnothman commented Dec 31, 2017 via email

@NarineK
Contributor

NarineK commented Dec 31, 2017

I didn't find an existing issue for exactly this, so I created #10388.
Should I skip the SVC test cases or wait for the issue to be fixed?

@jnothman
Member

jnothman commented Jan 1, 2018 via email

@NarineK
Contributor

NarineK commented Jan 2, 2018

I see, I'll do the checks only for binary then.

@NarineK
Contributor

NarineK commented Feb 2, 2018

@jnothman, @amueller, have you had time to take a look at my pull request?

@jnothman
Member

jnothman commented Feb 3, 2018

Sorry, I'm not sure how this flew under my radar. Will try to take a look soon.
