FEA Top k accuracy metric #16625
Conversation
Handle multiclass case not multilabel
Thanks for the PR @gbolmier , a few comments ;)
Raise errors for `k`=1, `k`=`n_classes`, cover binary `y_true` as it can be a subset of the possible classes, fix doc and update tests and doctest to match changes
Thank you so much for taking the time to review @NicolasHug!
Add test for case when `y_true` = [0]*4
Thanks @gbolmier a few more.
Let's also add a small section in the User Guide!
sklearn/metrics/_ranking.py (Outdated)
@@ -1419,3 +1419,94 @@ def ndcg_score(y_true, y_score, k=None, sample_weight=None, ignore_ties=False):
    _check_dcg_target_type(y_true)
    gain = _ndcg_sample_scores(y_true, y_score, k=k, ignore_ties=ignore_ties)
    return np.average(gain, weights=sample_weight)

def top_k_accuracy_score(y_true, y_score, k=5, normalize=True):
Thinking about the default, should it be 2, i.e. the minimum value for which the function can be called? It would work in all cases; 5 would fail if `n_classes < 5`.
Agree with that!
I am not sure that we want `ValueError` here. I believe that for `k >= n_classes` we should probably raise a warning with the same message, but still return a score. No?
I think we should raise an error if `k >= n_classes`. This will always output an accuracy of 1, which will lure inexperienced users into thinking their estimator is doing great, when in reality they're just misusing the metric.
In general, we try to make misuses hard / impossible.
In general, we try to make misuses hard / impossible
Agree.
will lure inexperienced users into thinking their estimator is doing great
If they have a binary problem and they ask for top_5 accuracy then their estimator will be doing great obviously!
I dunno. IMHO a warning is enough.
If they have a binary problem and they ask for top_5 accuracy then their estimator will be doing great obviously!
Indeed. Also +1 to increase the default to at least 3. I think top_5 is good when you have a very large number of classes (e.g. ImageNet), but with 30-60 classes, which is more common with tabular data, top_3 is already quite useful.
If they have a binary problem and they ask for top_5 accuracy then their estimator will be doing great obviously!
Obvious to you, maybe not to them. Also note that we error in the binary case too.
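To illustrate the concern being debated: whenever `k >= n_classes`, every sample's true class is necessarily among the top k, so the metric degenerates to 1.0 no matter how bad the scores are (a small hedged sketch, not the PR's code):

```python
import numpy as np

# With 3 classes and k=3, the top-k set always contains every class,
# so the "accuracy" is 1.0 regardless of the predicted scores.
y_true = np.array([0, 1, 2, 1])
y_score = np.random.default_rng(0).random((4, 3))  # random scores

k = 3
top_k = np.argsort(y_score, axis=1)[:, -k:]          # k best classes per row
score = np.any(top_k == y_true[:, None], axis=1).mean()
print(score)  # 1.0, trivially
```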
Co-Authored-By: Nicolas Hug <contact@nicolas-hug.com>
Change `k` default to 2. Update docstring and error msg
…k_accuracy_metric
Don't we want a default scorer for that metric? I understand that
Add `y_true = [0]*5` case
Add `normalize == False` case
Change `k` default to 3 as it better represents the usual case. Make the `y_true` value error message more explicit. Check `y_score` number of columns. Fix the `k` check against n_classes
Should be better like that @jnothman
Thanks for your work and patience @gbolmier, a few more minor comments below otherwise looks good.
Also please add it to https://scikit-learn.org/stable/modules/model_evaluation.html#common-cases-predefined-values
Hi @jnothman, do you mind checking if your comments have been addressed? Two approvals here already. Thanks for your time.
Two approvals here! Time to merge?
Thanks @gbolmier. Sorry I've not found time to follow up!
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com> Co-authored-by: Jeremiah Johnson <jwjohnson314@gmail.com> Co-authored-by: Roman Yurchak <rth.yurchak@gmail.com>
Reference Issues/PRs
Closes #10488
Fixes #10144
Fixes #8234
What does this implement/fix? Explain your changes.
This implements a top-k accuracy classification metric, for use with predicted class scores in multiclass classification settings. A prediction is considered top-k accurate if the correct class is one of the k classes with the highest predicted scores.
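The idea can be sketched in a few lines of NumPy (a hypothetical illustration of the metric's definition, not the merged scikit-learn implementation; the function name and exact validation are assumptions):

```python
import numpy as np

def top_k_accuracy(y_true, y_score, k=2, normalize=True):
    """Sketch of top-k accuracy: a sample counts as correct if its true
    class is among the k classes with the highest predicted scores."""
    y_true = np.asarray(y_true)
    y_score = np.asarray(y_score)
    # Column indices of the k largest scores in each row.
    top_k = np.argsort(y_score, axis=1)[:, -k:]
    hits = np.any(top_k == y_true[:, None], axis=1)
    return hits.mean() if normalize else hits.sum()

y_true = np.array([0, 1, 2, 2])
y_score = np.array([[0.5, 0.2, 0.3],
                    [0.3, 0.4, 0.3],
                    [0.2, 0.4, 0.4],
                    [0.7, 0.2, 0.1]])
print(top_k_accuracy(y_true, y_score, k=2))                   # 0.75
print(top_k_accuracy(y_true, y_score, k=2, normalize=False))  # 3
```

With `normalize=True` the fraction of top-k-correct samples is returned; with `normalize=False` the raw count, mirroring the `normalize` convention of `accuracy_score`.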