
DOC Separate predefined scorer names from the ones requiring make_scorer #28750

Merged
9 commits merged into scikit-learn:main on Apr 11, 2024

Conversation

marijavlajic (Contributor)

Reference Issues/PRs

Related to #28671

What does this implement/fix? Explain your changes.

The table of scorer names currently in section 3.3.1.1 also includes some that haven't been implemented as named scorers and can't be passed as strings to the scoring parameter directly. These have now been removed from that table; a new table has been created in section 3.3.1.2 (where the usage of make_scorer is explained) listing the scoring functions that need to be wrapped with make_scorer.
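The distinction the PR draws can be illustrated with a short sketch (editorial, not part of the PR itself): a predefined scorer like "accuracy" is passed as a string, while fbeta_score has a required beta parameter and therefore no string name, so it must be wrapped with make_scorer.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import fbeta_score, make_scorer
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)
clf = LogisticRegression(max_iter=1000)

# A predefined scorer can be requested by its string name.
acc = cross_val_score(clf, X, y, cv=3, scoring="accuracy")

# fbeta_score has a user-settable `beta` parameter, so there is no
# string name for it; wrap it with make_scorer instead.
f2_scorer = make_scorer(fbeta_score, beta=2, average="macro")
f2 = cross_val_score(clf, X, y, cv=3, scoring=f2_scorer)
```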

Any other comments?

github-actions bot commented Apr 2, 2024

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: c9bce0c. Link to the linter CI: here

@ogrisel (Member) left a comment:

Thanks for the PR. I think the new table should be placed right after the main one, introduced by a sentence explaining that those scoring functions need to be wrapped as a scorer by the user via the make_scorer function in order to set the main parameter of interest, with a link to the section about make_scorer usage for details.

(5 resolved review threads on doc/modules/model_evaluation.rst)
@glemaitre (Member):

@ogrisel For the first table, we have the "Comments" column. I'm wondering whether, for the metrics that have a comment, we should actually show the equivalent make_scorer(..) call.

This could help with the discoverability of make_scorer, maybe?
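The "make_scorer(..) equivalent" idea can be checked directly: for a predefined name such as "neg_mean_squared_error", a hand-built make_scorer call produces the same score. The Identity estimator below is a hypothetical stand-in, used only to exercise both scorers on the same data (editorial sketch, not part of the PR).

```python
import numpy as np
from sklearn.metrics import get_scorer, make_scorer, mean_squared_error

# A trivial duck-typed estimator that predicts its input unchanged.
class Identity:
    def fit(self, X, y):
        return self
    def predict(self, X):
        return np.ravel(X)

X = np.array([[1.0], [2.0], [3.0]])
y = np.array([1.5, 2.0, 2.5])
est = Identity().fit(X, y)

# The named scorer and its make_scorer equivalent agree.
named = get_scorer("neg_mean_squared_error")
manual = make_scorer(mean_squared_error, greater_is_better=False)
assert np.isclose(named(est, X, y), manual(est, X, y))
```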

@ogrisel (Member) left a comment:

Once the 2 comments below are addressed, I think this PR will be a net improvement. +1 for merge.

Note: the style of the table is a bit crammed but this will be addressed in other PRs, in particular in @Charlie-XIAO's work on adopting a new website theme.

[screenshot of the rendered table]

Note: I took this screenshot from the HTML page rendered by our CI, by clicking the "details" link of the "Check the rendered docs here!" entry in the CI report below.

(2 resolved review threads on doc/modules/model_evaluation.rst)
@ogrisel (Member) commented Apr 2, 2024:

I like the suggestion in #28750 (comment).

Maybe add a new column named "Example usage" with the make_scorer call. If there is not enough horizontal screen real estate, we can either:

  • Shorten the title of the "User-settable parameter name" column to "Parameter"
  • or even drop that column entirely, because the parameter name should be explicit enough from the example snippet.

@marijavlajic (Contributor, Author) commented Apr 2, 2024:

I used your neat trick to look at the rendered HTML, @ogrisel, but it looks like the second table is now not rendering at all?

@glemaitre self-requested a review, April 2, 2024 16:47
(2 resolved review threads on doc/modules/model_evaluation.rst)
marijavlajic and others added 2 commits April 2, 2024 19:08
Co-authored-by: Guillaume Lemaitre <guillaume@probabl.ai>
:func:`metrics.mean_pinball_loss` ``alpha`` ``make_scorer(mean_pinball_loss, alpha=0.95)``
:func:`metrics.d2_tweedie_score` ``power`` ``make_scorer(d2_tweedie_score, power=1)``
:func:`metrics.d2_pinball_score` ``alpha`` ``make_scorer(d2_pinball_score, alpha=0.95)``
===================================== ========= ===============================================


Usage examples:
Member:

One last thing before merging: I think we should move the example below above the table that you added, because it does not rely on make_scorer. However, we could add a similar example to show that we can pass a scorer object obtained via make_scorer.

In this case, we have a usage example for each configuration.

Contributor (Author):

We had some back and forth already about the exact position of the table. I feel if it is below the example, and just above "Defining your scoring strategy from metric functions" section, then it might as well be within that section. Perhaps @ogrisel can chime in too?

And if I'm understanding correctly what you mean by "similar example to show that we can pass a scorer object obtained via make_scorer", then that example is in the 3.3.1.2. section already too?

Member:

So the example right now is:

Usage examples:

    >>> from sklearn import svm, datasets
    >>> from sklearn.model_selection import cross_val_score
    >>> X, y = datasets.load_iris(return_X_y=True)
    >>> clf = svm.SVC(random_state=0)
    >>> cross_val_score(clf, X, y, cv=5, scoring='recall_macro')
    array([0.96..., 0.96..., 0.96..., 0.93..., 1.        ])

It feels strange that it comes right after the table that mentions that you should use make_scorer.

Would it not be more friendly to have:

====
Table showing string metric
====


    >>> from sklearn import svm, datasets
    >>> from sklearn.model_selection import cross_val_score
    >>> X, y = datasets.load_iris(return_X_y=True)
    >>> clf = svm.SVC(random_state=0)
    >>> cross_val_score(clf, X, y, cv=5, scoring='recall_macro')
    array([0.96..., 0.96..., 0.96..., 0.93..., 1.        ])

Narration about `make_scorer` only metric

===
Table for `make_scorer` metric
===

Usage example:

    >>> from sklearn import svm, datasets
    >>> from sklearn.metrics import fbeta_score, make_scorer
    >>> from sklearn.model_selection import cross_val_score
    >>> X, y = datasets.load_iris(return_X_y=True)
    >>> clf = svm.SVC(random_state=0)
    >>> scorer = make_scorer(fbeta_score, beta=2)
    >>> cross_val_score(clf, X, y, cv=5, scoring=scorer)
    array([...])

It might be slightly redundant with the section below but this is just a short example usage.

But indeed @ogrisel may have another opinion, so let's see what he thinks.

Contributor (Author):

Agree that this would be a more logical way to structure it, and yes, also agree on the redundancy since the next section talks about make_scorer in more detail and gives that exact same example.

Member:

Alright, let's move the second table back to the beginning of the "3.3.1.2. Defining your scoring strategy from metric functions" section. We can start the section with a short sentence such as the one you wrote:

The following metric functions are not implemented as named scorers. They cannot be passed directly to the scoring parameter; instead, their callable needs to be passed to make_scorer together with the value of the user-settable parameters

(without link to the section since we are already in it).

Then you can simplify the paragraph that starts with "Many metrics are not given names to ..." to remove redundancies, but keep introducing the usage example with the snippet that shows how to combine make_scorer with user-settable parameters for a grid search.

Then, after the example snippet, finally insert the paragraph with the two bullet points:

[screenshot of the paragraph with the two bullet points]

This is not as important as the rest, so I would move it to the end of the section, but it's still useful information to explain the naming conventions of the metric functions and how to set the greater_is_better keyword parameter in a consistent way.
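The grid-search snippet described above can be sketched roughly as follows (an illustrative example under assumed parameter choices, not the exact snippet from the docs):

```python
from sklearn.datasets import load_iris
from sklearn.metrics import fbeta_score, make_scorer
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Wrap the metric once, fixing the user-settable parameter `beta`,
# then hand the resulting scorer object to the grid search.
ftwo_scorer = make_scorer(fbeta_score, beta=2, average="macro")
grid = GridSearchCV(SVC(), param_grid={"C": [0.1, 1.0, 10.0]},
                    scoring=ftwo_scorer, cv=3)
grid.fit(X, y)
```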

Member:

I like this proposal.

@ogrisel (Member) left a comment:

LGTM! Thanks very much for the PR.

:func:`metrics.mean_tweedie_deviance` ``power`` ``make_scorer(mean_tweedie_deviance, power=1.5)``
:func:`metrics.mean_pinball_loss` ``alpha`` ``make_scorer(mean_pinball_loss, alpha=0.95)``
:func:`metrics.d2_tweedie_score` ``power`` ``make_scorer(d2_tweedie_score, power=1.5)``
:func:`metrics.d2_pinball_score` ``alpha`` ``make_scorer(d2_pinball_score, alpha=0.95)``
Member:

I think we are missing d2_absolute_error_score here. But this PR made me wonder whether we should actually accept it as a named scorer. Thoughts on this, @glemaitre, @ogrisel?

Of course, that would be done in another PR, as this one already looks in good shape.

Member:

Yep, this would be for another PR. I'm not sure that there is a meaningful default for all use cases, and it might be better to make sure that users choose the parameter each time.

@ogrisel will have better insights on these metrics indeed.

Member:

There was a discussion in: #28750 (comment)

I think it was removed because you do not have to provide a parameter to make_scorer. I also first thought that d2_absolute_error_score had been forgotten. Maybe that is a sign that we should add a sentence to the table (in a future PR?) saying "there are more metrics; this is just a subset"?

Member:

The thing is that d2_absolute_error_score is already equivalent to make_scorer(d2_pinball_score, alpha=0.5). Accepting it as a named scorer (a string to be passed to scoring) is redundant but could be a common enough practice to be worth the shortcut. Indeed, its presence in model_evaluation.rst L105 makes me think it was meant to be the case, but the OP from #22118 forgot to add it to the _SCORERS dict.

Member:

Let's do this in a dedicated follow-up PR (and document it as part of the first table for named scorers that do not require setting parameters).

@ArturoAmorQ (Member) left a comment:

Just a small change to reflect the decision in https://github.com/scikit-learn/scikit-learn/pull/28750/files#r1559656259. Otherwise LGTM!

(1 resolved review thread on doc/modules/model_evaluation.rst)
Co-authored-by: Arturo Amor <86408019+ArturoAmorQ@users.noreply.github.com>
@betatim enabled auto-merge (squash), April 11, 2024 07:18

@betatim (Member) commented Apr 11, 2024:

I committed the change and enabled auto-merge

@betatim merged commit 819b286 into scikit-learn:main, Apr 11, 2024
28 checks passed