DOC custom scoring usage GridSearchCV and RandomizedSearchCV #28694

siddu1324 · 2024-03-25T18:41:28Z

Reference Issues/PRs

References #28671

What does this implement/fix? Explain your changes.

This PR adds documentation examples for using custom scoring functions with GridSearchCV and RandomizedSearchCV, specifically illustrating how to use make_scorer for metrics requiring additional parameters, like d2_pinball_score. This enhancement addresses user requests for clearer guidance on applying custom scorers in model selection.

github-actions · 2024-03-25T18:44:00Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 039d1b1. Link to the linter CI: here}

…earn into doc_makescore divergent branch

jeremiedbb · 2024-03-26T17:00:56Z

Thanks for the PR @siddu1324. I'm not sure the notes is the right place for this added doc. I don't think it would have helped figuring out how to correctly use the scoring parameter.

I think improving the scoring parameter description would have a better impact. ping @ogrisel who originally answered in the linked issue, wdyt ?

ogrisel

In addition to @jeremiedbb's remark above about the location of the snippet in docstring, there are several problem:

the linear regression model does not accept an alpha parameter. Calling fit on a dataset generated by make_regression would raise:

ValueError: Invalid parameter 'alpha' for estimator LinearRegression(). Valid parameters are: ['copy_X', 'fit_intercept', 'n_jobs', 'positive'].

I would also rather not tune a hyperparameter that has the same name as the metric parameter to avoid introducing any confusion;
furthermore, it's weird to tune a linear regression model that estimates the expected value of the target variable conditionally on the features on a metric that assess it's ability to estimate a 0.95 quantile. I would instead a quantile estimator for this loss or alternatively use another parametrized metric such as fbeta_score on a simple classifier such as LogisticRegression.

ogrisel · 2024-04-09T09:55:12Z

sklearn/model_selection/_search.py

+    >>> from scipy.stats import expon
+    >>> param_dist = {'alpha': expon()}
+    >>> rnd_search = RandomizedSearchCV(LinearRegression(),
+    param_distributions=param_dist, scoring=custom_scorer)


Also this is the docstring of the GridSearchCV class but this code snippet shows how to use the RandomizedSearchCV instead.

A similar can be added in the inline examples section of each of those classes but should be adapted accordingly.

spq6r added 2 commits March 25, 2024 12:56

DOC: Illustrate custom scorer usage in RandomizedSearchCV docstring

a544339

DOC: made changes to comply flake8

f371288

github-actions bot added module:model_selection Documentation labels Mar 25, 2024

Merge branch 'main' into doc_makescore

b53da98

spq6r added 2 commits March 25, 2024 15:16

made changes to the failed test

543e0fd

Merge branch 'doc_makescore' of https://github.com/siddu1324/scikit-l…

039d1b1

…earn into doc_makescore divergent branch

ogrisel reviewed Apr 9, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC custom scoring usage GridSearchCV and RandomizedSearchCV #28694

DOC custom scoring usage GridSearchCV and RandomizedSearchCV #28694

siddu1324 commented Mar 25, 2024

github-actions bot commented Mar 25, 2024 •

edited

jeremiedbb commented Mar 26, 2024

ogrisel left a comment

ogrisel Apr 9, 2024

DOC custom scoring usage GridSearchCV and RandomizedSearchCV #28694

Are you sure you want to change the base?

DOC custom scoring usage GridSearchCV and RandomizedSearchCV #28694

Conversation

siddu1324 commented Mar 25, 2024

Reference Issues/PRs

What does this implement/fix? Explain your changes.

github-actions bot commented Mar 25, 2024 • edited

✔️ Linting Passed

jeremiedbb commented Mar 26, 2024

ogrisel left a comment

Choose a reason for hiding this comment

ogrisel Apr 9, 2024

Choose a reason for hiding this comment

github-actions bot commented Mar 25, 2024 •

edited