DOC adapt logistic regression objective in user guide #28706
Conversation
@ogrisel ping.
doc/modules/linear_model.rst (outdated)

```diff
-    \min_{w} C \sum_{i=1}^n s_i \left(-y_i \log(\hat{p}(X_i)) - (1 - y_i) \log(1 - \hat{p}(X_i))\right) + r(w),
+    \min_{w} \frac{1}{s}\sum_{i=1}^n s_i
+    \left(-y_i \log(\hat{p}(X_i)) - (1 - y_i) \log(1 - \hat{p}(X_i))\right)
+    + \frac{r(w)}{sC}\,,
```
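Here, presumably, $s$ denotes the total sample weight (this definition is inferred from the $\frac{1}{s}$ factor; the diff itself does not state it):

```math
s = \sum_{i=1}^n s_i,
```

so the loss term is the weighted average of the per-sample log losses.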
Readers might find it weird to see an objective function that has a fixed constant positive multiplier on all the terms. But I agree it makes it explicit that the user-provided sample weights do not need to sum to 1.
Also, maybe we could add a note to state that scikit-learn uses a weird inverse parametrization C for the strength of the regularizer. This has historical roots in the fact that scikit-learn reused the parametrization of liblinear, a library that provided an implementation of penalized logistic regression and reused the mathematical parametrization from the support vector machine literature. Maybe also cross-link to https://scikit-learn.org/stable/modules/svm.html#svc.
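A minimal sketch of that inverse relationship on toy data (the dataset and values are illustrative, not from this PR): smaller C means stronger regularization and hence smaller coefficients.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, random_state=0)

# C is the *inverse* of the regularization strength: decreasing C
# penalizes the weights more heavily and shrinks the coefficients.
for C in (0.01, 1.0, 100.0):
    clf = LogisticRegression(C=C).fit(X, y)
    print(f"C={C:>6}: ||coef|| = {np.linalg.norm(clf.coef_):.3f}")
```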
For some time already, I have been considering deprecating C and adding the same parameters as for elastic net. Not sure what other core devs think about that.
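Reading this off the new formulation above, an elastic-net-style strength parameter would presumably correspond to (this mapping is inferred from the diff, not stated in the PR):

```math
\alpha = \frac{1}{sC},
```

so that the penalty term can be written as $\alpha\, r(w)$.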
I opened #28711.
> Readers might find it weird to see an objective function that has a fixed constant positive multiplier on all the terms.

I do.

> But I agree it makes it explicit that the user-provided sample weights do not need to sum to 1.

This is never an assumption in any scikit-learn estimator. What makes you think users might think otherwise here?
The loss part is just an empirical mean, or weighted average; maybe writing `np.average(loss, weights=weights)` or adding a comment would make it easier to read. Or throwing away the weights in the formula?
@jeremiedbb What is your suggestion?
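A concrete sketch of that `np.average` reading (the arrays are illustrative placeholders, not code from the PR):

```python
import numpy as np

# Per-sample log losses and user-provided sample weights (illustrative).
loss = np.array([0.2, 0.7, 0.1])
weights = np.array([1.0, 2.0, 0.5])

# The loss term of the new objective is just a weighted average ...
avg = np.average(loss, weights=weights)
# ... i.e. (weights * loss).sum() / weights.sum(), so the weights
# need not sum to 1.
assert np.isclose(avg, (weights * loss).sum() / weights.sum())
```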
The PR multiplies the objective function by a constant term, which can be seen as, quoting Olivier, weird. So I'm trying to understand the arguments for doing it anyway. I don't find the argument "it makes it explicit that the user-provided sample weights do not need to sum to 1" very convincing, because I've never seen this be a source of confusion. This is why I was asking for more information 😄.
I had a chat IRL with Olivier, and although I'm still not convinced by the explicitness argument, I'm convinced by others, like the aim to standardize the objective functions throughout the docs and to make them per-sample averages, so that they are comparable between datasets of different sizes (linked to #28169).
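A small sketch of that comparability argument (illustrative; `log_loss` only stands in for the data term of the objective): duplicating every sample doubles a summed objective but leaves the per-sample average unchanged.

```python
import numpy as np
from sklearn.metrics import log_loss

rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=100)          # binary labels
p = rng.uniform(0.05, 0.95, size=100)     # predicted probabilities

summed = log_loss(y, p, normalize=False)  # scales with n_samples
averaged = log_loss(y, p)                 # per-sample average

# Duplicate the dataset: the sum doubles, the average stays the same.
y2, p2 = np.tile(y, 2), np.tile(p, 2)
assert np.isclose(log_loss(y2, p2, normalize=False), 2 * summed)
assert np.isclose(log_loss(y2, p2), averaged)
```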
Even if the new expression seems artificially complex, I agree it makes the lack of any assumption about weight normalization more explicit. Furthermore, dividing the objective function by C makes the LogReg objective more easily aligned with the other objective functions on the same page.
LGTM
Reference Issues/PRs
This popped up in #28700.
What does this implement/fix? Explain your changes.
All solvers, except liblinear, use the formulation where C directly enters the penalty.
Any other comments?