
Conversation

@ogrisel (Collaborator) commented Jun 30, 2022

This PR aims to better test the lbfgs_step fallback mechanism of scikit-learn#23314 (introduced in c9b1200 and subsequent commits).

This test is currently failing and highlights the fact that newton-cholesky with an inner solver that switches to 4 steps of LBFGS whenever the Hessian is found to be ill-conditioned does not work as expected: it can be very slow, because this situation can occur many times within a single fit call. Furthermore, repeatedly calling 'L-BFGS-B' with "maxiter": 4 via scipy.optimize.minimize is not equivalent to calling it once with a large maxiter, because each restart loses the memory of the previous gradients.
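To illustrate that last point, here is a minimal sketch (my own illustration, not code from this PR, using a toy ill-conditioned quadratic) comparing one long L-BFGS-B run with many restarted 4-iteration runs via scipy.optimize.minimize:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
A = np.diag(np.logspace(0, 6, 20))  # deliberately ill-conditioned quadratic
b = rng.normal(size=20)


def f_and_grad(x):
    # 0.5 * ||A x - b||^2 and its gradient.
    r = A @ x - b
    return 0.5 * r @ r, A.T @ r


x0 = np.zeros(20)

# One long run: L-BFGS-B keeps its memory of past gradients throughout.
res_long = minimize(f_and_grad, x0, jac=True, method="L-BFGS-B",
                    options={"maxiter": 400})

# Many short runs: each restart begins with an empty L-BFGS memory.
x = x0
for _ in range(100):
    res = minimize(f_and_grad, x, jac=True, method="L-BFGS-B",
                   options={"maxiter": 4})
    x = res.x

print("single 400-iteration run:", res_long.fun)
print("100 restarted 4-iteration runs:", f_and_grad(x)[0])
```

On such a problem the restarted runs typically reach a much worse objective value despite spending the same total iteration budget.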

I would therefore recommend to stop attempting to implement a fine-grained fallback mechanism for the inner_solve step of the Newton solvers. Instead I would suggest to let the Newton solver raise an exception when this happens and use a coarse-grained fallback to the "lbfgs" solver until convergence (possibly warm starting from the previous coef in case the Newton solver had a chance to update them successfully for a few iterations). This coarse fallback should be much simpler to implement and maintain, and should be guaranteed to converge to a solution as good as the one the user would have obtained by choosing LBFGS in the first place.

The scikit-learn-level warning message should be adapted accordingly.
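For illustration, a rough sketch of this coarse-grained fallback (hypothetical names such as newton_solve and IllConditionedHessian, not the actual scikit-learn code) could look like:

```python
import warnings

from scipy.optimize import minimize
from sklearn.exceptions import ConvergenceWarning


class IllConditionedHessian(Exception):
    """Raised by the hypothetical Newton inner solve when Cholesky fails."""


def fit_with_fallback(func_and_grad, newton_solve, coef0, max_iter=100):
    coef = coef0
    try:
        # Hypothetical Newton loop that raises instead of patching each step.
        coef = newton_solve(coef, max_iter=max_iter)
    except IllConditionedHessian as exc:
        warnings.warn(
            f"{exc} Falling back to lbfgs until convergence, warm-started "
            "from the current coefficients.",
            ConvergenceWarning,
        )
        res = minimize(
            func_and_grad, coef, jac=True, method="L-BFGS-B",
            options={"maxiter": max_iter},
        )
        coef = res.x
    return coef
```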

What do you think about this plan @lorentzenchr?

Credits: this plan was originally suggested by @GaelVaroquaux IRL.

@ogrisel (Collaborator, Author) commented Jun 30, 2022

If you agree with this plan, let me know and I can help you implement it to get the test in this PR to pass. But feel free to do it yourself if you prefer.

@lorentzenchr (Owner) commented

I would therefore recommend to stop attempting to implement a fine-grained fallback mechanism for the inner_solve step of the Newton solvers. [...] What do you think about this plan?

I'm very happy with it. I thought a lot about the best action (from a user's perspective) for a solver to take in case of "dynamically detected" convergence problems:

  1. Stop and raise error
  2. Warn and try different solver, e.g. lbfgs (your suggestion)
  3. Warn and temporarily use a different step in this iteration
    a. gradient step
    b. lbfgs step(s)
    c. step with a modified Hessian, e.g. adding a multiple of the identity (which amounts to an extra penalty), an LDL decomposition, QR, SVD, ... (see the sketch after this list)
    ...
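For option 3c, a minimal sketch (my own illustration, not code from this PR) of taking a Newton step with a multiple of the identity added to the Hessian could look like:

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve


def damped_newton_step(hessian, gradient, lam0=1e-10, max_tries=15):
    """Newton step -(H + lam * I)^{-1} g with the smallest damping that works.

    With a numerically positive-definite Hessian, lam stays 0 and this reduces
    to the plain Cholesky-based Newton step.
    """
    lam = 0.0
    eye = np.eye(hessian.shape[0])
    for _ in range(max_tries):
        try:
            c, low = cho_factor(hessian + lam * eye)
            return -cho_solve((c, low), gradient)
        except np.linalg.LinAlgError:
            # Cholesky failed: increase the diagonal shift and retry.
            lam = lam0 if lam == 0.0 else 10.0 * lam
    raise np.linalg.LinAlgError("Hessian could not be regularized.")
```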

@lorentzenchr (Owner) commented

If you agree with this plan, let me know and I can help you implement it

I would appreciate your help very much. I added you as a collaborator on this fork to make it easier to work together.

@ogrisel (Collaborator, Author) commented Jul 1, 2022

I will try to move this forward this afternoon. My plan is to:

EDIT: @lorentzenchr was too fast. Let me update this PR.

@ogrisel (Collaborator, Author) commented Jul 1, 2022

I merged and now the updated collinear data test passes!

@ogrisel merged commit 83944aa into lorentzenchr:glm_newton_cholesky on Jul 1, 2022
@ogrisel deleted the test-glm-collinear-data branch on July 1, 2022 14:07