TST add test to check that all ridge solver give the same results #13914

jeromedockes · 2019-05-21T08:10:13Z

There lacks a test checking that all the ridge solvers (including the gcv) give similar results for a few different datasets (varying shapes, float32/64, and sparsity)

this test:

scikit-learn/sklearn/linear_model/tests/test_ridge.py

Line 290 in 0a2dc72

def test_ridge_individual_penalties():

compares solvers but for only one dense dataset with null coefficients and no intercept, and is meant to test individual penalties on outputs, so RidgeCV cannot be included since it does not support individual penalties.

after IRL with @ogrisel

…lvers_consistency

agramfort · 2019-05-25T14:19:43Z

@jeromedockes do you time to also look at #13246

there is also there a problem with Ridge that is still different between dense and sparse

jeromedockes · 2019-05-28T15:49:50Z

@jeromedockes do you time to also look at #13246

there is also there a problem with Ridge that is still different between dense and sparse

yes, as far as I can tell the Ridge class (or other estimators that rely on it such as RANSAC with a Ridge as base estimator) is the only remaining problem for #13246 ?

It seems that at the moment, only the sparse_cg solver in Ridge can correctly fit an intercept when the design matrix is sparse, but it doesn't get selected when the solver is auto. SAG gets selected, and the documentation says it can fit an intercept with sparse X, but in most cases when X is sparse (hence not centered in preprocessing) sag requires many more iterations than the defaut max to find the right intercept, and the tolerance needs to be lowered as well.

the test that currently should check that Ridge fits an intercept uses a dataset where X has zero mean, which is why this problem is not caught by the tests on master.

I am planning to work on this and open a PR soon.

NicolasHug

Tests run pretty fast (about 1sec) so LGTM

sklearn/linear_model/tests/test_ridge.py

…lvers_consistency

Ridge docstring says it is more stable than cholesky

agramfort · 2019-06-10T06:56:25Z

sklearn/linear_model/tests/test_ridge.py

+    'n_samples,dtype,proportion_nonzero',
+    [(20, 'float32', .1), (40, 'float32', 1.), (20, 'float64', .2)])
+@pytest.mark.parametrize('sparse_X', [True, False])
+@pytest.mark.parametrize('seed', np.arange(300))


300 seeds seems a bit too much.

my apologies about that, I had wanted to check with many seeds locally and
forgot to reduce the number of seeds afterwards. I set it to 3 now

agramfort

besides maybe a too ambitious/slow test LGTM. +1 for MRG

agramfort · 2019-06-10T11:21:57Z

Can I merge this one?

sklearn/linear_model/tests/test_ridge.py

rth · 2019-06-10T11:59:08Z

sklearn/linear_model/tests/test_ridge.py

+    svd_ridge = Ridge(
+        solver='svd', normalize=True, alpha=alpha).fit(X, y)
+    X = X.astype(dtype)
+    y = y.astype(dtype)


Add copy=False

rth

Thanks @jeromedockes !

jeromedockes · 2019-06-11T11:31:21Z

Thanks @jeromedockes !

thanks for your help!

…ikit-learn#13914)

jeromedockes added 2 commits May 21, 2019 09:47

add test to check that all ridge solver give the same results

d33d410

Merge remote-tracking branch 'upstream/master' into add_test_ridge_so…

73ad6e1

…lvers_consistency

NicolasHug approved these changes May 28, 2019

View reviewed changes

sklearn/linear_model/tests/test_ridge.py Outdated Show resolved Hide resolved

jeromedockes added 3 commits May 28, 2019 21:45

p -> proportion_nonzero

294db99

Merge remote-tracking branch 'upstream/master' into add_test_ridge_so…

64bf535

…lvers_consistency

use svd solver as reference

3f308ba

Ridge docstring says it is more stable than cholesky

agramfort reviewed Jun 10, 2019

View reviewed changes

agramfort approved these changes Jun 10, 2019

View reviewed changes

agramfort changed the title ~~add test to check that all ridge solver give the same results~~ [MRG+1] add test to check that all ridge solver give the same results Jun 10, 2019

reduce number of seeds in ridge test

af88bcc

rth reviewed Jun 10, 2019

View reviewed changes

sklearn/linear_model/tests/test_ridge.py Outdated Show resolved Hide resolved

rth reviewed Jun 10, 2019

View reviewed changes

rth mentioned this pull request Jun 11, 2019

Programatically finding all supported solvers and losses for an estimator #14063

Open

jeromedockes added 2 commits June 11, 2019 11:39

avoid unnecessary copies in test

0f08cbc

avoid skipping tests

762fe5d

rth approved these changes Jun 11, 2019

View reviewed changes

rth changed the title ~~[MRG+1] add test to check that all ridge solver give the same results~~ TST add test to check that all ridge solver give the same results Jun 11, 2019

rth merged commit 61f6f5b into scikit-learn:master Jun 11, 2019

jeromedockes deleted the add_test_ridge_solvers_consistency branch June 11, 2019 11:30

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

TST add test to check that all ridge solver give the same results (sc…

5c8fcd7

…ikit-learn#13914)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TST add test to check that all ridge solver give the same results #13914

TST add test to check that all ridge solver give the same results #13914

jeromedockes commented May 21, 2019

agramfort commented May 25, 2019

jeromedockes commented May 28, 2019

NicolasHug left a comment

agramfort Jun 10, 2019

jeromedockes Jun 10, 2019

agramfort left a comment

agramfort commented Jun 10, 2019

rth Jun 10, 2019

rth left a comment

jeromedockes commented Jun 11, 2019

TST add test to check that all ridge solver give the same results #13914

TST add test to check that all ridge solver give the same results #13914

Conversation

jeromedockes commented May 21, 2019

agramfort commented May 25, 2019

jeromedockes commented May 28, 2019

NicolasHug left a comment

Choose a reason for hiding this comment

agramfort Jun 10, 2019

Choose a reason for hiding this comment

jeromedockes Jun 10, 2019

Choose a reason for hiding this comment

agramfort left a comment

Choose a reason for hiding this comment

agramfort commented Jun 10, 2019

rth Jun 10, 2019

Choose a reason for hiding this comment

rth left a comment

Choose a reason for hiding this comment

jeromedockes commented Jun 11, 2019