
FIX Fixes test_scale_and_stability #18746

Merged

Conversation

thomasjpfan
Member

@thomasjpfan thomasjpfan commented Nov 3, 2020

Reference Issues/PRs

Fixes #18613
Fixes #6279
Similar to #13903

What does this implement/fix? Explain your changes.

Mapping small weights to zero helps with stabilization.
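A minimal sketch of that idea, assuming a relative threshold (the helper name and tolerance are illustrative, not the code in this PR):

```python
import numpy as np

def clip_small_weights(w, rel_tol=1e-10):
    """Zero out entries that are negligible relative to the largest weight.

    `rel_tol` is an illustrative threshold, not the value used in the PR.
    """
    w = np.asarray(w, dtype=float).copy()
    w[np.abs(w) < rel_tol * np.abs(w).max()] = 0.0
    return w

# A weight vector like the one discussed in the conversation:
# the second entry is numerical noise.
print(clip_small_weights([1.56, 7.10e-15]))
```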

Any other comments?

This error can be reproduced on Windows with the latest numpy + scipy installed from PyPI and Python 3.8.

Member

@ogrisel ogrisel left a comment

+1 (I trust you for the Windows stability manual check :)

Member

@ogrisel ogrisel left a comment

Retracting my previous +1 review because I now realize that I had misunderstood the code in my first review. I will need more time to understand what's going on and what kind of fix would make sense.

I think it would help if the failing test could be split/decomposed or parametrized to show for which kind of data the problem is happening.

@ogrisel ogrisel self-requested a review November 4, 2020 13:24
@thomasjpfan
Member Author

thomasjpfan commented Nov 4, 2020

As background, this issue appears for CCA and X2, Y2 in the loop:

for (X, Y) in [(X1, Y1), (X2, Y2), (X3, Y3)]:

The issue stems from the following dot product having a different result depending on the platform + numpy version:

y_weights = np.dot(Y_pinv, x_score)

On Windows one of the iterations gets [1.56, 7.10e-15], while on Linux it gets [1.56, 1.77e-15]; this error propagates to the next iteration.

Edit: There are small differences in the output of pinv2 between platforms as well.
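For context, here is a runnable sketch of where that dot product sits in the power-method step (names and shapes are illustrative, and np.linalg.pinv stands in for scipy's pinv2):

```python
import numpy as np

rng = np.random.default_rng(0)
X, Y = rng.standard_normal((20, 3)), rng.standard_normal((20, 2))

# One step of the power method on illustrative random data.
y_score = Y[:, 0].copy()
x_weights = X.T @ y_score
x_weights /= np.linalg.norm(x_weights)
x_score = X @ x_weights

# The platform-sensitive step: a pseudo-inverse followed by a dot product.
# Tiny components of y_weights (~1e-15) differ between BLAS builds, and
# the error compounds across iterations.
Y_pinv = np.linalg.pinv(Y)
y_weights = Y_pinv @ x_score
print(y_weights.shape)
```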

@NicolasHug
Member

Is this just a matter of setting max_iter or tol then?

(CCA is notoriously unstable, especially for poorly conditioned matrices. When Y2 is scaled into Ys it looks like a rank-1 matrix to me)

Member Author

@thomasjpfan thomasjpfan left a comment

Updated this PR by adjusting the `cond` of pinv2.

sklearn/cross_decomposition/_pls.py
Member

@NicolasHug NicolasHug left a comment

LGTM then, thanks @thomasjpfan

This might need a what's new entry since this may induce a change in results in some cases.

doc/whats_new/v0.24.rst
Member

@ogrisel ogrisel left a comment

The solution looks fine to me, but I would be more confident if you could update test_scale_and_stability to split it into 3 tests: test_scale_and_stability_linerud, test_scale_and_stability_non_regression_2821 and test_scale_and_stability_non_regression_7819, so that we know instantly which dataset is causing the instability should we observe a new failure in the future.

Also we could use sklearn.preprocessing.scale instead of manually scaling the input datasets.

I don't have a Windows machine handy, so I cannot quickly check whether the changes I suggest would still cause the failure to happen on master (scaling with a different ddof in particular).

@ogrisel
Member

ogrisel commented Nov 5, 2020

Or alternatively, if you understand the cause of the problem well enough, could you implement a new test that would trigger a similar problem on any platform (not just windows + whatever BLAS is used on the failing CI) by making the instability even stronger?

@@ -127,6 +127,9 @@ Changelog
predictions for `est.transform(Y)` when the training data is single-target.
:pr:`17095` by `Nicolas Hug`_.

- |Fix| Increases the stability of :class:`cross_decomposition.CCA` :pr:`18746`
Member

Specify that it's only for poorly conditioned matrices? So that not all users expect changes in their results.

Member Author

@thomasjpfan thomasjpfan Nov 5, 2020

The issue comes from:

https://github.com/scipy/scipy/blob/8e30f7797bd1ee442f4f1a25172e4402521c1e16/scipy/linalg/basic.py#L1385

when pinv2 is called. On Windows with the PyPI version of numpy, the singular values for Xk are:

[0.7, 0.14, 9e-16]

which results in a rank of 3.

On other platforms, where the original test passes, the singular values are

[0.7, 0.14, 1.4e-17]

which results in a rank of 2. This is by design, as stated in the reference paper on page 12, where the rank should decrease when Xk gets updated.

Xk is updated here:

Xk -= np.outer(x_scores, x_loadings)

Given this, I do not think it's about poorly conditioned matrices. It is our usage of pinv2 that was flaky.
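The rank drop from deflation can be reproduced with a small sketch (random data, not the test's matrices): after subtracting the leading rank-1 component, the third singular value of Xk is only approximately zero, and its exact magnitude depends on the platform and BLAS.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((20, 3))

# One deflation step: subtract the best rank-1 approximation, as in
# Xk -= np.outer(x_scores, x_loadings).
u, s, vt = np.linalg.svd(X, full_matrices=False)
x_scores = u[:, 0] * s[0]
x_loadings = vt[0]
Xk = X - np.outer(x_scores, x_loadings)

# Mathematically Xk now has rank 2; numerically the third singular
# value is rounding noise whose size varies across platforms.
s_k = np.linalg.svd(Xk, compute_uv=False)
print(s_k)
```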

Member

when pinv2 is called. On Windows with the PyPI version of numpy, the singular values for Xk are:

After which iteration? If this happens at the first iteration, then this is indeed a problem of condition number, I believe.

Member

(by first iteration I mean the first component, or equivalently the first call to _get_first_singular_vectors_power_method)

Member Author

After which iteration? If this happens at the first iteration, then this is indeed a problem of condition number, I believe.

This happened on the second call to _get_first_singular_vectors_power_method.

Member

I think it would still be valuable to identify in which cases there's a stability improvement. Clearly, not all users will be impacted by this

Member Author

I think this happens randomly. I updated test_scale_and_stability to generate random ys and found that there were seeds that fail on my machine and on windows.

From a user point of view, for some datasets they may have gotten an incorrect model.

Member

Both random y and X? Because otherwise the problem might just be coming from X.

I still believe this is related to poorly conditioned matrices, because what you pass to _get_first_singular_vectors_power_method at iteration i are the deflated matrices from iteration i - 1 (deflation = subtracting a rank-1 matrix to obtain a rank (r - 1) matrix).

I'm quite certain that the name of the parameter to pinv (cond or rcond) is related to the condition number.

Member Author

Both random y and X? Because otherwise the problem might just be coming from X.

Updated with randomly generated X and y. The condition number will always get much bigger when the matrix becomes a rank r - 1 matrix.

When the rank becomes r - 1, pinv2 by default has trouble determining the rank of the matrix:

https://github.com/scipy/scipy/blob/8e30f7797bd1ee442f4f1a25172e4402521c1e16/scipy/linalg/basic.py#L1457-L1466

Setting cond = 10 * eps helps in this case.
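A sketch of why the cutoff matters, reusing the singular values quoted above (the default-cutoff formula is an approximation of what scipy's pinv2 computes internally):

```python
import numpy as np

s = np.array([0.7, 0.14, 9e-16])   # singular values reported on Windows
eps = np.finfo(np.float64).eps

# pinv2's default cutoff is roughly max(M, N) * eps * s.max(); for a
# 3-column matrix that lands below 9e-16, so the noise value is kept.
default_cutoff = 3 * eps * s.max()
print((s > default_cutoff).sum())   # numerical rank: 3

# With cond = 10 * eps the cutoff moves above 9e-16 and the rank is 2.
fixed_cutoff = 10 * eps * s.max()
print((s > fixed_cutoff).sum())     # numerical rank: 2
```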

Member Author

@thomasjpfan thomasjpfan Nov 6, 2020

Specifically, when pinv2 thinks a rank r - 1 matrix is rank r, it divides by a really small singular value when computing the inverse.
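To illustrate, a sketch using np.linalg.pinv, whose rcond plays the same role as pinv2's cond: keeping the noise singular value makes the pseudo-inverse blow up, while a 10 * eps cutoff keeps it well scaled.

```python
import numpy as np

# Build a numerically rank-2 matrix whose third singular value is noise.
rng = np.random.default_rng(0)
u, _ = np.linalg.qr(rng.standard_normal((6, 3)))
v, _ = np.linalg.qr(rng.standard_normal((3, 3)))
s = np.array([0.7, 0.14, 9e-16])
A = u @ np.diag(s) @ v.T

eps = np.finfo(np.float64).eps
# A tiny cutoff keeps 9e-16 and divides by it: enormous pseudo-inverse.
loose = np.linalg.pinv(A, rcond=1e-17)
# The fix's cutoff treats 9e-16 as zero: well-scaled pseudo-inverse.
tight = np.linalg.pinv(A, rcond=10 * eps)

print(np.linalg.norm(loose))
print(np.linalg.norm(tight))
```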

sklearn/cross_decomposition/tests/test_pls.py
@cmarmo
Contributor

cmarmo commented Nov 12, 2020

This is ready to be merged, right? @glemaitre? Thanks!

@jeremiedbb jeremiedbb added this to the 0.24 milestone Nov 13, 2020
Member

@ogrisel ogrisel left a comment

The explanation @thomasjpfan provided above looks good to me and is empirically backed by the fact that the updated test can fail with different random seeds even on 64-bit Linux.

I am wondering: shall we try to remove the _UnstableArchMixin from the CCA mro?

@@ -128,6 +128,9 @@ Changelog
predictions for `est.transform(Y)` when the training data is single-target.
:pr:`17095` by `Nicolas Hug`_.

- |Fix| Increases the stability of :class:`cross_decomposition.CCA` :pr:`18746`
Member

@ogrisel ogrisel Nov 13, 2020

I think this is not just for CCA but also for the related PLSCanonical model, isn't it? PLSSVD uses a different fit method.

PLSCanonical uses mode="A", which does not rely on those pinv2 calls. So the what's new entry is correct.

@ogrisel
Member

ogrisel commented Nov 13, 2020

Let me try to push this and trigger a full build of all the wheels on all the architectures (32 bit and 64 bit windows and linux).

@ogrisel
Member

ogrisel commented Nov 13, 2020

Both the 32-bit Linux and Windows builds passed without the _UnstableArchMixin, which is apparently useless now. Let me delete it. Edit: _UnstableArchMixin is also used by LocallyLinearEmbedding.

I will also launch the arm64 test just to be sure. Unfortunately we don't have any PPC CI configuration, but I am pretty sure this fix would solve the numerical stability problems we observed there as well.

@ogrisel
Member

ogrisel commented Nov 13, 2020

It also passed on arm64. Waiting for the final Azure builds to complete, and then I plan to merge unless @NicolasHug or @thomasjpfan has an objection.

@ogrisel ogrisel merged commit 84bd4e2 into scikit-learn:master Nov 13, 2020
@ogrisel
Member

ogrisel commented Nov 13, 2020

Merged! Thanks @thomasjpfan for tracking down the cause of this issue. I am really glad to have a numerically stable CCA in scikit-learn after all these years.
