ENH: sparse.linalg: Add the feature of spectral norm of sparse matrices #14855

Closed
wants to merge 4 commits into from

Conversation

@zhaog6 (Contributor) commented Oct 14, 2021

Reference issue

What does this implement/fix?

In this PR, the spectral norm (2-norm and -2-norm) of sparse matrices is added; the singular values are computed indirectly by the efficient LOBPCG solver.

Additional information

@zhaog6 changed the title from "ENH: sparse.linalg: Add the feature of spectral norm of matrices" to "ENH: sparse.linalg: Add the feature of spectral norm of sparse matrices" on Oct 14, 2021
Use "np.integer" in "issubdtype"

Use LOBPCG directly instead of "scipy.sparse.linalg.svds"

Find the "lobpcg" module
@lobpcg (Contributor) commented Oct 15, 2021

This can be done more easily just by calling https://github.com/scipy/scipy/blob/master/scipy/sparse/linalg/eigen/_svds.py, which includes lobpcg as one of the solvers.
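A minimal sketch of that suggestion (the matrix here is only illustrative): the spectral norm is the largest singular value, which svds can compute with lobpcg as its solver.

from scipy.sparse import random as sparse_random
from scipy.sparse.linalg import svds

# Illustrative random sparse matrix; any sparse matrix works here.
A = sparse_random(100, 80, density=0.05, random_state=0)

# The largest singular value is the 2-norm of A; lobpcg is used as
# the eigensolver inside svds.
s = svds(A, k=1, solver='lobpcg', return_singular_vectors=False)
print(s[0])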

@zhaog6 (Contributor, Author) commented Oct 18, 2021

Dear Andrew @lobpcg,
I had written this feature by calling svds before (I rebased and squashed the commit history, so you can't see which). The reason I changed my mind and call lobpcg directly instead is that when I submitted the code and ran the test cases in the SciPy CI, a few special test cases did not pass because of an accuracy issue (for example, I still vaguely remember the following case,

A = [[-4, -3, -2],
     [-1,  0,  1],
     [ 2,  3,  4]]

the error between the smallest singular value computed by svds and the one computed by numpy.linalg.norm is ~5.3e-8, and the error between the svds result and the exact smallest singular value is also ~5.3e-8). I checked; it has nothing to do with adjusting the parameters tol and maxiter. I finally located the problem: the error in the smallest eigenvalue of A'A is ~2.8e-15 (the exact smallest eigenvalue of A'A, like the exact smallest singular value of A, is zero). In this case, if we take the square root of the smallest eigenvalue of A'A, the error in the singular value becomes O(1e-8), which is not zero and is still a distance of O(1e-8) from NumPy's result or the exact value. So I used lobpcg instead of svds.
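A quick numeric check of that amplification, using the values quoted above:

import numpy as np

# The exact smallest eigenvalue of A'A is 0 (A is singular), but the
# computed one is ~2.8e-15; the square root turns this 1e-15-scale
# eigenvalue error into a 1e-8-scale singular-value error.
print(np.sqrt(2.8e-15))  # ~5.3e-8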

@lobpcg (Contributor) commented Oct 18, 2021

svds should do pretty much exactly the same thing as you do in this PR (I wrote the part that adds lobpcg to it), so I am unsure why there might be a difference in the results, especially for simply computing the single largest singular value.

The case you provide above is for the smallest singular value, so I am also unsure how it relates to the spectral norm.

@zhaog6 (Contributor, Author) commented Oct 18, 2021

@lobpcg In addition to the above reasons, there is another: for scipy.sparse.linalg.norm, I just want to obtain the smallest and largest singular values without introducing additional operations. So I didn't choose scipy.sparse.linalg.eigen.svds, although it is easy to call.

@zhaog6 (Contributor, Author) commented Oct 18, 2021

svds should do pretty much exactly the same thing as you do in this PR (I wrote the part that adds lobpcg to it), so I am unsure why there might be a difference in the results, especially for simply computing the single largest singular value.

The case you provide above is for the smallest singular value, so I am also unsure how it relates to the spectral norm.

Dear Andrew,
In fact, on my server and laptop svds has no accuracy problem (the result is exactly zero), but it shows the accuracy problem (the result becomes 5.3e-8) on SciPy's GitHub Azure platform.

@lobpcg (Contributor) commented Oct 18, 2021

Why is the smallest needed for the norm?

@zhaog6 (Contributor, Author) commented Oct 18, 2021

@lobpcg The largest singular value is no problem on any platform, but the smallest singular value runs into problems on some specific platforms.

@zhaog6 (Contributor, Author) commented Oct 18, 2021

Why is the smallest needed for the norm?

This is because NumPy defines a -2 norm, and SciPy needs to stay aligned with NumPy for such operations. In computational mathematics textbooks, however, a -2 norm is not defined.
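For reference, the NumPy convention being matched here (ord=2 gives the largest singular value, ord=-2 the smallest), shown on the example matrix from above:

import numpy as np

A = np.array([[-4., -3., -2.],
              [-1.,  0.,  1.],
              [ 2.,  3.,  4.]])

print(np.linalg.norm(A, 2))   # largest singular value
print(np.linalg.norm(A, -2))  # smallest singular value (exactly 0, since A is singular)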

@lobpcg (Contributor) commented Oct 18, 2021

I recollect that svds does something special and strange with the smallest singular values, so I believe there may be a difference that may indeed justify separate code for -2.

@zhaog6 (Contributor, Author) commented Oct 18, 2021

I recollect that svds does something special and strange with the smallest singular values, so I believe there may be a difference that may indeed justify separate code for -2.

Dear Andrew @lobpcg,
Okay. If the smallest eigenvalue of A'A is smaller than 1e-14, could it be set to zero? Then it would still be zero after taking the square root.
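A minimal sketch of that cutoff (the helper name is hypothetical, and 1e-14 is just the constant floated in this comment):

import numpy as np

def sv_from_gram_eigs(eigs, cutoff=1e-14):
    # Clip near-zero eigenvalues of A'A to exact zeros before the
    # square root, so tiny eigenvalue noise is not amplified.
    eigs = np.asarray(eigs, dtype=float).copy()
    eigs[eigs < cutoff] = 0.0
    return np.sqrt(eigs)

print(sv_from_gram_eigs([2.8e-15, 54.0]))  # [0.  7.348...]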

@lobpcg (Contributor) commented Oct 18, 2021

Possible; look at the end of the svds code.

@zhaog6 (Contributor, Author) commented Oct 18, 2021

Possible; look at the end of the svds code.

@lobpcg I modified the file _svds.py and the implementation of the 2/-2 norm of sparse matrices. Please let me know if this is not a preferred way to implement it.

@zhaog6 zhaog6 marked this pull request as draft October 18, 2021 09:27
@zhaog6 zhaog6 marked this pull request as ready for review October 18, 2021 12:31
@lobpcg (Contributor) left a comment

You need to make sure that everything works in single precision as well, so you cannot use fixed constants like 1e-14.
You also need unit tests in both precisions.

@zhaog6 (Contributor, Author) commented Oct 19, 2021

You need to make sure that everything works in single precision as well, so you cannot use fixed constants like 1e-14. You also need unit tests in both precisions.

Okay, I'll try it.

@zhaog6 zhaog6 marked this pull request as draft October 19, 2021 01:50
@zhaog6 zhaog6 marked this pull request as ready for review October 19, 2021 12:12
@lobpcg (Contributor) commented Oct 19, 2021

For your absolute constant, use a power of np.finfo(dtype).eps with dtype=np.float64 or np.float32, depending on the case.
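A sketch of such a dtype-aware constant (the exponent is purely illustrative, not the value settled on in this PR):

import numpy as np

def eig_cutoff(dtype):
    # A power of machine epsilon scales automatically between
    # float32 and float64, unlike a hard-coded 1e-14.
    return np.finfo(dtype).eps ** 0.9

print(eig_cutoff(np.float64))  # ~8e-15
print(eig_cutoff(np.float32))  # ~6e-7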

@zhaog6 (Contributor, Author) commented Oct 19, 2021

I resubmitted the PR and used 50 * np.finfo(...).eps in the code (7d792a7). Could I do it this way?

@lobpcg (Contributor) commented Oct 19, 2021

The expression for the threshold should also depend on the shape, and probably some norm, of the input, like in the original code.

@zhaog6 (Contributor, Author) commented Oct 19, 2021

The expression for the threshold should also depend on the shape, and probably some norm, of the input, like in the original code.

Yes, the threshold 50 in the code does indeed depend on float32 or float64. For float64 the threshold is ~20 (with some small fluctuations), whereas for float32 it is probably 30-50 (affected by different platforms).

@lobpcg (Contributor) commented Oct 19, 2021

Not good; the expression should be the same for different dtypes. Try using a fixed power or log of eps instead of multiplying eps by different constants.

@zhaog6 (Contributor, Author) commented Oct 19, 2021

Dear Andrew, setting it to 50 is only meant to cut it off within the fluctuation range of machine accuracy, but the fluctuation doesn't seem to have much regularity, and there is no formula to rely on.

@zhaog6 (Contributor, Author) commented Oct 19, 2021

I don't have much experience with error fluctuations caused by different platforms and different precisions (float32 & float64). For the fixed power or log of eps, I have no idea yet. What value do you think is more reasonable? Thanks.

@lobpcg (Contributor) commented Oct 19, 2021

My suggestion is to try obtaining both values, ~20*np.finfo(np.float64).eps and ~50*np.finfo(np.float32).eps, from a single formula f(x) with x=np.finfo(dtype).eps, where dtype is np.float64 or np.float32. Try to find such a fixed function f(x) experimentally, if possible, so that it handles both cases.
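One way to run that experiment, as a hedged sketch (the candidate powers are arbitrary):

import numpy as np

# Compare eps**p against the per-dtype targets quoted above
# (~20*eps for float64, ~50*eps for float32) for a few powers p.
targets = {np.float64: 20 * np.finfo(np.float64).eps,
           np.float32: 50 * np.finfo(np.float32).eps}

for p in (0.75, 0.8, 0.85, 0.9):
    for dtype, target in targets.items():
        eps = np.finfo(dtype).eps
        print(f"p={p}, {np.dtype(dtype).name}: "
              f"eps**p={eps**p:.2e}, target={target:.2e}")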

@zhaog6 (Contributor, Author) commented Oct 20, 2021

Okay, I'll try to find it, thanks.

@zhaog6 zhaog6 marked this pull request as draft October 20, 2021 00:28
@zhaog6 (Contributor, Author) commented Oct 20, 2021

Because I can't reproduce this problem on my laptop (macOS Big Sur, 64-bit) or my server (Ubuntu 20.04, 64-bit), and it only occurs in the Win32 environment of SciPy's Azure Pipelines CI, I've turned this PR into a draft and will continue testing by pushing to it. If there are any email notifications, please ignore them until I'm ready to ask you to review again. Thanks.

@zhaog6 (Contributor, Author) commented Oct 21, 2021

Dear Andrew @lobpcg, I ran some tests on various platforms in Docker; there is no rule for the fluctuation range of the result. However, I found a bug in NumPy while checking the call chain in NumPy and SciPy (note: the scipy/sparse/linalg/tests/test_norm.py file compares the SciPy norm against the NumPy norm). The bug occurs in the svd function in numpy/linalg/linalg.py:
https://github.com/numpy/numpy/blob/ba3664766d1f8c6b2521b82edd717a2f2be974f2/numpy/linalg/linalg.py#L1659-L1661

For an np.float32 matrix, s in line 1660 is np.float64, and s in line 1661 is converted to np.float32. In other words, the _umath_linalg_svd_n gufunc computes the singular values in np.float64, and then the type conversion (np.float64 -> np.float32) is performed. This is different from svds in SciPy (svds -> lobpcg -> eigh -> LAPACK driver): SciPy truly computes in np.float32 when the given matrix is np.float32, but NumPy does not seem to. The above are only the clues I found while debugging, for reference only.
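A small experiment that makes the discrepancy visible (it only shows that the float32 and float64 results differ at single-precision scale; on its own it does not prove in which precision the computation ran):

import numpy as np

A64 = np.array([[-4., -3., -2.],
                [-1.,  0.,  1.],
                [ 2.,  3.,  4.]])
A32 = A64.astype(np.float32)

s64 = np.linalg.svd(A64, compute_uv=False)
s32 = np.linalg.svd(A32, compute_uv=False)
print(s64)        # float64 singular values
print(s32)        # float32 singular values
print(s64 - s32)  # differences on the order of float32 eps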

@lobpcg (Contributor) commented Oct 21, 2021

Please submit a bug report and the fix, since you have already found it!

@zhaog6 (Contributor, Author) commented Oct 22, 2021

Please submit a bug report and the fix, since you have already found it!

Okay, I'll submit the bug report and fix it later. Thanks.

@lobpcg (Contributor) commented Oct 30, 2021

For the smallest SVD, have a look at #11829

This pull request was closed.