Test error in "SVD Converge" in generic.py #69

Ali-Tehrani · 2021-03-08T18:27:49Z

During testing with github action, the follow test failed "test_generic_rectangular_translate_scale" for test_generic.py" with m = 829, n = 878 on python 3.7.

The error is
def _raise_linalgerror_svd_nonconvergence(err, flag): raise LinAlgError("SVD did not converge") E numpy.linalg.LinAlgError: SVD did not converge.

For more info on the github action, see https://github.com/Ali-Tehrani/procrustes/runs/2058961187.

However, running this test on my own machine it passes. So it seems the error is hardware related. Google searching this returns a 2019 SciPy issue [1]. I would recommend to decrease the matrix size so that it isn't hardware/MKL/OpenBlas dependent.

[1] - scipy/scipy#10032

The text was updated successfully, but these errors were encountered:

PaulWAyers · 2021-03-08T18:35:00Z

It may still have a problem, I guess, because matrices with rows/columns of zeros are evil..... It's a strange error (that we encountered once before in GOpt) because SVD is a matrix factorization and therefore is very robust. But the divide-and-conquer algorithm is not as robust (but it is significantly faster). I guess that using smaller matrices is a good compromise, but perhaps we should at least document this error message, which I think is not unlikely to appear when zero-padding is substantial because the size-mismatch of the matrices is large. The obvious solution is to directly access the SVD algorithm that is a factorizaton (instead of the default divide-and-conquer) algorithm. Derrick may remember how to do that. He also had a hack-ish way to fix the error.

FarnazH · 2021-03-09T17:43:26Z

These tests passed on Ubunto with python 3.8 before. Is GitHub Action using two different hardware for these runs?
We can decrease the matrix size and see whether it happens again.

The new runs failed again: https://github.com/theochem/procrustes/runs/2069045083#step:5:354

PaulWAyers · 2021-03-09T18:05:26Z

The solution here, which may also be useful to @tczorro , is to use LAPACK directly. Construct the factorized SVD (more robust than the divide-and-conquer iterative algorithm), using (I think) scipy.linalg.lapack.gesvd though you may need to prefix the gesvd routine with dgesvd (float) or zgesvd (complex). Once you have the SVD (and this one should always work), you can then use

Replace all elements of the sigma (singular value) diagonal that are greater than a threshhold with their inverse; replace values smaller than the threshold with zero. Call this inverse matrix Sigma-1, and its inverse .T (.H for Hermitian transpose).
Then construct V {dot} Sigma.T {dot} U.H = pseudoinverse.

The traditional threshhold value for step 1 is, for an mxn input matrix A, max(m,n)*epsilon(elements of A). I.e., it is the machine-precision for the elements of A times the number of elements in the larger dimension. This could be approximated easily by merely using max(m,n)*1e-15, which is good enough.

This can be made (potentially much) more efficient by truncating the singular-value-inverse, U, and V to include only the elements corresponding to the nonzero rows/columns. That way the matrix multiplication is faster (because you are not computing a lot of "multiply something by zero" terms).

See: theochem#69

FarnazH · 2021-03-09T18:14:14Z

We can use the lapack_driver argument of https://docs.scipy.org/doc/scipy/reference/generated/scipy.linalg.svd.html

See: theochem#69

Ali-Tehrani · 2021-03-09T18:28:37Z

The generic function uses Morse Penrose Inverse from numpy and it uses the divide and conquer method from Lapack (gesdd) to calculate the SVD, whereas SciPy's Morse Penrose inverse uses the general rectangular approach ('gesvd') [1]. So I think that we just need to change from numpy to scipy.

[1] - https://stackoverflow.com/questions/13265299/the-difference-of-pseudo-inverse-between-scipy-and-numpy

See: theochem#69

PaulWAyers · 2021-03-09T18:38:57Z

That's a good trick. @tczorro should look at it too. You have to fall down the rabbit-hole of documentation a bit to find these things sometimes :-( .

If this works, it may be worth remembering that if someone tried a VERY huge matrix, the speed/memory benefits of the pinv2 algorithm in scipy/numpy might help. But Numpy had a stupid tolerance. And if you have a huge matrix, the robustness of a "real SVD" instead of an "iterative SVD" is liekly to be important.....

See: theochem#69

FarnazH · 2021-03-09T18:54:01Z

I think scipy.linalg.pinv fixed the problem... While we are at this, should we use scipy.linalg.svd instead of numpy.linalg.svd (so we can set the lapack_driver='gesvd')?

PaulWAyers · 2021-03-09T21:05:21Z

I would use scipy.linalg.svd with lapack_drive='gesvd' everywhere. Where that is not appropriate (where speed is critical) probably there are better choices than SVD (which is a bit of a brute-force approach inherently).

FanwangM · 2021-03-09T23:09:38Z

Using scipy.linalg.pinv instead of numpy.linalg.pinv can also fix the problem of failing generic Procrustes, #61. I have tested that this idea can fix the issuse both locally and on GitHub as well. This can lead to a more robust implementation of generic Procrustes testing without setting a random seed. So we should do the resting without adding a random see.

Thanks for proposing to use scipy.linalg.pinv! @FarnazH @Ali-Tehrani

FarnazH · 2021-03-10T17:00:28Z

@Ali-Tehrani can you please take care of using scipy.linalg instead of numpy.linalg throughout the code? Thanks a lot.

Related to issue theochem#69, since scipy svd allows better control of the type of SVD algorithm that is being used.

Related to issue theochem#69, added to rotation, permutation and symmetric.

Ali-Tehrani changed the title ~~Test error in "SVD Converge" for python 3.7~~ Test error in "SVD Converge" Mar 8, 2021

Ali-Tehrani changed the title ~~Test error in "SVD Converge"~~ Test error in "SVD Converge" in generic.py Mar 8, 2021

FarnazH mentioned this issue Mar 9, 2021

Refactor test symmetric.py #70

Merged

FarnazH added a commit to Ali-Tehrani/procrustes that referenced this issue Mar 9, 2021

Change the random seed for test_generic

247d249

See: theochem#69

FarnazH added a commit to Ali-Tehrani/procrustes that referenced this issue Mar 9, 2021

Change seed back & default cutoff value of np SVD

c35b9b0

See: theochem#69

FarnazH added a commit to Ali-Tehrani/procrustes that referenced this issue Mar 9, 2021

Use scipy.linalg.pinv that uses least-squares

f4fdfab

See: theochem#69

FarnazH added a commit to Ali-Tehrani/procrustes that referenced this issue Mar 9, 2021

Use scipy.linalg.pinv that uses least-squares

7a18f72

See: theochem#69

FanwangM mentioned this issue Mar 9, 2021

Use random matrices for genric Procrustes tests #61

Merged

Ali-Tehrani added a commit to Ali-Tehrani/procrustes that referenced this issue Mar 10, 2021

Change numpy svd to scipy svd with param gesvd

2b2c7f1

Related to issue theochem#69, since scipy svd allows better control of the type of SVD algorithm that is being used.

Ali-Tehrani added a commit to Ali-Tehrani/procrustes that referenced this issue Mar 10, 2021

Change numpy svd to scipy svd with param 'gesvd'

33a2049

Related to issue theochem#69, added to rotation, permutation and symmetric.

Ali-Tehrani mentioned this issue Mar 10, 2021

Change svd #77

Merged

FarnazH mentioned this issue Mar 14, 2021

lapack_driver argument #78

Closed

FarnazH closed this as completed Mar 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test error in "SVD Converge" in generic.py #69

Test error in "SVD Converge" in generic.py #69

Ali-Tehrani commented Mar 8, 2021 •

edited

Loading

PaulWAyers commented Mar 8, 2021

FarnazH commented Mar 9, 2021 •

edited

Loading

PaulWAyers commented Mar 9, 2021

FarnazH commented Mar 9, 2021

Ali-Tehrani commented Mar 9, 2021 •

edited

Loading

PaulWAyers commented Mar 9, 2021 •

edited

Loading

FarnazH commented Mar 9, 2021 •

edited

Loading

PaulWAyers commented Mar 9, 2021

FanwangM commented Mar 9, 2021 •

edited

Loading

FarnazH commented Mar 10, 2021

Test error in "SVD Converge" in generic.py #69

Test error in "SVD Converge" in generic.py #69

Comments

Ali-Tehrani commented Mar 8, 2021 • edited Loading

PaulWAyers commented Mar 8, 2021

FarnazH commented Mar 9, 2021 • edited Loading

PaulWAyers commented Mar 9, 2021

FarnazH commented Mar 9, 2021

Ali-Tehrani commented Mar 9, 2021 • edited Loading

PaulWAyers commented Mar 9, 2021 • edited Loading

FarnazH commented Mar 9, 2021 • edited Loading

PaulWAyers commented Mar 9, 2021

FanwangM commented Mar 9, 2021 • edited Loading

FarnazH commented Mar 10, 2021

Ali-Tehrani commented Mar 8, 2021 •

edited

Loading

FarnazH commented Mar 9, 2021 •

edited

Loading

Ali-Tehrani commented Mar 9, 2021 •

edited

Loading

PaulWAyers commented Mar 9, 2021 •

edited

Loading

FarnazH commented Mar 9, 2021 •

edited

Loading

FanwangM commented Mar 9, 2021 •

edited

Loading