
Numba-accelerated truncated SVD fails to converge #189

Closed
juliendrapeau opened this issue Jul 5, 2023 · 2 comments

juliendrapeau commented Jul 5, 2023

Hi @jcmgray,

What is happening?

While adding gates to large MPSs with the gate_with_auto_swap function, I sometimes get a convergence-failure error during the SVD computation of the swapping process. The problem lies in the numba-accelerated part of the svd_truncated function in the decomp module: the gesdd LAPACK routine used for the SVD there does not guarantee convergence. Although the failure is rare, it still raises a hard error when it occurs.
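For reference, the two drivers can be compared directly through SciPy (a minimal sketch; the matrix here is just a random example, not one that actually triggers the failure):

```python
import numpy as np
import scipy.linalg as sla

# An arbitrary well-conditioned example matrix (not a failing case).
x = np.random.default_rng(0).normal(size=(8, 6))

# SciPy's default driver is the divide-and-conquer 'gesdd': fast, but it
# can raise LinAlgError("SVD did not converge") on rare inputs.
U, s, VH = sla.svd(x, full_matrices=False)

# 'gesvd' uses QR iteration: slower, but much more robust.
U2, s2, VH2 = sla.svd(x, full_matrices=False, lapack_driver="gesvd")

# When both succeed, each factorization reconstructs x.
assert np.allclose((U * s) @ VH, x)
assert np.allclose((U2 * s2) @ VH2, x)
```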

How to possibly fix it?

I was able to circumvent this issue by using the gesvd LAPACK routine provided by SciPy. A quick fix to guarantee convergence would be to fall back to the SciPy function whenever the numba function fails to converge. For example, replacing the original function:

@svd_truncated.register("numpy")
@njit  # pragma: no cover
def svd_truncated_numba(
    x, cutoff=-1.0, cutoff_mode=3, max_bond=-1, absorb=0, renorm=0
):
    """Accelerated version of ``svd_truncated`` for numpy arrays."""
    U, s, VH = np.linalg.svd(x, full_matrices=False)
    return _trim_and_renorm_svd_result_numba(
        U, s, VH, cutoff, cutoff_mode, max_bond, absorb, renorm
    )

with the following functions seems to fix the problem:

@njit  # pragma: no cover
def svd_truncated_numba(
    x, cutoff=-1.0, cutoff_mode=3, max_bond=-1, absorb=0, renorm=0
):
    """Accelerated version of ``svd_truncated`` for numpy arrays."""
    U, s, VH = np.linalg.svd(x, full_matrices=False)
    return _trim_and_renorm_svd_result_numba(
        U, s, VH, cutoff, cutoff_mode, max_bond, absorb, renorm
    )


def svd_truncated_scipy(
    x, cutoff=-1.0, cutoff_mode=3, max_bond=-1, absorb=0, renorm=0
):
    """Non-accelerated version of ``svd_truncated`` for numpy arrays with guaranteed convergence by scipy."""
    U, s, VH = sp.linalg.svd(x, full_matrices=False, lapack_driver="gesvd")
    return _trim_and_renorm_svd_result_numba(
        U, s, VH, cutoff, cutoff_mode, max_bond, absorb, renorm
    )


@svd_truncated.register("numpy")
def svd_truncated_numba_scipy(
    x, cutoff=-1.0, cutoff_mode=3, max_bond=-1, absorb=0, renorm=0
):
    """Accelerated version of ``svd_truncated`` for numpy arrays with guaranteed convergence by scipy."""
    try:
        return svd_truncated_numba(x, cutoff, cutoff_mode, max_bond, absorb, renorm)
    except np.linalg.LinAlgError:
        # the numba/gesdd path failed to converge; retry with scipy's gesvd
        return svd_truncated_scipy(x, cutoff, cutoff_mode, max_bond, absorb, renorm)

Is there a better way to deal with this issue?

Thank you for your help.


jcmgray commented Jul 13, 2023

Hi @juliendrapeau, yes something along those lines seems reasonable, maybe with a warning raised as well? A fallback like this happens in the core part of quimb for some functions too.
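The fallback-plus-warning idea might look something like the following (a sketch only; svd_with_fallback is a hypothetical helper name used for illustration, not quimb's actual API):

```python
import warnings

import numpy as np
import scipy.linalg as sla


def svd_with_fallback(x):
    """SVD via numpy's fast gesdd-based path, falling back to scipy's
    robust gesvd driver, with a warning, on convergence failure.

    (Hypothetical helper name, for illustration only.)
    """
    try:
        return np.linalg.svd(x, full_matrices=False)
    except np.linalg.LinAlgError:
        warnings.warn(
            "np.linalg.svd (gesdd) failed to converge; "
            "falling back to scipy's gesvd driver.",
            RuntimeWarning,
        )
        return sla.svd(x, full_matrices=False, lapack_driver="gesvd")


U, s, VH = svd_with_fallback(np.eye(4))
```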

Generally I have found that if one is having problems with linear algebra routine convergence then there is usually some instability in the higher tensor network algorithm that should be addressed, e.g. large differences in norms of tensors appearing.

However, with #192 as well, I wonder if something has changed about the numba SVD implementation that has begun to cause this issue. If you have time, it would be helpful to know whether:

  1. it appears for earlier versions of numba,
  2. it appears for a different backend such as torch, and
  3. you could save an actual matrix/tensor it occurs on.

No worries if not; it's just a little hard to reproduce these things otherwise.
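For point 3, a thin wrapper that dumps the offending matrix before re-raising would make the failing case easy to attach to an issue (a sketch; the function and file names here are arbitrary examples, not quimb code):

```python
import numpy as np


def svd_dumping_failures(x, dump_path="svd_failure_input.npy"):
    """Run the SVD, but save the input matrix to disk if it fails to
    converge, so the exact failing case can be shared and reproduced.

    (Hypothetical helper; ``dump_path`` is an arbitrary example name.)
    """
    try:
        return np.linalg.svd(x, full_matrices=False)
    except np.linalg.LinAlgError:
        np.save(dump_path, x)  # preserve the exact failing input
        raise  # re-raise so the caller still sees the error
```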


jcmgray commented Apr 26, 2024

I restored the fallback to scipy in b507abc, so I'm closing this for the moment. Feel free to re-open if the problem persists!

@jcmgray jcmgray closed this as completed Apr 26, 2024