Do not segfault in svd(a) with VT.size > INT_MAX #20349

ev-br · 2024-03-28T10:24:18Z

Reference issue

What does this implement/fix?

linalg.svd with too large matrices may segfault if m*n > int_max on large-memory machines (on smaller memory machines it may fail with a MemoryError instead). The root cause is an integer overflow in indexing 2D arrays, deep in the LAPACK code.

Thus, detect a possible error condition in the f2py wrapper and error out early.

Additional information

Suggested by @pearu in #14001 (comment)

Adding this kind of check to f2py may still make sense IMO. No reason to not guard against this specific segfault in the meantime though.

ilayn · 2024-03-28T10:37:38Z

Looks good to me. We can also catch these in the Python side without passing them already to f2py but that's a minor point.

pearu

I suggest using an alternative approach that does not involve using sqrt and F_INT_MAX, and that provides perhaps a more useful exception message for the end users.

Notice also that checking overflow in m * n is an approximation for resolving the problem. There still exists combinations of m and n such that m * n does not overflow but can cause segfaults because the int overflow may occur from other terms that are added to m * n result (each lapack function may have different expression that involves m * n).

pearu · 2024-03-28T10:49:53Z

scipy/linalg/flapack_gen.pyf.src

+    check(m <= sqrt(F_INT_MAX)/n)
+    integer intent(hide),depend(m,n) :: minmn = MIN(m,n)


Another approach of detecting integer overflow that does not involve introducing F_INT_MAX and not using sqrt, is (untested):

Suggested change

check(m <= sqrt(F_INT_MAX)/n)

integer intent(hide),depend(m,n) :: minmn = MIN(m,n)

integer intent(hide),depend(m,n) :: mn_overflow = m * n

check(m == 0 || mn_overflow / m == n)

integer intent(hide),depend(m,n) :: minmn = MIN(m,n)

or similar.

Notice that when the check fails, the check expression is displayed to the user. So, it is advisable to use an expression that users interpret it correctly.

Or just use

check(m == 0 || (m * n) / m == n)

pearu · 2024-03-28T11:58:50Z

scipy/linalg/_decomp_svd.py

+    if lapack_driver == 'gesdd' and compute_uv:
+        max_mn = max(a1.shape)
+        # XXX: revisit int32 when ILP64 lapack becomes a thing
+        if max_mn > math.sqrt(numpy.iinfo(numpy.int32).max):


Using max(m, n) > sqrt(float32_max) is unnecessarily restrictive for cases where min(m, n) is relatively small.

Why not use exact overflow condition that is defined by gesdd source code?

ISTM the product of m*n does not really matter. Let a1.shape == (2, 53130) and compute_uv==True. The orthogonal matrix VT has shape (53130, 53130) and causes the overflow.

In the example of #14001 (comment), note that
print*, n*n gives -1472170396

Demo:

$ nano gesdd.i4.f90 ev-br@qgpu3:~/temp$ gfortran gesdd.i4.f90 -lblas -llapack ev-br@qgpu3:~/temp$ ./a.out -1472170396 info = 0 opt lwork = 1700166 Program received signal SIGSEGV: Segmentation fault - invalid memory reference. Backtrace for this error: #0 0x7f1028a0c51f in ??? at ./signal/../sysdeps/unix/sysv/linux/x86_64/libc_sigaction.c:0 #1 0x7f102a82577b in ??? #2 0x7f102a83d898 in ??? #3 0x7f102a83da70 in ??? #4 0x7f102a7d1769 in ??? #5 0x561ff340c261 in ??? #6 0x561ff340c43a in ??? #7 0x7f10289f3d8f in __libc_start_call_main at ../sysdeps/nptl/libc_start_call_main.h:58 #8 0x7f10289f3e3f in __libc_start_main_impl at ../csu/libc-start.c:392 #9 0x561ff340b138 in ??? Segmentation fault (core dumped)

with

$ cat gesdd.i4.f90 implicit none character*1 :: jobz = 'A' integer*4 :: m, n integer*4 :: lda, ldu, ldvt, lwork integer*4 :: info integer*4, allocatable :: iwork(:) real*8, allocatable :: a(:, :) real*8, allocatable :: s(:) real*8, allocatable :: u(:, :) real*8, allocatable :: vt(:, :) real*8, allocatable :: work(:) ! m < n only m = 2 !4799 ! <<<<< HERE n = 53130 lda = m ldu = m ldvt = n allocate(a(lda, n), s(n), u(ldu, m), vt(ldvt, n)) allocate(iwork(8*n)) a = 1.d0 ! workspace query allocate(work(1)) lwork = -1 call dgesdd(jobz, m, n, a, lda, s, u, ldu, vt, ldvt, work, lwork, iwork, info) lwork = int(work(1)) print*, n*n print*, 'info = ', info print*, 'opt lwork = ', lwork deallocate(work) allocate(work(lwork)) call dgesdd(jobz, m, n, a, lda, s, u, ldu, vt, ldvt, work, lwork, iwork, info) print*, 'info = ', info !print*, s print*, 'max(sigma) = ', maxval(s) end

OK, relaxed the condition for full_matrices=False, where the VT size is n, min(m, n).

ev-br · 2024-03-28T12:01:05Z

Pushed an update: 1) The failure mode in gh-14001 is not m*n overflows; it's when the U and VT matrices are requested, they have the shape (m, m) and (n, n) and one of n*n or m*m overflows. So check that. 2) Move the check to the python level to simplify things and give a more informative error message.

linalg.svd with too large matrices may segfault if max(m, n)*max(m, n) > int_max on large-memory machines (on smaller memory machines it may fail with a MemoryError instead). The root cause is an integer overflow in indexing 2D arrays, deep in the LAPACK code. Thus, detect a possible error condition and bail out early.

scipy/linalg/_decomp_svd.py

pearu

I have one minor nit, otherwise, LGTM! Thanks, @ev-br!

scipy/linalg/_decomp_svd.py

Co-authored-by: Pearu Peterson <pearu.peterson@gmail.com>

ev-br · 2024-03-28T15:28:16Z

Thanks for the reviews Pearu, Ilhan. Merging as approved

ev-br added defect A clear bug or issue that prevents SciPy from being installed or used as expected scipy.linalg labels Mar 28, 2024

ev-br requested review from larsoner and ilayn as code owners March 28, 2024 10:24

ev-br force-pushed the gesdd_segfault branch from c51ad59 to 400a3ef Compare March 28, 2024 10:42

pearu suggested changes Mar 28, 2024

View reviewed changes

ev-br force-pushed the gesdd_segfault branch 4 times, most recently from 7664655 to 0ac0e41 Compare March 28, 2024 11:58

pearu reviewed Mar 28, 2024

View reviewed changes

ev-br force-pushed the gesdd_segfault branch from 0ac0e41 to f0f70d4 Compare March 28, 2024 12:40

ev-br force-pushed the gesdd_segfault branch from f0f70d4 to ddb3964 Compare March 28, 2024 12:41

ev-br changed the title ~~Do not segfault in svd(a) with a.size > INT_MAX~~ Do not segfault in svd(a) with VT.size > INT_MAX Mar 28, 2024

pearu reviewed Mar 28, 2024

View reviewed changes

scipy/linalg/_decomp_svd.py Show resolved Hide resolved

pearu approved these changes Mar 28, 2024

View reviewed changes

scipy/linalg/_decomp_svd.py Outdated Show resolved Hide resolved

Update scipy/linalg/_decomp_svd.py

b810fdb

Co-authored-by: Pearu Peterson <pearu.peterson@gmail.com>

ev-br merged commit 4a343ab into scipy:main Mar 28, 2024
29 checks passed

ev-br added this to the 1.14.0 milestone Mar 28, 2024

tylerjereddy added the backport-candidate This fix should be ported by a maintainer to previous SciPy versions. label Mar 28, 2024

rgommers mentioned this pull request Mar 29, 2024

BUG: linalg: support empty arrays #20295

Merged

92 tasks

tylerjereddy modified the milestones: 1.14.0, 1.13.0 Apr 1, 2024

tylerjereddy mentioned this pull request Apr 1, 2024

MAINT, REL: Prepare for SciPy 1.13.0 "final" (proposing to skip RC2 for Numpy 2 series support) #20375

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not segfault in svd(a) with VT.size > INT_MAX #20349

Do not segfault in svd(a) with VT.size > INT_MAX #20349

ev-br commented Mar 28, 2024

ilayn commented Mar 28, 2024

pearu left a comment

pearu Mar 28, 2024

pearu Mar 28, 2024

pearu Mar 28, 2024

ev-br Mar 28, 2024

ev-br Mar 28, 2024

ev-br Mar 28, 2024

ev-br commented Mar 28, 2024 •

edited

pearu left a comment

ev-br commented Mar 28, 2024

		check(m <= sqrt(F_INT_MAX)/n)
		integer intent(hide),depend(m,n) :: minmn = MIN(m,n)

Do not segfault in svd(a) with VT.size > INT_MAX #20349

Do not segfault in svd(a) with VT.size > INT_MAX #20349

Conversation

ev-br commented Mar 28, 2024

Reference issue

What does this implement/fix?

Additional information

ilayn commented Mar 28, 2024

pearu left a comment

Choose a reason for hiding this comment

pearu Mar 28, 2024

Choose a reason for hiding this comment

pearu Mar 28, 2024

Choose a reason for hiding this comment

pearu Mar 28, 2024

Choose a reason for hiding this comment

ev-br Mar 28, 2024

Choose a reason for hiding this comment

ev-br Mar 28, 2024

Choose a reason for hiding this comment

ev-br Mar 28, 2024

Choose a reason for hiding this comment

ev-br commented Mar 28, 2024 • edited

pearu left a comment

Choose a reason for hiding this comment

ev-br commented Mar 28, 2024

ev-br commented Mar 28, 2024 •

edited