Added support for calling superlu with 64-bit integer support #20338

liuyangzhuan · 2024-03-26T21:58:59Z

The current tarball of the superlu SRC supports 64-bit integers. In other words, the matrix size m, n are still 32bit, but the number of nonzeros, index array and pointer array (CSC or CSR) can be 64 bit integers.

However the current scipy interface in linsolve.py always call _safe_downcast_indices to convert the input matrix into 32bit indexed and call superlu.

This PR fixes the issue by using an environment variable XSDK_INDEX_SIZE to pass to the meson build system at compile time, as well as to select the correct idx_dtype at runtime when superlu is being used. More specifically, the 64-bit build can be done as follows:

export XSDK_INDEX_SIZE=64
CFLAGS="-DXSDK_INDEX_SIZE=${XSDK_INDEX_SIZE}" meson setup build

Note that one still need export XSDK_INDEX_SIZE=64 at runtime. That's why I used the environment variable approach.
Note that the 32-bit build is used by default, or by setting export XSDK_INDEX_SIZE=32

rgommers

Thanks for working on this @liuyangzhuan. It looks like a good start. The environment variable usage won't work though, that is way to fragile. What we need here instead is to build SuperLU twice, so we can pass it both 32-bit and 64-bit indices to avoid the downcasting. That should "just work" without the user needing to toggle any build or runtime knobs.

Binary size impact is small-ish, since the extension is 300 kb unpacked in a release build. So wheel size contribution is smaller than that; doubling it seems acceptable.

The most important thing is to check how BLAS/LAPACK usage is handled; I haven't looked but I am guessing that it requires ILP64 BLAS support to make this work. If so, then that's in the works but will be optional so it'll only work with custom builds. Meaning that 64-bit indices should continue to be downcasted for default (non-ILP64-enabled) builds.

There may be some hiccups in having two separate extension modules that are in use at the same time, not sure.

liuyangzhuan · 2024-03-27T16:50:32Z

Hi, @rgommers, I agree that env is not the perfect way to go. Building superlu twice is fine with me, as this is the way BLAS/LAPACK being used in scipy. But it seems that for superlu, the bridging C routines e.g.,
https://github.com/liuyangzhuan/scipy/blob/main/scipy/sparse/linalg/_dsolve/_superlumodule.c
https://github.com/liuyangzhuan/scipy/blob/main/scipy/sparse/linalg/_dsolve/_superluobject.c
need also to be built twice. Note that currently I'm using "int_t" to be either int or int64_t based on -DXSDK_INDEX_SIZE. I'm not sure how easy you can modify these bridging routines.

BTW, superlu always use 32-bit BLAS, either it's vendor-based or its own internal blas. This might cause some conflict when ILP64 BLAS is used in other place of scipy. @xiaoyeli

rgommers · 2024-03-27T18:46:57Z

BTW, superlu always use 32-bit BLAS, either it's vendor-based or its own internal blas. This might cause some conflict when ILP64 BLAS is used in other place of scipy. @xiaoyeli

32-bit BLAS will always be available for internal use, since the exported scipy.linalg.cython_blas and cython_lapack APIs are 32-bit and cannot change.

liuyangzhuan · 2024-03-27T19:08:51Z

BTW, superlu always use 32-bit BLAS, either it's vendor-based or its own internal blas. This might cause some conflict when ILP64 BLAS is used in other place of scipy. @xiaoyeli

32-bit BLAS will always be available for internal use, since the exported scipy.linalg.cython_blas and cython_lapack APIs are 32-bit and cannot change.

Cool. Any idea about the bridging C files?

rgommers · 2024-03-28T07:58:49Z

Any idea about the bridging C files?

What I'd suggest trying is to indeed build those twice too. So you produce _superlu.so and _superlu_64.so, and the corresponding static libraries. There may be a few things in those two files that may need a symbol suffix, e.g.:

#define PY_ARRAY_UNIQUE_SYMBOL _scipy_sparse_superlu_ARRAY_API

if you do that by passing a compile argument, I hope you can build everything twice without conflicts.

liuyangzhuan added 2 commits March 26, 2024 14:47

add support for 64-bit index superlu interface

1c1f243

Merge branch 'scipy:main' into main

2cf1b9e

liuyangzhuan requested a review from perimosocordiae as a code owner March 26, 2024 21:59

github-actions bot added scipy.sparse.linalg scipy.sparse C/C++ Items related to the internal C/C++ code base labels Mar 26, 2024

rgommers requested changes Mar 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added support for calling superlu with 64-bit integer support #20338

Added support for calling superlu with 64-bit integer support #20338

liuyangzhuan commented Mar 26, 2024

rgommers left a comment

liuyangzhuan commented Mar 27, 2024

rgommers commented Mar 27, 2024

liuyangzhuan commented Mar 27, 2024

rgommers commented Mar 28, 2024

Added support for calling superlu with 64-bit integer support #20338

Are you sure you want to change the base?

Added support for calling superlu with 64-bit integer support #20338

Conversation

liuyangzhuan commented Mar 26, 2024

rgommers left a comment

Choose a reason for hiding this comment

liuyangzhuan commented Mar 27, 2024

rgommers commented Mar 27, 2024

liuyangzhuan commented Mar 27, 2024

rgommers commented Mar 28, 2024