Add CUSOLVERRF.jl integration for GPU-accelerated sparse LU factorization #673

ChrisRackauckas-Claude · 2025-08-05T01:47:03Z

Summary

This PR adds support for NVIDIA's cusolverRF sparse LU factorization library through a package extension, providing high-performance GPU-accelerated solving for sparse linear systems.

Motivation

CUSOLVERRF.jl provides access to NVIDIA's cusolverRF library, which offers significant performance improvements for sparse LU factorization on GPUs. This integration makes it accessible through LinearSolve.jl's unified interface.

Key Features

New CUSOLVERRFFactorization algorithm with configurable options:
- symbolic: Choose between :RF (default) or :KLU for symbolic factorization
- reuse_symbolic: Reuse symbolic factorization for matrices with same sparsity pattern
Automatic CPU-to-GPU conversion for convenience
Support for multiple right-hand sides
Adjoint solve support
Comprehensive test suite

Implementation Details

The implementation follows LinearSolve.jl's extension pattern:

Extension module in ext/LinearSolveCUSOLVERRFExt.jl
Core types and exports in src/factorization.jl and src/LinearSolve.jl
Weak dependency configuration in Project.toml
Tests in test/gpu/cusolverrf.jl

Usage Example

using LinearSolve, CUSOLVERRF, SparseArrays

# Create sparse system
A = sprand(1000, 1000, 0.01) + 5I
b = rand(1000)

# Solve with default options
prob = LinearProblem(A, b)
sol = solve(prob, CUSOLVERRFFactorization())

# Use KLU for symbolic factorization
sol = solve(prob, CUSOLVERRFFactorization(symbolic = :KLU))

Limitations

Only supports Float64 element types with Int32 indices (CUSOLVERRF limitation)
Requires CUDA-capable GPU

Testing

Tests have been added to the GPU test suite and can be run with appropriate hardware.

This is a rebased version of #651.

🤖 Generated with Claude Code

Project.toml

…tion This PR adds support for NVIDIA's cusolverRF sparse LU factorization library through a package extension. CUSOLVERRF provides high-performance GPU-accelerated factorization for sparse matrices. Key features: - New `CUSOLVERRFFactorization` algorithm with configurable symbolic factorization (RF or KLU) - Automatic CPU-to-GPU conversion for convenience - Support for multiple right-hand sides - Reusable symbolic factorization for matrices with same sparsity pattern - Adjoint solve support - Comprehensive test suite The implementation follows LinearSolve.jl's extension pattern, similar to the existing CUDSS integration. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

Include CUSOLVERRF tests in the GPU test suite when the package is available. The tests are conditionally included to avoid failures when CUSOLVERRF.jl is not installed. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

- Added CUSOLVERRF to recommended methods for sparse matrices - Added CUSOLVERRF section in the full list of solvers - Added CUSOLVERRF examples in GPU tutorial documentation - Documented supported options and limitations 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

- Updated sparse matrices recommendation to include both CUDSS.jl and CUSOLVERRF.jl - Clarified that CUDSS provides interface to NVIDIA's cuDSS library - Maintained that both offer high performance for GPU-accelerated sparse LU factorization 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

- Clarified that CUDSS works through LUFactorization() when CUDSS.jl is loaded - Explained that it automatically uses cuDSS for CuSparseMatrixCSR arrays - Removed incorrect reference to a separate CUDSS factorization type 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

src/extension_algs.jl

Project.toml

ext/LinearSolveCUSOLVERRFExt.jl

test/gpu/cusolverrf.jl

ext/LinearSolveCUSOLVERRFExt.jl

test/gpu/cusolverrf.jl

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

Project.toml Outdated Show resolved Hide resolved

claude and others added 11 commits August 5, 2025 14:34

Update Project.toml

e40ad85

Update Project.toml

cc7911b

Update factorization.jl

235e333

Update extension_algs.jl

f784d42

Update solvers.md

0ac5d28

Update Project.toml

d7f1f8c

ChrisRackauckas force-pushed the add-cusolverrf-support branch from 6fb23da to d7f1f8c Compare August 5, 2025 18:34

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

src/extension_algs.jl Outdated Show resolved Hide resolved

Update src/extension_algs.jl

0a075fe

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

src/extension_algs.jl Outdated Show resolved Hide resolved

Update src/extension_algs.jl

1c1e917

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

Project.toml Outdated Show resolved Hide resolved

ChrisRackauckas added 2 commits August 5, 2025 16:59

Update Project.toml

b92906c

Update Project.toml

e88bad8

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

Project.toml Outdated Show resolved Hide resolved

Update Project.toml

82fbc55

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

ext/LinearSolveCUSOLVERRFExt.jl Outdated Show resolved Hide resolved

Update ext/LinearSolveCUSOLVERRFExt.jl

7a8dac7

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

ext/LinearSolveCUSOLVERRFExt.jl Outdated Show resolved Hide resolved

Update ext/LinearSolveCUSOLVERRFExt.jl

288d382

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

ext/LinearSolveCUSOLVERRFExt.jl Outdated Show resolved Hide resolved

Update ext/LinearSolveCUSOLVERRFExt.jl

62bc9ae

ChrisRackauckas reviewed Aug 5, 2025

View reviewed changes

ext/LinearSolveCUSOLVERRFExt.jl Outdated Show resolved Hide resolved

Update ext/LinearSolveCUSOLVERRFExt.jl

d559e8b

ChrisRackauckas reviewed Aug 6, 2025

View reviewed changes

ext/LinearSolveCUSOLVERRFExt.jl Show resolved Hide resolved

Update ext/LinearSolveCUSOLVERRFExt.jl

5175137

ChrisRackauckas reviewed Aug 6, 2025

View reviewed changes

test/gpu/cusolverrf.jl Outdated Show resolved Hide resolved

Update test/gpu/cusolverrf.jl

f1f3bb8

ChrisRackauckas reviewed Aug 6, 2025

View reviewed changes

ext/LinearSolveCUSOLVERRFExt.jl Outdated Show resolved Hide resolved

Update ext/LinearSolveCUSOLVERRFExt.jl

6db7c55

ChrisRackauckas reviewed Aug 6, 2025

View reviewed changes

ext/LinearSolveCUSOLVERRFExt.jl Outdated Show resolved Hide resolved

Update ext/LinearSolveCUSOLVERRFExt.jl

b8ca961

ChrisRackauckas reviewed Aug 6, 2025

View reviewed changes

test/gpu/cusolverrf.jl Outdated Show resolved Hide resolved

ChrisRackauckas added 2 commits August 5, 2025 21:41

Update test/gpu/cusolverrf.jl

6a96db1

Update resolve.jl

b4bd9ed

ChrisRackauckas merged commit a35ef8d into SciML:main Aug 6, 2025
95 of 100 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add CUSOLVERRF.jl integration for GPU-accelerated sparse LU factorization #673

Add CUSOLVERRF.jl integration for GPU-accelerated sparse LU factorization #673

Uh oh!

ChrisRackauckas-Claude commented Aug 5, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add CUSOLVERRF.jl integration for GPU-accelerated sparse LU factorization #673

Add CUSOLVERRF.jl integration for GPU-accelerated sparse LU factorization #673

Uh oh!

Conversation

ChrisRackauckas-Claude commented Aug 5, 2025

Summary

Motivation

Key Features

Implementation Details

Usage Example

Limitations

Testing

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!