-
-
Notifications
You must be signed in to change notification settings - Fork 67
Add CUSOLVERRF.jl integration for GPU-accelerated sparse LU factorization #673
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
ChrisRackauckas
merged 26 commits into
SciML:main
from
ChrisRackauckas-Claude:add-cusolverrf-support
Aug 6, 2025
Merged
Add CUSOLVERRF.jl integration for GPU-accelerated sparse LU factorization #673
ChrisRackauckas
merged 26 commits into
SciML:main
from
ChrisRackauckas-Claude:add-cusolverrf-support
Aug 6, 2025
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…tion This PR adds support for NVIDIA's cusolverRF sparse LU factorization library through a package extension. CUSOLVERRF provides high-performance GPU-accelerated factorization for sparse matrices. Key features: - New `CUSOLVERRFFactorization` algorithm with configurable symbolic factorization (RF or KLU) - Automatic CPU-to-GPU conversion for convenience - Support for multiple right-hand sides - Reusable symbolic factorization for matrices with same sparsity pattern - Adjoint solve support - Comprehensive test suite The implementation follows LinearSolve.jl's extension pattern, similar to the existing CUDSS integration. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
Include CUSOLVERRF tests in the GPU test suite when the package is available. The tests are conditionally included to avoid failures when CUSOLVERRF.jl is not installed. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
- Added CUSOLVERRF to recommended methods for sparse matrices - Added CUSOLVERRF section in the full list of solvers - Added CUSOLVERRF examples in GPU tutorial documentation - Documented supported options and limitations 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
- Updated sparse matrices recommendation to include both CUDSS.jl and CUSOLVERRF.jl - Clarified that CUDSS provides interface to NVIDIA's cuDSS library - Maintained that both offer high performance for GPU-accelerated sparse LU factorization 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
- Clarified that CUDSS works through LUFactorization() when CUDSS.jl is loaded - Explained that it automatically uses cuDSS for CuSparseMatrixCSR arrays - Removed incorrect reference to a separate CUDSS factorization type 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>
6fb23da
to
d7f1f8c
Compare
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
This PR adds support for NVIDIA's cusolverRF sparse LU factorization library through a package extension, providing high-performance GPU-accelerated solving for sparse linear systems.
Motivation
CUSOLVERRF.jl provides access to NVIDIA's cusolverRF library, which offers significant performance improvements for sparse LU factorization on GPUs. This integration makes it accessible through LinearSolve.jl's unified interface.
Key Features
CUSOLVERRFFactorization
algorithm with configurable options:symbolic
: Choose between:RF
(default) or:KLU
for symbolic factorizationreuse_symbolic
: Reuse symbolic factorization for matrices with same sparsity patternImplementation Details
The implementation follows LinearSolve.jl's extension pattern:
ext/LinearSolveCUSOLVERRFExt.jl
src/factorization.jl
andsrc/LinearSolve.jl
Project.toml
test/gpu/cusolverrf.jl
Usage Example
Limitations
Float64
element types withInt32
indices (CUSOLVERRF limitation)Testing
Tests have been added to the GPU test suite and can be run with appropriate hardware.
This is a rebased version of #651.
🤖 Generated with Claude Code