Skip to content

Refactor: support modern cuSOLVERMpwith NCCL (replaces libcal)#6951

Draft
dzzz2001 wants to merge 3 commits intodevelopfrom
transition-to-nccl
Draft

Refactor: support modern cuSOLVERMpwith NCCL (replaces libcal)#6951
dzzz2001 wants to merge 3 commits intodevelopfrom
transition-to-nccl

Conversation

@dzzz2001
Copy link
Collaborator

@dzzz2001 dzzz2001 commented Feb 4, 2026

Summary

  • Replace deprecated libcal dependency with NCCL for cuSOLVERMp/cuBLASMp initialization
  • Update CMake configuration to find NCCL instead of CAL library
  • Simplify DiagoCusolverMp implementation to use NCCL communicators directly
  • Update toolchain documentation

Closes #6940

Background

Starting from cuBLASMp v0.5.0 and cuSOLVERMp v0.7.0, NVIDIA has transitioned from the Communication Abstraction Library (libcal) to using NCCL directly. This PR updates ABACUS to be compatible with NVIDIA HPC SDK 25.9+ .

Note

After this PR, Abacus is only compatible with cuSolverMp version 0.70 or later, and HPC SDK version 25.9 or later.
🤖 Generated with Claude Code

@dzzz2001 dzzz2001 changed the title Support modern cuSOLVERMp/cuBLASMp with NCCL (replaces libcal) Refactor: support modern cuSOLVERMpwith NCCL (replaces libcal) Feb 4, 2026
@dzzz2001
Copy link
Collaborator Author

dzzz2001 commented Feb 4, 2026

I updated some toolchain-related docs. @QuantumMisaka , could you please review them? The toolchain might need further changes

@dzzz2001 dzzz2001 marked this pull request as draft February 4, 2026 16:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Feature Request: Support for modern cuSOLVERMp/cuBLASMp (v0.5.0+ and v0.7.0+) without libcal dependency

1 participant