Always build with JIT+LTO by KyleFromNVIDIA · Pull Request #1923 · rapidsai/cuvs

KyleFromNVIDIA · 2026-03-16T20:16:23Z

Since #1909, we no longer rely on `cudaLibraryEnumerateKernels()`, so we can run on older versions of the CUDA driver. Since #1918, we link cudart statically, so the runtime library API is bundled with cuvs and we can run on platforms that have a CUDA version older than 12.8 installed. Always build with JIT+LTO so that we get the full compile-time and binary-size benefits on CUDA 12 too.

cpp/CMakeLists.txt

KyleFromNVIDIA · 2026-03-16T21:50:11Z

It seems that even after #1918, we're still not using cudart_static, since rmm forces us to use the shared version:

./../../..//bin/gtests/libcuvs/STATS_TEST: symbol lookup error: /opt/conda/envs/test/bin/gtests/libcuvs/../../../lib/libcuvs.so: undefined symbol: cudaLibraryGetKernel, version libcudart.so.12

We have to get rmm on cudart_static.
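
The symbol lookup error quoted above means that `libcuvs.so` references `cudaLibraryGetKernel`, but the shared `libcudart.so.12` that the dynamic linker picked up at run time does not export it. As a rough diagnostic sketch (not part of this PR), one way to check whether a given shared library exports a symbol is Python's `ctypes`. The helper name `exports_symbol` is hypothetical, and the library name you pass in is an assumption about what is installed on your system.

```python
import ctypes


def exports_symbol(library, symbol):
    """Return True if `library` can be loaded and exports `symbol`.

    `library` is a file name or path handed to the dynamic loader.
    Passing None probes the symbols already loaded into the current
    process (POSIX only). `exports_symbol` is an illustrative helper,
    not an API from cuvs, rmm, or CUDA.
    """
    try:
        handle = ctypes.CDLL(library)
    except OSError:
        # The library could not be found or loaded at all.
        return False
    try:
        # ctypes resolves the symbol lazily on attribute access and
        # raises AttributeError when the lookup fails.
        getattr(handle, symbol)
    except AttributeError:
        return False
    return True
```

For example, `exports_symbol("libcudart.so.12", "cudaLibraryGetKernel")` returns `False` both when the library is missing and when the installed runtime does not export the symbol, so a `False` result only says the symbol is not reachable through that name.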

KyleFromNVIDIA · 2026-03-16T23:08:24Z

We've decided to switch to the driver API instead, since rmm is blocked on rapidsai/cudf#20814, which in turn is also blocked.

cpp/src/detail/jit_lto/AlgorithmPlanner.cpp

dependencies.yaml

cpp/CMakeLists.txt

- Enable static linking of libcudart by default (`CUDA_STATIC_RUNTIME=ON`)
- Remove `cuda-cudart` from conda recipe run requirements (no longer needed when statically linked)

This is part of a RAPIDS-wide effort to switch to static CUDA runtime linking. See rapidsai/build-planning#235 for tracking.

- `cpp/CMakeLists.txt`: Change `CUDA_STATIC_RUNTIME` default from OFF to ON
- `conda/recipes/cuvs/recipe.yaml`: Remove `cuda-cudart` from run deps
- `conda/recipes/libcuvs/recipe.yaml`: Remove `cuda-cudart` from run deps (4 outputs)

Note: Python builds already use `CUDA_STATIC_RUNTIME=ON` (set in `python/libcuvs/CMakeLists.txt`).

Authors:
- Bradley Dice (https://github.com/bdice)
- Kyle Edwards (https://github.com/KyleFromNVIDIA)

Approvers:
- Kyle Edwards (https://github.com/KyleFromNVIDIA)
- Robert Maynard (https://github.com/robertmaynard)
- Ben Frederickson (https://github.com/benfred)

URL: rapidsai#1627

c/tests/CMakeLists.txt

KyleFromNVIDIA

Remove the debug statement when finished

cpp/src/detail/jit_lto/AlgorithmPlanner.cpp

conda/recipes/libcuvs/recipe.yaml

KyleFromNVIDIA requested review from a team as code owners March 16, 2026 20:16

KyleFromNVIDIA requested a review from msarahan March 16, 2026 20:16

KyleFromNVIDIA added the breaking and improvement labels Mar 16, 2026

github-project-automation bot added this to Unstructured Data Processing Mar 16, 2026

KyleFromNVIDIA changed the base branch from main to release/26.04 March 16, 2026 20:17

KyleFromNVIDIA added the DO NOT MERGE label Mar 16, 2026

KyleFromNVIDIA commented Mar 16, 2026

cpp/CMakeLists.txt (outdated, resolved)

divyegala approved these changes Mar 16, 2026

Use the driver API instead

6c91f9d

KyleFromNVIDIA requested a review from a team as a code owner March 16, 2026 23:26

divyegala reviewed Mar 16, 2026

cpp/src/detail/jit_lto/AlgorithmPlanner.cpp (outdated, resolved)

divyegala reviewed Mar 16, 2026

dependencies.yaml (resolved)

KyleFromNVIDIA added 2 commits March 16, 2026 23:33

Conda recipe

e858407

deps

1972a74

divyegala reviewed Mar 16, 2026

cpp/CMakeLists.txt (outdated, resolved)

KyleFromNVIDIA added 2 commits March 16, 2026 23:43

PRIVATE

4503307

auditwheel

a42ede0

KyleFromNVIDIA requested a review from a team as a code owner March 17, 2026 00:05

KyleFromNVIDIA added 7 commits March 17, 2026 02:43

Conda recipe

e26519f

Merge branch 'release/26.04' into jit-lto-cuda-12

697d1d0

Revert "Conda recipe"

3269055

This reverts commit e26519f.

COMPILE_ONLY

07c50e6

PUBLIC

788fd34

Revert "Use the driver API instead"

e16b88f

This reverts commit 6c91f9d.

Remove driver dep

96e9162

bdice and others added 4 commits March 18, 2026 17:51

Opt out of rmm's cudart dependency

56229e8

Make rmm interface dependency COMPILE_ONLY

0a0540a

Merge branch 'main' into jit-lto-cuda-12

f379ad4

KyleFromNVIDIA changed the base branch from release/26.04 to main March 18, 2026 18:30

aamijar assigned KyleFromNVIDIA Mar 18, 2026

aamijar moved this to In Progress in Unstructured Data Processing Mar 18, 2026

KyleFromNVIDIA added 4 commits March 19, 2026 11:26

Merge branch 'main' into jit-lto-cuda-12

38c9e9d

Push

17c5cd7

Merge branch 'main' into cudart-static

af0a04e

Merge branch 'cudart-static' into jit-lto-cuda-12

8c771a5

KyleFromNVIDIA requested review from a team as code owners March 23, 2026 13:54

KyleFromNVIDIA commented Mar 23, 2026

c/tests/CMakeLists.txt (resolved)

benfred approved these changes Mar 23, 2026

Debugging

b6560be

KyleFromNVIDIA commented Mar 23, 2026

cpp/src/detail/jit_lto/AlgorithmPlanner.cpp (outdated, resolved)

cpp/src/detail/jit_lto/AlgorithmPlanner.cpp (outdated, resolved)

KyleFromNVIDIA added 4 commits March 24, 2026 13:37

Downgrade to compute 7.0 for CUDA 12

84ddcf9

Merge branch 'main' into jit-lto-cuda-12

b08a35d

Remove JIT_LTO_COMPILATION variable

a8493a3

Remove CUVS_ENABLE_JIT_LTO preprocessor definition

997ab66

bdice reviewed Mar 24, 2026

conda/recipes/libcuvs/recipe.yaml (outdated, resolved)

Use libnvjitlink run exports

fe67525

KyleFromNVIDIA removed the DO NOT MERGE label Mar 24, 2026

bdice approved these changes Mar 24, 2026

KyleFromNVIDIA wants to merge 27 commits into rapidsai:main from KyleFromNVIDIA:jit-lto-cuda-12
