Ensure that all CUDA kernels in cudf have hidden visibility. #14726

robertmaynard · 2024-01-09T15:31:44Z

Description

To correct potential issues when using a static CUDA runtime, we mark all kernels with internal linkage via the `static` keyword or with hidden visibility.

Note: This doesn't fix dependencies, but focuses just on the CUDA kernels in cudf directly.
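
As a rough illustration of the two options named in the description, the sketch below declares one kernel with internal linkage and one with hidden visibility. The kernel names and the host-only fallback for `__global__` are assumptions added for illustration; none of this is code from the PR.

```cpp
// Sketch only. With nvcc this is CUDA; the fallback below is an assumption
// that lets the same text compile as plain C++ for a quick host-side check.
#ifndef __CUDACC__
#define __global__  // no-op outside nvcc
#endif

// Option 1: internal linkage. Each translation unit gets a private copy,
// so the kernel symbol is not exported from the library.
static __global__ void fill_static(int* out, int value) { *out = value; }

// Option 2: external linkage with hidden visibility (GCC/Clang attribute
// syntax). The symbol is shared within the library but not exported from it.
__attribute__((visibility("hidden"))) __global__ void fill_hidden(int* out, int value)
{
  *out = value;
}
```

Under nvcc these would be launched with the usual `<<<grid, block>>>` syntax; in the host-only fallback they behave as ordinary functions.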

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

cpp/include/cudf/types.hpp

bdice · 2024-01-09T15:41:25Z

cpp/src/io/comp/gpuinflate.cu

@@ -1024,7 +1024,7 @@ __device__ int parse_gzip_header(uint8_t const* src, size_t src_size)
 * @param parse_hdr If nonzero, indicates that the compressed bitstream includes a GZIP header
 */
 template <int block_size>
-__global__ void __launch_bounds__(block_size)
+CUDF_KERNEL void __launch_bounds__(block_size)


Should we take this opportunity to normalize the order of launch bounds, CUDF_KERNEL, and the return type across all kernels in libcudf? Some put launch bounds first, others put it last.

Happy to make everything consistent as part of the PR, and we can always discuss offline or follow up on what style we want. I don't want to hold up the entire PR over a style issue, though.

We can defer on this. It's easy enough to change later. Just wanted to raise that question in case you had a clear preference. I don't know which one I prefer. Maybe CUDF_KERNEL __launch_bounds__(...) void, but that doesn't align with any of the kernels that I saw.

You could further future-proof it (and potentially enable wider compatibility) by making a version that takes the bounds as parameters:

CUDF_KERNEL_WITH_LAUNCH_BOUNDS(...) void foo(...)

Or even make all kernels use the same macro with varargs.

CUDF_KERNEL(...) void foo(...)
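
A minimal sketch of the first suggestion, assuming the launch-bounds attribute may be placed before the return type as discussed above. The macro and kernel names are hypothetical and are not part of libcudf; the host-only fallbacks are likewise an assumption so the snippet can be compiled without nvcc.

```cpp
// Sketch only. The fallbacks below are assumptions for compiling without nvcc.
#ifndef __CUDACC__
#define __global__              // no-op outside nvcc
#define __launch_bounds__(...)  // no-op outside nvcc
#endif

// Hypothetical macro: forwards its arguments to __launch_bounds__ and marks
// the kernel static, so the ordering is fixed in exactly one place.
#define EXAMPLE_KERNEL_WITH_LAUNCH_BOUNDS(...) \
  static __global__ __launch_bounds__(__VA_ARGS__)

// Hypothetical kernel that records the block size it was instantiated with.
template <int block_size>
EXAMPLE_KERNEL_WITH_LAUNCH_BOUNDS(block_size) void store_block_size(int* out)
{
  *out = block_size;
}
```

A single variadic `CUDF_KERNEL(...)` would additionally have to handle the case where no bounds are passed, which this sketch does not attempt.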

cpp/src/join/mixed_join_kernel.cuh

cpp/tests/error/error_handling_test.cu

jrhemstad · 2024-01-09T17:55:24Z

cpp/include/cudf/types.hpp

+/**
+ * @brief Indicates that the function is a CUDA kernel
+ */
+#define CUDF_KERNEL __global__ static


note (non-blocking): I don't think anyone builds libcudf with rdc=true, but if you wanted to be extra pedantic, then CUDF_KERNEL should expand to __attribute__ ((visibility ("hidden"))) when __CUDACC_RDC__ is defined in order to preserve the binary size improvements that come from symbol deduplication within the DLL with rdc=true.
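
One way the note above could be sketched, assuming GCC/Clang attribute syntax. `EXAMPLE_KERNEL` and `write_answer` are hypothetical names used only for illustration; `__CUDACC_RDC__` is the macro nvcc defines in relocatable device code mode. The host-only fallback for `__global__` is an assumption for compiling without nvcc.

```cpp
// Sketch only. The fallback below is an assumption for compiling without nvcc.
#ifndef __CUDACC__
#define __global__  // no-op outside nvcc
#endif

#if defined(__CUDACC_RDC__)
// rdc=true: keep external linkage so duplicate kernel symbols can be merged
// within the library, but hide them from the exported symbol table.
#define EXAMPLE_KERNEL __global__ __attribute__((visibility("hidden")))
#else
// Default: internal linkage, matching the CUDF_KERNEL definition above.
#define EXAMPLE_KERNEL __global__ static
#endif

// Hypothetical kernel declared through the macro.
EXAMPLE_KERNEL void write_answer(int* out) { *out = 42; }
```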

cpp/examples/strings/custom_optimized.cu

harrism · 2024-01-11T02:55:07Z

This looks like a PR that will need to be duplicated across RAPIDS. So I think it should have a rapidsai/build-planning issue with a checklist of per-repo issues.

robertmaynard · 2024-01-11T14:45:27Z

> This looks like a PR that will need to be duplicated across RAPIDS. So I think it should have a rapidsai/build-planning issue with a checklist of per-repo issues.

You are correct, we can track the meta issue at: rapidsai/build-planning#12

robertmaynard · 2024-01-17T15:12:17Z

/merge

This marks all kernels in CUCO as `static` so that they have internal linkage and won't conflict when used by multiple DSOs. I didn't see a single shared/common header in cuco where I could place a `CUCO_KERNEL` macro, so I modified each instance instead.

While `cccl` went with a `__attribute__ ((visibility ("hidden")))` approach to help reduce RDC size, this approach seemed very invasive for cuco. This is due to the fact that we would need to pragma push and pop both gcc warnings and nvcc warnings in each cuco header so that we don't introduce any warnings. This is needed as the compiler incorrectly states that the `__attribute__ ((visibility ("hidden")))` has no side-effect.

Context: rapidsai/cudf#14726 NVIDIA/cccl#166 rapidsai/raft#1722

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Yunsong Wang <yunsongw@nvidia.com>

robertmaynard requested review from a team as code owners January 9, 2024 15:31

robertmaynard requested review from shrshi and nvdbaranec January 9, 2024 15:31

github-actions bot added libcudf Affects libcudf (C++/CUDA) code. CMake CMake build issue labels Jan 9, 2024

bdice changed the title ~~Ensuree that all CUDA kernels in cudf have hidden visibility.~~ Ensure that all CUDA kernels in cudf have hidden visibility. Jan 9, 2024

robertmaynard force-pushed the bug/mark_kernels_as_static branch from 5c496d6 to a0ee7a9 Compare January 9, 2024 15:36

github-actions bot removed the CMake CMake build issue label Jan 9, 2024

Mark all CUDA kernels in cudf have hidden visibility.

3e2caac

robertmaynard force-pushed the bug/mark_kernels_as_static branch from a0ee7a9 to 3e2caac Compare January 9, 2024 15:38

robertmaynard added bug Something isn't working non-breaking Non-breaking change labels Jan 9, 2024

bdice reviewed Jan 9, 2024

Add doc strings to CUDF macros

670031b

robertmaynard force-pushed the bug/mark_kernels_as_static branch 2 times, most recently from 967a9e1 to 3968a89 Compare January 9, 2024 16:15

Correct issues found by review

6a53ceb

robertmaynard force-pushed the bug/mark_kernels_as_static branch from 452869e to 6a53ceb Compare January 9, 2024 17:18

robertmaynard added 2 commits January 9, 2024 12:45

Correct issues found by CI

9f8499b

Don't use CUDF_KERNEL from tests that don't include cudf

0fd6c03

jrhemstad reviewed Jan 9, 2024

github-actions bot added the CMake CMake build issue label Jan 9, 2024

robertmaynard force-pushed the bug/mark_kernels_as_static branch from c8b9590 to 0fd6c03 Compare January 9, 2024 19:35

github-actions bot removed the CMake CMake build issue label Jan 9, 2024

This was referenced Jan 9, 2024

Mark all cuco kernels as static so they have hidden visibility NVIDIA/cuCollections#422

Merged

[BUG] Static builds of libcudf cause runtime failures due to public kernel symbols #14734

Closed

GregoryKimball assigned robertmaynard Jan 10, 2024

ttnghia reviewed Jan 10, 2024

cpp/examples/strings/custom_optimized.cu

robertmaynard added the Spark Functionality that helps Spark RAPIDS label Jan 11, 2024

robertmaynard requested review from bdice and jrhemstad January 11, 2024 21:34

bdice approved these changes Jan 13, 2024

robertmaynard and others added 2 commits January 16, 2024 16:04

Merge branch 'branch-24.02' into bug/mark_kernels_as_static

a57144e

Merge branch 'branch-24.02' into bug/mark_kernels_as_static

0368028

ttnghia approved these changes Jan 17, 2024

rapids-bot bot merged commit 6abef4a into rapidsai:branch-24.02 Jan 17, 2024
66 of 67 checks passed

robertmaynard deleted the bug/mark_kernels_as_static branch January 17, 2024 15:12

ttnghia mentioned this pull request Jan 25, 2024

[FEA] Mark all kernels with hidden visibility NVIDIA/spark-rapids-jni#1734

Closed

jlowe mentioned this pull request Jan 30, 2024

Update native RAPIDS accelerated UDF examples to hide kernel symbol visibility NVIDIA/spark-rapids-examples#356

Open

jihoonson mentioned this pull request Jun 24, 2024

All kernels in the JNI should have hidden visibility NVIDIA/spark-rapids-jni#2168

Merged

robertmaynard commented Jan 9, 2024

bdice Jan 9, 2024

robertmaynard Jan 9, 2024

bdice Jan 9, 2024

harrism Jan 9, 2024

jrhemstad Jan 9, 2024

harrism commented Jan 11, 2024

robertmaynard commented Jan 11, 2024

robertmaynard commented Jan 17, 2024
