Initial CUB/NVRTC support #1081

gevtushenko · 2023-11-10T07:14:35Z

Description

closes #403

This PR provides initial CUB/NVRTC support and adds a simple smoke test to check that basic warp- and block-scope functionality works. The testing approach is temporary. Once the PR is merged, we'll need to hoist nvrtcc to the CCCL level and start using it for CUB.

Checklist

New or existing tests cover these changes.
The documentation is up to date with these changes.

miscco · 2023-11-10T07:22:50Z

cub/cub/util_type.cuh

-
-#include <cuda.h>
+#include <cuda/std/limits>
+#include <cuda/__cccl_config> // _LIBCUDACXX_CUDACC_VER


That should not be necessary as we pull in __cccl_config from cuda/__config

@miscco did you mean cub/config.cuh?

I meant that <cuda/std/limits> already pulls it in, as does <cub/config.cuh>.

I prefer having independent headers with explicit inclusion of what's used. Do you anticipate any issues with including cuda/__cccl_config directly?

miscco · 2023-11-10T07:23:50Z

cub/cub/util_type.cuh

+#if !defined(_LIBCUDACXX_COMPILER_NVRTC)
+#  include <iterator>
+#else
+#  include <cuda/std/iterator>
+#endif


I am under the impression that we are quite feature-complete now with respect to iterator. What do we need from std?
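For readers unfamiliar with the pattern in the diff above, here is a minimal standalone sketch of header selection under NVRTC. It assumes the compiler-provided `__CUDACC_RTC__` macro (predefined by NVRTC) in place of `_LIBCUDACXX_COMPILER_NVRTC`, so that it does not depend on CCCL configuration headers. The `iter_ns` alias and the `value_t` helper are hypothetical names used only for illustration and are not part of this PR.

```cpp
// NVRTC cannot see host standard library headers, so pick the libcu++
// equivalent when compiling at runtime and the host header otherwise.
#if defined(__CUDACC_RTC__)
#  include <cuda/std/iterator>
namespace iter_ns = ::cuda::std; // hypothetical alias, not a CUB name
#else
#  include <iterator>
namespace iter_ns = ::std; // hypothetical alias, not a CUB name
#endif

// Hypothetical helper: the value type that an iterator refers to,
// resolved through whichever namespace was selected above.
template <class It>
using value_t = typename iter_ns::iterator_traits<It>::value_type;
```

If `<cuda/std/iterator>` covers everything the header needs, as the question above suggests, the conditional could collapse to a single unconditional include.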

cub/test/catch2_test_nvrtc.cu

wmaxey · 2023-11-13T18:59:47Z

Swap _LIBCUDACXX macros with _CCCL macros.
_LIBCUDACXX_(.*)->_CCCL_$1
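A rough sketch of that substitution applied to source text follows. It deliberately tightens the pattern from `(.*)` to `\w*` with a leading word boundary, so that several macros on one line are each rewritten; that change is an assumption on top of the suggestion above. `rename_macros` is a hypothetical helper name. The rename is purely mechanical and does not check that a given `_CCCL_` macro actually exists.

```cpp
#include <regex>
#include <string>

// Hypothetical helper: rewrite every _LIBCUDACXX_<NAME> token to _CCCL_<NAME>.
// \b keeps identifiers such as MY_LIBCUDACXX_X untouched, and \w* stops the
// capture at the end of the identifier instead of the end of the line.
std::string rename_macros(const std::string& text) {
  static const std::regex pattern("\\b_LIBCUDACXX_(\\w*)");
  return std::regex_replace(text, pattern, "_CCCL_$1");
}
```

For example, `rename_macros("#if !defined(_LIBCUDACXX_COMPILER_NVRTC)")` yields `#if !defined(_CCCL_COMPILER_NVRTC)`.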

gevtushenko · 2023-11-13T23:05:52Z

This PR is blocked until #1097 is merged

cub/test/CMakeLists.txt

Initial CUB/NVRTC support

33426a5

gevtushenko requested review from a team as code owners November 10, 2023 07:14

gevtushenko requested review from wmaxey, elstehle and miscco and removed request for a team November 10, 2023 07:14

miscco approved these changes Nov 10, 2023


jrhemstad reviewed Nov 10, 2023


cub/test/catch2_test_nvrtc.cu (outdated, resolved)

gevtushenko added 2 commits November 10, 2023 09:34

Compile to SASS instead of PTX

cbeea3a

Detect current GPU arch

422cb1f
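The two commits above are not reproduced in this conversation. As a hedged illustration of the idea only, NVRTC's `--gpu-architecture` option accepts a virtual target (`compute_XY`, producing PTX) or a real target (`sm_XY`, producing SASS). The sketch below builds that option string from a compute capability, which a test could query at runtime, for example with `cudaDeviceGetAttribute`. `nvrtc_arch_option` is a hypothetical helper and is not taken from the PR.

```cpp
#include <string>

// Hypothetical helper: build the NVRTC architecture option for a device with
// compute capability major.minor. sass == true selects a real architecture
// (sm_XY); sass == false selects a virtual one (compute_XY).
std::string nvrtc_arch_option(int major, int minor, bool sass) {
  const std::string prefix = sass ? "sm_" : "compute_";
  return "--gpu-architecture=" + prefix + std::to_string(major * 10 + minor);
}
```

The resulting string would be passed in the options array of `nvrtcCompileProgram`.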

gevtushenko added 3 commits November 14, 2023 09:06

Merge branch 'main' into enh-main/github/cub_nvrtc

da86457

Guard printf usage in clang cuda

fcfffc5

Suppress complex literal warnings

f0cee9b

gevtushenko requested a review from a team as a code owner November 14, 2023 18:29

gevtushenko commented Nov 14, 2023


cub/test/CMakeLists.txt (resolved)

wmaxey approved these changes Nov 14, 2023


gevtushenko merged commit db37b60 into NVIDIA:main Nov 15, 2023
518 checks passed

leofang mentioned this pull request Mar 8, 2024

Do not use plain thrust namespace in complex clone cupy/cupy#8221

Open
