
[Backport] Add stream__wrapper test to thrust #4492

Merged
miscco merged 3 commits into NVIDIA:branch/3.0.x from alliepiper:backport_3.0.x_on_cudaStream_t
Apr 22, 2025


@alliepiper
Contributor

Add test to ensure that stream wrappers work with `thrust::device.on`.

* Use `cudaStream_t` for `thrust::device.on(...)`.

This was recently switched to use `cuda::stream_ref`, which broke users who have their own custom stream wrappers (nvbench, rmm, probably others).
There's no real benefit to using a `stream_ref` here, and it breaks existing implicit conversions.

* Add test to ensure that stream wrappers work with thrust::device.on
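
One plausible reading of "breaks existing implicit conversions" is the C++ rule that an implicit conversion sequence may contain at most one user-defined conversion. The sketch below illustrates that rule with stand-in types only; it does not use CUDA or Thrust. `raw_stream_t`, `stream_ref_like`, `stream_wrapper`, `on_raw`, and `on_ref` are hypothetical names invented for this illustration and are assumptions, not the real `cudaStream_t`, `cuda::stream_ref`, or `thrust::device.on` declarations.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical stand-in for a raw stream handle such as cudaStream_t.
struct raw_stream_tag {};
using raw_stream_t = raw_stream_tag*;

// Hypothetical stand-in for a non-owning stream reference type that is
// implicitly constructible from the raw handle (assumption for this sketch).
struct stream_ref_like
{
  stream_ref_like(raw_stream_t s) : handle(s) {}
  raw_stream_t handle;
};

// Hypothetical user-side wrapper that converts implicitly to the raw handle,
// similar in spirit to the custom wrappers mentioned in the description.
struct stream_wrapper
{
  explicit stream_wrapper(raw_stream_t s) : handle(s) {}
  operator raw_stream_t() const { return handle; }
  raw_stream_t handle;
};

// Two candidate shapes for an on(...)-style entry point.
inline raw_stream_t on_raw(raw_stream_t s) { return s; }           // takes the raw handle
inline raw_stream_t on_ref(stream_ref_like s) { return s.handle; } // takes the ref type

// wrapper -> raw handle needs one user-defined conversion: allowed.
static_assert(std::is_convertible<stream_wrapper, raw_stream_t>::value,
              "wrapper converts to the raw handle");

// wrapper -> raw handle -> ref type would need two user-defined conversions:
// not allowed in a single implicit conversion sequence.
static_assert(!std::is_convertible<stream_wrapper, stream_ref_like>::value,
              "wrapper does not implicitly convert to the ref stand-in");

// Returns true when the wrapper can be passed straight to the raw-handle
// overload and the handle arrives unchanged. Calling on_ref(wrapper) here
// would fail to compile for the reason asserted above.
inline bool wrapper_works_with_raw_handle_overload()
{
  raw_stream_tag storage;
  stream_wrapper wrapper(&storage);
  return on_raw(wrapper) == &storage;
}
```

Under these assumptions, a parameter of the raw handle type keeps wrapper call sites compiling, while a parameter of the reference type would require every caller to convert explicitly.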
@alliepiper requested review from a team as code owners on April 17, 2025, 18:38
@github-project-automation (bot) moved this to Todo in CCCL on April 17, 2025
@alliepiper requested a review from rwgk on April 17, 2025, 18:38
@cccl-authenticator-app (bot) moved this from Todo to In Review in CCCL on April 17, 2025
@alliepiper changed the base branch from main to branch/3.0.x on April 17, 2025, 18:39
@alliepiper changed the title from "[Backpor] Add stream__wrapper test to thrust" to "[Backport] Add stream__wrapper test to thrust" on April 17, 2025
@alliepiper requested review from miscco and removed the review requests for a team and rwgk on April 17, 2025, 18:39
@miscco enabled auto-merge (squash) on April 22, 2025, 06:26
@github-actions
Contributor

🟩 CI finished in 1h 35m: Pass: 100%/97 | Total: 18h 41m | Avg: 11m 33s | Max: 1h 13m | Hits: 98%/134408
  • 🟩 cub: Pass: 100%/45 | Total: 10h 20m | Avg: 13m 47s | Max: 28m 08s | Hits: 97%/53817

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 10h 00m | Avg: 13m 57s | Max: 28m 08s | Hits:  97%/51371 
      🟩 arm64              Pass: 100%/2   | Total: 20m 06s | Avg: 10m 03s | Max: 10m 26s | Hits:  97%/2446  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 59m 22s | Avg: 11m 52s | Max: 20m 08s | Hits:  97%/5944  
      🟩 12.6               Pass: 100%/2   | Total: 22m 13s | Avg: 11m 06s | Max: 11m 14s | Hits:  98%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  8h 59m | Avg: 14m 11s | Max: 28m 08s | Hits:  97%/45613 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  4m 57s | Hits: 100%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 59m 22s | Avg: 11m 52s | Max: 20m 08s | Hits:  97%/5944  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 22m 13s | Avg: 11m 06s | Max: 11m 14s | Hits:  98%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  8h 49m | Avg: 14m 41s | Max: 28m 08s | Hits:  97%/43505 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  4m 57s | Hits: 100%/2108  
      🟩 nvcc               Pass: 100%/43  | Total: 10h 10m | Avg: 14m 12s | Max: 28m 08s | Hits:  97%/51709 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 39m 05s | Avg:  9m 46s | Max:  9m 58s | Hits:  97%/4900  
      🟩 Clang15            Pass: 100%/2   | Total: 20m 26s | Avg: 10m 13s | Max: 10m 25s | Hits:  97%/2446  
      🟩 Clang16            Pass: 100%/2   | Total: 19m 57s | Avg:  9m 58s | Max: 10m 09s | Hits:  97%/2446  
      🟩 Clang17            Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 11s | Hits:  97%/2446  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 26m | Avg: 12m 18s | Max: 25m 46s | Hits:  98%/8223  
      🟩 GCC7               Pass: 100%/2   | Total: 19m 41s | Avg:  9m 50s | Max: 10m 17s | Hits:  97%/2450  
      🟩 GCC8               Pass: 100%/1   | Total: 10m 42s | Avg: 10m 42s | Max: 10m 42s | Hits:  97%/1225  
      🟩 GCC9               Pass: 100%/2   | Total: 21m 14s | Avg: 10m 37s | Max: 10m 48s | Hits:  97%/2450  
      🟩 GCC10              Pass: 100%/2   | Total: 20m 33s | Avg: 10m 16s | Max: 10m 18s | Hits:  96%/2450  
      🟩 GCC11              Pass: 100%/2   | Total: 21m 22s | Avg: 10m 41s | Max: 10m 46s | Hits:  97%/2446  
      🟩 GCC12              Pass: 100%/2   | Total: 20m 57s | Avg: 10m 28s | Max: 10m 37s | Hits:  96%/2446  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 33m | Avg: 19m 25s | Max: 28m 08s | Hits:  95%/13453 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 40m 36s | Avg: 20m 18s | Max: 20m 28s | Hits:  99%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 43m 45s | Avg: 21m 52s | Max: 21m 55s | Hits:  99%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 22m 13s | Avg: 11m 06s | Max: 11m 14s | Hits:  98%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  3h 05m | Avg: 10m 55s | Max: 25m 46s | Hits:  98%/20461 
      🟩 GCC                Pass: 100%/22  | Total:  5h 28m | Avg: 14m 55s | Max: 28m 08s | Hits:  96%/26920 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 24m | Avg: 21m 05s | Max: 21m 55s | Hits:  99%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total: 22m 13s | Avg: 11m 06s | Max: 11m 14s | Hits:  98%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 16m | Avg: 25m 22s | Max: 28m 08s | Hits:  89%/3669  
      🟩 rtx2080            Pass: 100%/34  | Total:  6h 24m | Avg: 11m 18s | Max: 21m 55s | Hits:  97%/40364 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 40m | Avg: 20m 00s | Max: 26m 51s | Hits:  99%/9784  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  7h 14m | Avg: 11m 43s | Max: 28m 08s | Hits:  96%/44033 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 31s | Avg: 22m 31s | Max: 22m 31s | Hits:  99%/1223  
      🟩 GraphCapture       Pass: 100%/1   | Total: 20m 53s | Avg: 20m 53s | Max: 20m 53s | Hits:  99%/1223  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 18m | Avg: 26m 11s | Max: 26m 51s | Hits:  99%/3669  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 31s | Max: 22m 11s | Hits:  99%/3669  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 16m | Avg: 25m 22s | Max: 28m 08s | Hits:  89%/3669  
      🟩 90;90a;100         Pass: 100%/1   | Total: 11m 30s | Avg: 11m 30s | Max: 11m 30s | Hits:  96%/1223  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 53m | Avg: 11m 39s | Max: 21m 55s | Hits:  97%/23677 
      🟩 20                 Pass: 100%/25  | Total:  6h 27m | Avg: 15m 29s | Max: 28m 08s | Hits:  97%/30140 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 33m | Avg: 8m 44s | Max: 28m 47s | Hits: 99%/80271

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 02s | Avg:  9m 01s | Max: 11m 33s | Hits:  99%/3570  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 23m | Avg:  8m 54s | Max: 28m 47s | Hits:  99%/76702 
      🟩 arm64              Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  5m 39s | Hits:  99%/3569  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 40m 53s | Avg:  8m 10s | Max: 21m 19s | Hits:  99%/8916  
      🟩 12.6               Pass: 100%/2   | Total: 30m 54s | Avg: 15m 27s | Max: 15m 44s | Hits:  99%/3568  
      🟩 12.8               Pass: 100%/38  | Total:  5h 21m | Avg:  8m 28s | Max: 28m 47s | Hits:  99%/67787 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  5m 11s | Hits:  99%/3568  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 40m 53s | Avg:  8m 10s | Max: 21m 19s | Hits:  99%/8916  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 30m 54s | Avg: 15m 27s | Max: 15m 44s | Hits:  99%/3568  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  5h 11m | Avg:  8m 39s | Max: 28m 47s | Hits:  99%/64219 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  5m 11s | Hits:  99%/3568  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 23m | Avg:  8m 54s | Max: 28m 47s | Hits:  99%/76703 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  6m 02s | Hits:  99%/7136  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  6m 04s | Hits:  99%/3568  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 34s | Avg:  5m 47s | Max:  5m 54s | Hits:  99%/3568  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 57s | Hits:  99%/3568  
      🟩 Clang18            Pass: 100%/7   | Total: 44m 37s | Avg:  6m 22s | Max: 10m 27s | Hits:  99%/12488 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 50s | Hits:  99%/3570  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s | Hits:  99%/1785  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  5m 56s | Hits:  99%/3570  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 00s | Avg:  6m 00s | Max:  6m 17s | Hits:  99%/3570  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  5m 57s | Hits:  99%/3570  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 31s | Avg:  6m 15s | Max:  6m 23s | Hits:  99%/3570  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 19m | Avg:  7m 54s | Max: 11m 33s | Hits:  99%/17850 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 44m 48s | Avg: 22m 24s | Max: 23m 29s | Hits:  99%/3556  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 14m | Avg: 24m 55s | Max: 28m 47s | Hits:  99%/5334  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 30m 54s | Avg: 15m 27s | Max: 15m 44s | Hits:  99%/3568  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 40m | Avg:  5m 55s | Max: 10m 27s | Hits:  99%/30328 
      🟩 GCC                Pass: 100%/21  | Total:  2h 22m | Avg:  6m 47s | Max: 11m 33s | Hits:  99%/37485 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 59m | Avg: 23m 55s | Max: 28m 47s | Hits:  99%/8890  
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 54s | Avg: 15m 27s | Max: 15m 44s | Hits:  99%/3568  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 03s | Avg:  8m 01s | Max: 11m 11s | Hits:  99%/3570  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 16m | Avg:  7m 46s | Max: 23m 29s | Hits:  99%/58868 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 00m | Avg: 12m 04s | Max: 28m 47s | Hits:  99%/17833 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  5h 03m | Avg:  7m 59s | Max: 23m 35s | Hits:  99%/67785 
      🟩 TestCPU            Pass: 100%/3   | Total: 45m 02s | Avg: 15m 00s | Max: 28m 47s | Hits:  99%/5347  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 40s | Avg: 11m 10s | Max: 11m 33s | Hits:  99%/7139  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 03s | Avg:  8m 01s | Max: 11m 11s | Hits:  99%/3570  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 00s | Avg:  7m 00s | Max:  7m 00s | Hits:  99%/1785  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 52m | Avg:  8m 37s | Max: 23m 29s | Hits:  99%/35671 
      🟩 20                 Pass: 100%/23  | Total:  3h 22m | Avg:  8m 49s | Max: 28m 47s | Hits:  99%/41030 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 32s | Avg: 4m 08s | Max: 5m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  5m 01s
      🟩 arm64              Pass: 100%/2   | Total:  6m 50s | Avg:  3m 25s | Max:  3m 33s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 32s | Avg:  4m 08s | Max:  5m 01s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 32s | Avg:  4m 08s | Max:  5m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 32s | Avg:  4m 08s | Max:  5m 01s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 32s | Avg:  4m 08s | Max:  5m 01s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 32s | Avg:  4m 08s | Max:  5m 01s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 32s | Avg:  4m 08s | Max:  5m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 32s | Avg:  4m 08s | Max:  5m 01s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 34s | Avg:  4m 17s | Max:  5m 01s
      🟩 20                 Pass: 100%/2   | Total:  7m 58s | Avg:  3m 59s | Max:  4m 41s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 07s | Avg: 8m 33s | Max: 14m 53s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 14m 53s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 14m 53s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 14m 53s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 14m 53s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 14m 53s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 14m 53s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max: 14m 53s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 14s | Avg:  2m 14s | Max:  2m 14s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 13m | Avg: 1h 13m | Max: 1h 13m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 97)

# Runner
68 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@github-actions
Contributor

🟩 CI finished in 1h 33m: Pass: 100%/97 | Total: 16h 39m | Avg: 10m 18s | Max: 1h 11m | Hits: 99%/134408
  • 🟩 cub: Pass: 100%/45 | Total: 8h 15m | Avg: 11m 00s | Max: 25m 40s | Hits: 99%/53817

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 03m | Avg: 11m 14s | Max: 25m 40s | Hits:  99%/51371 
      🟩 arm64              Pass: 100%/2   | Total: 11m 51s | Avg:  5m 55s | Max:  6m 16s | Hits:  99%/2446  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 46m 56s | Avg:  9m 23s | Max: 23m 02s | Hits:  99%/5944  
      🟩 12.6               Pass: 100%/2   | Total: 22m 17s | Avg: 11m 08s | Max: 11m 13s | Hits:  98%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  7h 06m | Avg: 11m 12s | Max: 25m 40s | Hits:  99%/45613 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 19s | Avg:  5m 09s | Max:  5m 10s | Hits: 100%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 46m 56s | Avg:  9m 23s | Max: 23m 02s | Hits:  99%/5944  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 22m 17s | Avg: 11m 08s | Max: 11m 13s | Hits:  98%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 55m | Avg: 11m 33s | Max: 25m 40s | Hits:  99%/43505 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 19s | Avg:  5m 09s | Max:  5m 10s | Hits: 100%/2108  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 05m | Avg: 11m 16s | Max: 25m 40s | Hits:  99%/51709 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 24m 06s | Avg:  6m 01s | Max:  6m 23s | Hits: 100%/4900  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 21s | Avg:  6m 40s | Max:  6m 45s | Hits: 100%/2446  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 49s | Avg:  6m 24s | Max:  6m 25s | Hits: 100%/2446  
      🟩 Clang17            Pass: 100%/2   | Total: 13m 09s | Avg:  6m 34s | Max:  6m 54s | Hits: 100%/2446  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 13m | Avg: 10m 28s | Max: 24m 26s | Hits: 100%/8223  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 51s | Avg:  6m 25s | Max:  6m 49s | Hits:  99%/2450  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 03s | Avg:  7m 03s | Max:  7m 03s | Hits:  99%/1225  
      🟩 GCC9               Pass: 100%/2   | Total: 13m 23s | Avg:  6m 41s | Max:  6m 56s | Hits:  99%/2450  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 44s | Avg:  6m 52s | Max:  7m 02s | Hits:  99%/2450  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 29s | Avg:  6m 44s | Max:  6m 47s | Hits:  99%/2446  
      🟩 GCC12              Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max:  7m 41s | Hits:  99%/2446  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 51m | Avg: 15m 34s | Max: 25m 40s | Hits:  99%/13453 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 44m 29s | Avg: 22m 14s | Max: 23m 02s | Hits:  99%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 44m 44s | Avg: 22m 22s | Max: 22m 29s | Hits:  99%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 22m 17s | Avg: 11m 08s | Max: 11m 13s | Hits:  98%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 16m | Avg:  8m 02s | Max: 24m 26s | Hits: 100%/20461 
      🟩 GCC                Pass: 100%/22  | Total:  4h 07m | Avg: 11m 13s | Max: 25m 40s | Hits:  99%/26920 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 29m | Avg: 22m 18s | Max: 23m 02s | Hits:  99%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total: 22m 17s | Avg: 11m 08s | Max: 11m 13s | Hits:  98%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 51m 43s | Avg: 17m 14s | Max: 24m 28s | Hits:  99%/3669  
      🟩 rtx2080            Pass: 100%/34  | Total:  4h 53m | Avg:  8m 38s | Max: 23m 02s | Hits:  99%/40364 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 30m | Avg: 18m 45s | Max: 25m 40s | Hits:  99%/9784  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 12m | Avg:  8m 26s | Max: 23m 02s | Hits:  99%/44033 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 36s | Avg: 24m 36s | Max: 24m 36s | Hits:  99%/1223  
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 29s | Avg: 19m 29s | Max: 19m 29s | Hits:  99%/1223  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 14m | Avg: 24m 51s | Max: 25m 40s | Hits:  99%/3669  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 30s | Max: 22m 04s | Hits:  99%/3669  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 51m 43s | Avg: 17m 14s | Max: 24m 28s | Hits:  99%/3669  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 15s | Avg:  7m 15s | Max:  7m 15s | Hits:  99%/1223  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 03m | Avg:  9m 09s | Max: 23m 02s | Hits:  99%/23677 
      🟩 20                 Pass: 100%/25  | Total:  5h 12m | Avg: 12m 29s | Max: 25m 40s | Hits:  99%/30140 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 38m | Avg: 8m 51s | Max: 29m 04s | Hits: 99%/80271

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 44s | Avg:  8m 52s | Max: 11m 37s | Hits:  99%/3570  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 28m | Avg:  9m 01s | Max: 29m 04s | Hits:  99%/76702 
      🟩 arm64              Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 32s | Hits:  99%/3569  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 43m 33s | Avg:  8m 42s | Max: 23m 11s | Hits:  99%/8916  
      🟩 12.6               Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 29s | Hits:  99%/3568  
      🟩 12.8               Pass: 100%/38  | Total:  5h 24m | Avg:  8m 32s | Max: 29m 04s | Hits:  99%/67787 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  5m 16s | Hits:  99%/3568  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 43m 33s | Avg:  8m 42s | Max: 23m 11s | Hits:  99%/8916  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 29s | Hits:  99%/3568  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  5h 14m | Avg:  8m 43s | Max: 29m 04s | Hits:  99%/64219 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  5m 16s | Hits:  99%/3568  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 27m | Avg:  9m 01s | Max: 29m 04s | Hits:  99%/76703 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 02s | Avg:  5m 15s | Max:  5m 33s | Hits: 100%/7136  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 46s | Avg:  5m 53s | Max:  6m 05s | Hits: 100%/3568  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 16s | Avg:  5m 38s | Max:  5m 47s | Hits: 100%/3568  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 55s | Hits: 100%/3568  
      🟩 Clang18            Pass: 100%/7   | Total: 44m 28s | Avg:  6m 21s | Max: 10m 25s | Hits:  99%/12488 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 24s | Avg:  5m 12s | Max:  5m 27s | Hits:  99%/3570  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 04s | Avg:  6m 04s | Max:  6m 04s | Hits:  99%/1785  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 03s | Avg:  5m 31s | Max:  5m 49s | Hits:  99%/3570  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 31s | Avg:  5m 45s | Max:  5m 49s | Hits:  99%/3570  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  6m 08s | Hits:  99%/3570  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 51s | Avg:  6m 25s | Max:  6m 37s | Hits:  99%/3570  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 20m | Avg:  8m 02s | Max: 12m 05s | Hits:  99%/17850 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 48m 34s | Avg: 24m 17s | Max: 25m 23s | Hits:  99%/3556  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 15m | Avg: 25m 04s | Max: 29m 04s | Hits:  99%/5334  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 29s | Hits:  99%/3568  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 39m | Avg:  5m 52s | Max: 10m 25s | Hits:  99%/30328 
      🟩 GCC                Pass: 100%/21  | Total:  2h 24m | Avg:  6m 52s | Max: 12m 05s | Hits:  99%/37485 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 03m | Avg: 24m 45s | Max: 29m 04s | Hits:  99%/8890  
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 29s | Hits:  99%/3568  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 12m 05s | Hits:  99%/3570  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 20m | Avg:  7m 53s | Max: 25m 23s | Hits:  99%/58868 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 00m | Avg: 12m 03s | Max: 29m 04s | Hits:  99%/17833 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  5h 07m | Avg:  8m 05s | Max: 25m 23s | Hits:  99%/67785 
      🟩 TestCPU            Pass: 100%/3   | Total: 45m 01s | Avg: 15m 00s | Max: 29m 04s | Hits:  99%/5347  
      🟩 TestGPU            Pass: 100%/4   | Total: 46m 00s | Avg: 11m 30s | Max: 12m 05s | Hits:  99%/7139  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 12m 05s | Hits:  99%/3570  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 43s | Avg:  6m 43s | Max:  6m 43s | Hits:  99%/1785  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 57m | Avg:  8m 53s | Max: 25m 23s | Hits:  99%/35671 
      🟩 20                 Pass: 100%/23  | Total:  3h 22m | Avg:  8m 49s | Max: 29m 04s | Hits:  99%/41030 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 41s | Avg: 4m 10s | Max: 4m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 24s | Avg:  4m 42s | Max:  4m 43s
      🟩 arm64              Pass: 100%/2   | Total:  7m 17s | Avg:  3m 38s | Max:  3m 39s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 41s | Avg:  4m 10s | Max:  4m 43s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 41s | Avg:  4m 10s | Max:  4m 43s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 41s | Avg:  4m 10s | Max:  4m 43s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 41s | Avg:  4m 10s | Max:  4m 43s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 41s | Avg:  4m 10s | Max:  4m 43s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 41s | Avg:  4m 10s | Max:  4m 43s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 41s | Avg:  4m 10s | Max:  4m 43s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 22s | Avg:  4m 11s | Max:  4m 43s
      🟩 20                 Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  4m 41s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 50s | Avg: 8m 25s | Max: 14m 26s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 50s | Avg:  8m 25s | Max: 14m 26s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 50s | Avg:  8m 25s | Max: 14m 26s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 50s | Avg:  8m 25s | Max: 14m 26s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 50s | Avg:  8m 25s | Max: 14m 26s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 50s | Avg:  8m 25s | Max: 14m 26s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 50s | Avg:  8m 25s | Max: 14m 26s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 50s | Avg:  8m 25s | Max: 14m 26s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 24s | Avg:  2m 24s | Max:  2m 24s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 26s | Avg: 14m 26s | Max: 14m 26s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 11m | Avg: 1h 11m | Max: 1h 11m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 97)

# Runner
68 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@miscco merged commit ed54f48 into NVIDIA:branch/3.0.x on Apr 22, 2025
110 of 111 checks passed
@github-project-automation (bot) moved this from In Review to Done in CCCL on Apr 22, 2025
@alliepiper deleted the backport_3.0.x_on_cudaStream_t branch on September 5, 2025, 07:43