Skip to content

[Backport branch/3.0.x] Fix uninitialized read in local atomic code path.#4425

Merged
wmaxey merged 1 commit intobranch/3.0.xfrom
backport-4352-to-branch/3.0.x
Apr 12, 2025
Merged

[Backport branch/3.0.x] Fix uninitialized read in local atomic code path.#4425
wmaxey merged 1 commit intobranch/3.0.xfrom
backport-4352-to-branch/3.0.x

Conversation

@github-actions
Copy link
Copy Markdown
Contributor

Description

Backport of #4352 to branch/3.0.x.

* Fix uninitialized read in local atomic code path.

* PTX assumes '=' operands are always overwritten. For this code path the predicated mov instruction will only sometimes overwrite the original value. The compiler may or may not initialize `__temp`. This patch fixes this by always signing a 0 or 1 to the output register removing the need to initialize `__temp`. An alternative is to use `+` instead of `=` on the output operand.

* Create a test to cover the PTX path of local storage atomics regardless of CTK version

* Disable test for nvrtc

* Use new additional compile flags

* Try and fix checking for compile flags

* Update comments in is_local codepath

* Make test compatible with older NVCC

* Revert "Try and fix checking for compile flags"

This reverts commit a846ea5.

* Remove unroll pragma, it is unneeded for repro.

---------

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>
(cherry picked from commit 3200156)
@github-actions github-actions Bot requested a review from a team as a code owner April 12, 2025 01:05
@github-actions github-actions Bot requested a review from wmaxey April 12, 2025 01:05
@github-project-automation github-project-automation Bot moved this to Todo in CCCL Apr 12, 2025
@copy-pr-bot
Copy link
Copy Markdown
Contributor

copy-pr-bot Bot commented Apr 12, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@github-project-automation github-project-automation Bot moved this from Todo to In Review in CCCL Apr 12, 2025
@wmaxey
Copy link
Copy Markdown
Member

wmaxey commented Apr 12, 2025

/ok to test 97bf5ec

@wmaxey wmaxey enabled auto-merge (squash) April 12, 2025 01:34
@github-actions
Copy link
Copy Markdown
Contributor Author

🟩 CI finished in 11h 06m: Pass: 100%/162 | Total: 1d 16h | Avg: 15m 05s | Max: 1h 16m | Hits: 91%/252465
  • 🟩 cub: Pass: 100%/45 | Total: 17h 56m | Avg: 23m 54s | Max: 1h 16m | Hits: 93%/53743

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 17h 43m | Avg: 24m 44s | Max:  1h 16m | Hits:  93%/51301 
      🟩 arm64              Pass: 100%/2   | Total: 12m 40s | Avg:  6m 20s | Max:  6m 36s | Hits:  99%/2442  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 09m | Hits:  69%/5936  
      🟩 12.6               Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 16m | Hits:  67%/2260  
      🟩 12.8               Pass: 100%/38  | Total: 10h 00m | Avg: 15m 48s | Max:  1h 16m | Hits:  98%/45547 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  5m 29s | Hits: 100%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 26m | Avg:  1h 05m | Max:  1h 09m | Hits:  69%/5936  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 16m | Hits:  67%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  9h 50m | Avg: 16m 23s | Max:  1h 16m | Hits:  98%/43439 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  5m 29s | Hits: 100%/2108  
      🟩 nvcc               Pass: 100%/43  | Total: 17h 45m | Avg: 24m 47s | Max:  1h 16m | Hits:  93%/51635 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 26s | Max:  1h 03m | Hits:  84%/4892  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 48s | Hits: 100%/2442  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max:  6m 23s | Hits: 100%/2442  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 43s | Avg:  6m 21s | Max:  6m 22s | Hits: 100%/2442  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 15m | Avg: 10m 47s | Max: 26m 02s | Hits: 100%/8213  
      🟩 GCC7               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 28s | Max:  1h 04m | Hits:  83%/2446  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 50s | Avg:  6m 50s | Max:  6m 50s | Hits:  99%/1223  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 14m | Avg: 37m 29s | Max:  1h 07m | Hits:  83%/2446  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 24s | Avg:  6m 42s | Max:  6m 47s | Hits:  99%/2446  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 32s | Avg:  6m 46s | Max:  6m 48s | Hits:  99%/2442  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 54s | Avg:  6m 57s | Max:  6m 59s | Hits:  99%/2442  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 03m | Avg: 16m 41s | Max: 27m 20s | Hits:  99%/13431 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 16m | Hits:  75%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m | Hits:  75%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 16m | Hits:  67%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  4h 11m | Avg: 14m 49s | Max:  1h 03m | Hits:  96%/20431 
      🟩 GCC                Pass: 100%/22  | Total:  6h 17m | Avg: 17m 08s | Max:  1h 07m | Hits:  96%/26876 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 58m | Avg:  1h 14m | Max:  1h 16m | Hits:  75%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 16m | Hits:  67%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 55m 16s | Avg: 18m 25s | Max: 25m 12s | Hits:  99%/3663  
      🟩 rtx2080            Pass: 100%/34  | Total: 14h 21m | Avg: 25m 20s | Max:  1h 16m | Hits:  91%/40312 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 39m | Avg: 19m 55s | Max: 27m 20s | Hits:  99%/9768  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 14h 41m | Avg: 23m 49s | Max:  1h 16m | Hits:  92%/43975 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 25m 45s | Avg: 25m 45s | Max: 25m 45s | Hits:  99%/1221  
      🟩 GraphCapture       Pass: 100%/1   | Total: 21m 07s | Avg: 21m 07s | Max: 21m 07s | Hits:  99%/1221  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 18m | Avg: 26m 11s | Max: 27m 20s | Hits:  99%/3663  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 09m | Avg: 23m 09s | Max: 25m 11s | Hits:  99%/3663  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 55m 16s | Avg: 18m 25s | Max: 25m 12s | Hits:  99%/3663  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 46s | Avg:  7m 46s | Max:  7m 46s | Hits:  99%/1221  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  9h 37m | Avg: 28m 52s | Max:  1h 16m | Hits:  90%/23647 
      🟩 20                 Pass: 100%/25  | Total:  8h 18m | Avg: 19m 57s | Max:  1h 16m | Hits:  96%/30096 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 11h 03m | Avg: 14m 44s | Max: 54m 10s | Hits: 94%/80181

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 19m 09s | Avg:  9m 34s | Max: 11m 38s | Hits:  99%/3566  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 10h 52m | Avg: 15m 10s | Max: 54m 10s | Hits:  94%/76616 
      🟩 arm64              Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 30s | Hits:  99%/3565  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 50m | Avg: 34m 10s | Max: 40m 25s | Hits:  78%/8906  
      🟩 12.6               Pass: 100%/2   | Total:  1h 44m | Avg: 52m 05s | Max: 54m 10s | Hits:  65%/3564  
      🟩 12.8               Pass: 100%/38  | Total:  6h 28m | Avg: 10m 12s | Max: 49m 51s | Hits:  98%/67711 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 40s | Avg:  5m 20s | Max:  5m 22s | Hits:  99%/3564  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 50m | Avg: 34m 10s | Max: 40m 25s | Hits:  78%/8906  
      🟩 nvcc12.6           Pass: 100%/2   | Total:  1h 44m | Avg: 52m 05s | Max: 54m 10s | Hits:  65%/3564  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 17m | Avg: 10m 29s | Max: 49m 51s | Hits:  98%/64147 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 40s | Avg:  5m 20s | Max:  5m 22s | Hits:  99%/3564  
      🟩 nvcc               Pass: 100%/43  | Total: 10h 52m | Avg: 15m 10s | Max: 54m 10s | Hits:  94%/76617 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 14m | Avg: 18m 33s | Max: 32m 57s | Hits:  88%/7128  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 06s | Avg:  5m 33s | Max:  5m 34s | Hits:  99%/3564  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 27s | Avg:  6m 13s | Max:  6m 18s | Hits:  99%/3564  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 31s | Avg:  5m 45s | Max:  6m 04s | Hits:  99%/3564  
      🟩 Clang18            Pass: 100%/7   | Total: 45m 11s | Avg:  6m 27s | Max: 10m 39s | Hits:  99%/12474 
      🟩 GCC7               Pass: 100%/2   | Total: 39m 38s | Avg: 19m 49s | Max: 34m 06s | Hits:  88%/3566  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 30s | Avg:  5m 30s | Max:  5m 30s | Hits:  99%/1783  
      🟩 GCC9               Pass: 100%/2   | Total: 38m 56s | Avg: 19m 28s | Max: 32m 57s | Hits:  88%/3566  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 48s | Hits:  99%/3566  
      🟩 GCC11              Pass: 100%/2   | Total: 12m 40s | Avg:  6m 20s | Max:  6m 41s | Hits:  99%/3566  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max:  6m 32s | Hits:  99%/3566  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 22m | Avg:  8m 13s | Max: 11m 49s | Hits:  99%/17830 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 21m | Avg: 40m 40s | Max: 40m 55s | Hits:  81%/3552  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 00m | Avg: 40m 03s | Max: 49m 51s | Hits:  87%/5328  
      🟩 NVHPC25.1          Pass: 100%/2   | Total:  1h 44m | Avg: 52m 05s | Max: 54m 10s | Hits:  65%/3564  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 34m | Avg:  9m 05s | Max: 32m 57s | Hits:  97%/30294 
      🟩 GCC                Pass: 100%/21  | Total:  3h 23m | Avg:  9m 40s | Max: 34m 06s | Hits:  97%/37443 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 21m | Avg: 40m 17s | Max: 49m 51s | Hits:  84%/8880  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 44m | Avg: 52m 05s | Max: 54m 10s | Hits:  65%/3564  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 42s | Avg:  8m 21s | Max: 11m 43s | Hits:  99%/3566  
      🟩 rtx2080            Pass: 100%/33  | Total:  8h 17m | Avg: 15m 05s | Max: 54m 10s | Hits:  93%/58802 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 28m | Avg: 14m 51s | Max: 49m 51s | Hits:  97%/17813 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  9h 32m | Avg: 15m 03s | Max: 54m 10s | Hits:  93%/67709 
      🟩 TestCPU            Pass: 100%/3   | Total: 45m 12s | Avg: 15m 04s | Max: 28m 46s | Hits:  99%/5341  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 49s | Avg: 11m 27s | Max: 11m 49s | Hits:  99%/7131  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 42s | Avg:  8m 21s | Max: 11m 43s | Hits:  99%/3566  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 11s | Avg:  7m 11s | Max:  7m 11s | Hits:  99%/1783  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  5h 50m | Avg: 17m 30s | Max: 54m 10s | Hits:  91%/35631 
      🟩 20                 Pass: 100%/23  | Total:  4h 53m | Avg: 12m 46s | Max: 50m 00s | Hits:  96%/40984 
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 7h 40m | Avg: 10m 42s | Max: 37m 24s | Hits: 86%/106391

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  7h 23m | Avg: 10m 49s | Max: 37m 24s | Hits:  86%/100572
      🟩 arm64              Pass: 100%/2   | Total: 16m 16s | Avg:  8m 08s | Max:  8m 12s | Hits:  86%/5819  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 44m 29s | Avg:  8m 53s | Max: 23m 09s | Hits:  93%/14174 
      🟩 12.6               Pass: 100%/2   | Total: 51m 59s | Avg: 25m 59s | Max: 37m 24s | Hits:  61%/5766  
      🟩 12.8               Pass: 100%/36  | Total:  6h 03m | Avg: 10m 06s | Max: 26m 16s | Hits:  87%/86451 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 44m 21s | Avg: 22m 10s | Max: 23m 20s | Hits:  27%/5780  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 44m 29s | Avg:  8m 53s | Max: 23m 09s | Hits:  93%/14174 
      🟩 nvcc12.6           Pass: 100%/2   | Total: 51m 59s | Avg: 25m 59s | Max: 37m 24s | Hits:  61%/5766  
      🟩 nvcc12.8           Pass: 100%/34  | Total:  5h 19m | Avg:  9m 23s | Max: 26m 16s | Hits:  91%/80671 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 44m 21s | Avg: 22m 10s | Max: 23m 20s | Hits:  27%/5780  
      🟩 nvcc               Pass: 100%/41  | Total:  6h 55m | Avg: 10m 08s | Max: 37m 24s | Hits:  90%/100611
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 20s | Avg:  5m 05s | Max:  7m 10s | Hits:  95%/11528 
      🟩 Clang15            Pass: 100%/2   | Total: 35m 47s | Avg: 17m 53s | Max: 26m 16s | Hits:  59%/5776  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 23s | Avg:  6m 41s | Max:  8m 45s | Hits:  93%/5776  
      🟩 Clang17            Pass: 100%/2   | Total:  9m 04s | Avg:  4m 32s | Max:  4m 34s | Hits:  99%/5776  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 10m | Avg: 11m 48s | Max: 23m 20s | Hits:  68%/14465 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 19s | Avg:  6m 39s | Max:  7m 23s | Hits:  88%/5714  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s | Hits:  99%/2867  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 02s | Avg:  6m 01s | Max:  7m 47s | Hits:  93%/5726  
      🟩 GCC10              Pass: 100%/2   | Total:  8m 30s | Avg:  4m 15s | Max:  4m 16s | Hits:  99%/5782  
      🟩 GCC11              Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  7m 54s | Hits:  94%/5778  
      🟩 GCC12              Pass: 100%/2   | Total:  8m 24s | Avg:  4m 12s | Max:  4m 13s | Hits:  99%/5778  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 46m | Avg: 10m 36s | Max: 19m 25s | Hits:  91%/14726 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 46m 19s | Avg: 23m 09s | Max: 23m 10s | Hits:  89%/5432  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 47m 57s | Avg: 23m 58s | Max: 25m 27s | Hits:  90%/5501  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 51m 59s | Avg: 25m 59s | Max: 37m 24s | Hits:  61%/5766  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  2h 29m | Avg:  9m 20s | Max: 26m 16s | Hits:  81%/43321 
      🟩 GCC                Pass: 100%/21  | Total:  2h 44m | Avg:  7m 50s | Max: 19m 25s | Hits:  94%/46371 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 34m | Avg: 23m 34s | Max: 25m 27s | Hits:  90%/10933 
      🟩 NVHPC              Pass: 100%/2   | Total: 51m 59s | Avg: 25m 59s | Max: 37m 24s | Hits:  61%/5766  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 15s | Avg:  9m 07s | Max: 13m 55s | Hits:  99%/2999  
      🟩 rtx2080            Pass: 100%/41  | Total:  7h 21m | Avg: 10m 46s | Max: 37m 24s | Hits:  86%/103392
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  6h 28m | Avg: 10m 29s | Max: 37m 24s | Hits:  86%/106351
      🟩 NVRTC              Pass: 100%/2   | Total: 37m 17s | Avg: 18m 38s | Max: 19m 25s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 32m 38s | Avg: 10m 52s | Max: 13m 55s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 14s | Avg:  2m 14s | Max:  2m 14s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 37m 17s | Avg: 18m 38s | Max: 19m 25s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 18m 15s | Avg:  9m 07s | Max: 13m 55s | Hits:  99%/2999  
      🟩 90;90a;100         Pass: 100%/1   | Total: 17m 46s | Avg: 17m 46s | Max: 17m 46s | Hits:  85%/2999  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 55m | Avg: 11m 13s | Max: 37m 24s | Hits:  87%/56876 
      🟩 20                 Pass: 100%/21  | Total:  3h 42m | Avg: 10m 35s | Max: 26m 16s | Hits:  86%/49515 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 18m | Avg: 6m 17s | Max: 14m 20s | Hits: 95%/11830

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  2h 05m | Avg:  6m 57s | Max: 14m 20s | Hits:  94%/9494  
      🟩 arm64              Pass: 100%/4   | Total: 13m 17s | Avg:  3m 19s | Max:  3m 26s | Hits:  99%/2336  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 13m 32s | Avg: 13m 32s | Max: 13m 32s | Hits:  70%/281   
      🟩 12.6               Pass: 100%/2   | Total: 19m 03s | Avg:  9m 31s | Max:  9m 36s | Hits:  63%/752   
      🟩 12.8               Pass: 100%/19  | Total:  1h 46m | Avg:  5m 34s | Max: 14m 20s | Hits:  98%/10797 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 13m 32s | Avg: 13m 32s | Max: 13m 32s | Hits:  70%/281   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 19m 03s | Avg:  9m 31s | Max:  9m 36s | Hits:  63%/752   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 46m | Avg:  5m 34s | Max: 14m 20s | Hits:  98%/10797 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 18m | Avg:  6m 17s | Max: 14m 20s | Hits:  95%/11830 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s | Hits: 100%/586   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits: 100%/584   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s | Hits: 100%/584   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 33s | Avg:  3m 33s | Max:  3m 33s | Hits: 100%/584   
      🟩 Clang18            Pass: 100%/4   | Total: 22m 54s | Avg:  5m 43s | Max: 12m 43s | Hits: 100%/2336  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s | Hits:  99%/586   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 33s | Avg:  3m 33s | Max:  3m 33s | Hits:  99%/584   
      🟩 GCC12              Pass: 100%/2   | Total: 16m 42s | Avg:  8m 21s | Max: 13m 01s | Hits:  99%/1168  
      🟩 GCC13              Pass: 100%/6   | Total: 31m 02s | Avg:  5m 10s | Max: 14m 18s | Hits:  98%/3504  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 32s | Avg: 13m 32s | Max: 13m 32s | Hits:  70%/281   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 14m 20s | Avg: 14m 20s | Max: 14m 20s | Hits:  70%/281   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 19m 03s | Avg:  9m 31s | Max:  9m 36s | Hits:  63%/752   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 37m 05s | Avg:  4m 38s | Max: 12m 43s | Hits: 100%/4674  
      🟩 GCC                Pass: 100%/10  | Total: 54m 35s | Avg:  5m 27s | Max: 14m 18s | Hits:  99%/5842  
      🟩 MSVC               Pass: 100%/2   | Total: 27m 52s | Avg: 13m 56s | Max: 14m 20s | Hits:  70%/562   
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 03s | Avg:  9m 31s | Max:  9m 36s | Hits:  63%/752   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 27s | Avg:  8m 43s | Max: 14m 18s | Hits:  99%/1168  
      🟩 rtx2080            Pass: 100%/20  | Total:  2h 01m | Avg:  6m 03s | Max: 14m 20s | Hits:  95%/10662 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 38m | Avg:  5m 11s | Max: 14m 20s | Hits:  95%/10078 
      🟩 Test               Pass: 100%/3   | Total: 40m 02s | Avg: 13m 20s | Max: 14m 18s | Hits:  99%/1752  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 21m 17s | Avg:  7m 05s | Max: 14m 18s | Hits:  98%/1752  
      🟩 90a                Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s | Hits:  99%/584   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 20m 10s | Avg:  5m 02s | Max:  9m 36s | Hits:  92%/2128  
      🟩 20                 Pass: 100%/18  | Total:  1h 58m | Avg:  6m 34s | Max: 14m 20s | Hits:  96%/9702  
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 18m 53s | Avg: 4m 43s | Max: 5m 24s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  5m 24s
      🟩 arm64              Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 23s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  5m 24s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  5m 24s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  5m 24s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  5m 24s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  5m 24s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  5m 24s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 18m 53s | Avg:  4m 43s | Max:  5m 24s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  9m 06s | Avg:  4m 33s | Max:  4m 57s
      🟩 20                 Pass: 100%/2   | Total:  9m 47s | Avg:  4m 53s | Max:  5m 24s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 54s | Avg: 8m 27s | Max: 14m 33s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max: 14m 33s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max: 14m 33s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max: 14m 33s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max: 14m 33s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max: 14m 33s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max: 14m 33s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 54s | Avg:  8m 27s | Max: 14m 33s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 33s | Avg: 14m 33s | Max: 14m 33s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 11m | Avg: 1h 11m | Max: 1h 11m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@wmaxey wmaxey merged commit 67a0867 into branch/3.0.x Apr 12, 2025
177 checks passed
@github-project-automation github-project-automation Bot moved this from In Review to Done in CCCL Apr 12, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

1 participant