Skip to content

Use arch=native in benchmark/tuning presets/docs #5216

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jul 14, 2025

Conversation

bernhardmgruber
Copy link
Contributor

No description provided.

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner July 11, 2025 12:40
@github-project-automation github-project-automation bot moved this to Todo in CCCL Jul 11, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Jul 11, 2025
Copy link
Contributor

🟩 CI finished in 1h 29m: Pass: 100%/205 | Total: 1d 13h | Avg: 10m 55s | Max: 45m 15s | Hits: 97%/337616
  • 🟩 cub: Pass: 100%/50 | Total: 12h 15m | Avg: 14m 42s | Max: 45m 15s | Hits: 99%/61220

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total: 12h 01m | Avg: 15m 01s | Max: 45m 15s | Hits:  99%/58704 
      🟩 arm64              Pass: 100%/2   | Total: 14m 25s | Avg:  7m 12s | Max:  8m 22s | Hits:  99%/2516  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 55m 08s | Avg: 11m 01s | Max: 26m 29s | Hits:  99%/6105  
      🟩 12.9               Pass: 100%/45  | Total: 11h 20m | Avg: 15m 07s | Max: 45m 15s | Hits:  99%/55115 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 30s | Avg:  5m 15s | Max:  5m 27s | Hits:  99%/2165  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 55m 08s | Avg: 11m 01s | Max: 26m 29s | Hits:  99%/6105  
      🟩 nvcc12.9           Pass: 100%/43  | Total: 11h 10m | Avg: 15m 35s | Max: 45m 15s | Hits:  99%/52950 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 30s | Avg:  5m 15s | Max:  5m 27s | Hits:  99%/2165  
      🟩 nvcc               Pass: 100%/48  | Total: 12h 05m | Avg: 15m 06s | Max: 45m 15s | Hits:  99%/59055 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 26m 42s | Avg:  6m 40s | Max:  7m 35s | Hits:  99%/5034  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 39s | Avg:  6m 49s | Max:  6m 52s | Hits:  99%/2513  
      🟩 Clang16            Pass: 100%/2   | Total: 14m 03s | Avg:  7m 01s | Max:  7m 13s | Hits:  99%/2513  
      🟩 Clang17            Pass: 100%/2   | Total: 13m 27s | Avg:  6m 43s | Max:  6m 51s | Hits:  99%/2513  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 31s | Avg:  6m 45s | Max:  6m 49s | Hits:  99%/2513  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 36m | Avg: 13m 43s | Max: 36m 37s | Hits:  99%/8449  
      🟩 GCC7               Pass: 100%/2   | Total: 17m 01s | Avg:  8m 30s | Max:  8m 52s | Hits:  99%/2516  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 28s | Avg:  8m 28s | Max:  8m 28s | Hits:  99%/1258  
      🟩 GCC9               Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max:  8m 42s | Hits:  99%/2516  
      🟩 GCC10              Pass: 100%/2   | Total: 18m 04s | Avg:  9m 02s | Max:  9m 24s | Hits:  99%/2517  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 04s | Avg:  8m 32s | Max:  8m 41s | Hits:  99%/2513  
      🟩 GCC12              Pass: 100%/2   | Total: 18m 27s | Avg:  9m 13s | Max:  9m 20s | Hits:  99%/2513  
      🟩 GCC13              Pass: 100%/12  | Total:  4h 32m | Avg: 22m 44s | Max: 45m 15s | Hits:  99%/15105 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 55m 39s | Avg: 27m 49s | Max: 29m 10s | Hits:  99%/2144  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  1h 48m | Avg: 27m 12s | Max: 29m 29s | Hits:  99%/4288  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 25m 00s | Avg: 12m 30s | Max: 12m 35s | Hits:  98%/2315  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 57m | Avg:  9m 20s | Max: 36m 37s | Hits:  99%/23535 
      🟩 GCC                Pass: 100%/23  | Total:  6h 08m | Avg: 16m 02s | Max: 45m 15s | Hits:  99%/28938 
      🟩 MSVC               Pass: 100%/6   | Total:  2h 44m | Avg: 27m 24s | Max: 29m 29s | Hits:  99%/6432  
      🟩 NVHPC              Pass: 100%/2   | Total: 25m 00s | Avg: 12m 30s | Max: 12m 35s | Hits:  98%/2315  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 09m | Avg: 23m 03s | Max: 36m 50s | Hits:  99%/3777  
      🟩 rtx2080            Pass: 100%/39  | Total:  7h 04m | Avg: 10m 52s | Max: 29m 29s | Hits:  99%/47377 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 02m | Avg: 30m 17s | Max: 45m 15s | Hits:  99%/10066 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  7h 28m | Avg: 10m 40s | Max: 29m 29s | Hits:  99%/51152 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 42m 00s | Avg: 42m 00s | Max: 42m 00s | Hits:  99%/1259  
      🟩 GraphCapture       Pass: 100%/1   | Total: 36m 54s | Avg: 36m 54s | Max: 36m 54s | Hits:  99%/1259  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 58m | Avg: 39m 34s | Max: 45m 15s | Hits:  99%/3775  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 29m | Avg: 29m 56s | Max: 35m 28s | Hits:  99%/3775  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 09m | Avg: 23m 03s | Max: 36m 50s | Hits:  99%/3777  
      🟩 90;90a             Pass: 100%/2   | Total: 32m 00s | Avg: 16m 00s | Max: 23m 59s | Hits:  99%/2331  
      🟩 100;120            Pass: 100%/2   | Total: 34m 39s | Avg: 17m 19s | Max: 26m 25s | Hits:  99%/2331  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 49m | Avg: 10m 55s | Max: 29m 29s | Hits:  99%/25567 
      🟩 20                 Pass: 100%/29  | Total:  8h 26m | Avg: 17m 27s | Max: 45m 15s | Hits:  99%/35653 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 9h 23m | Avg: 11m 16s | Max: 34m 57s | Hits: 99%/95621

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 20m 41s | Avg: 10m 20s | Max: 12m 52s | Hits:  99%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  9h 12m | Avg: 11m 30s | Max: 34m 57s | Hits:  99%/91794 
      🟩 arm64              Pass: 100%/2   | Total: 11m 47s | Avg:  5m 53s | Max:  6m 43s | Hits:  99%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 54m 07s | Avg: 10m 49s | Max: 29m 56s | Hits:  99%/9560  
      🟩 12.9               Pass: 100%/45  | Total:  8h 29m | Avg: 11m 19s | Max: 34m 57s | Hits:  99%/86061 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s | Hits: 100%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 54m 07s | Avg: 10m 49s | Max: 29m 56s | Hits:  99%/9560  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  8h 18m | Avg: 11m 36s | Max: 34m 57s | Hits:  99%/82235 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 52s | Avg:  5m 26s | Max:  5m 30s | Hits: 100%/3826  
      🟩 nvcc               Pass: 100%/48  | Total:  9h 13m | Avg: 11m 31s | Max: 34m 57s | Hits:  99%/91795 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 53s | Avg:  5m 43s | Max:  6m 16s | Hits: 100%/7652  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 32s | Avg:  6m 16s | Max:  6m 19s | Hits: 100%/3826  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 01s | Avg:  6m 00s | Max:  6m 07s | Hits: 100%/3826  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 02s | Avg:  6m 01s | Max:  6m 06s | Hits: 100%/3826  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 32s | Avg:  5m 46s | Max:  5m 47s | Hits: 100%/3826  
      🟩 Clang19            Pass: 100%/7   | Total: 46m 05s | Avg:  6m 35s | Max:  9m 54s | Hits: 100%/13391 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 49s | Avg:  6m 54s | Max:  7m 31s | Hits:  99%/3828  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 22s | Avg:  7m 22s | Max:  7m 22s | Hits:  99%/1914  
      🟩 GCC9               Pass: 100%/2   | Total: 15m 32s | Avg:  7m 46s | Max:  8m 07s | Hits:  99%/3828  
      🟩 GCC10              Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max:  7m 42s | Hits:  99%/3828  
      🟩 GCC11              Pass: 100%/2   | Total: 15m 11s | Avg:  7m 35s | Max:  7m 57s | Hits:  99%/3828  
      🟩 GCC12              Pass: 100%/2   | Total: 16m 04s | Avg:  8m 02s | Max:  8m 09s | Hits:  99%/3828  
      🟩 GCC13              Pass: 100%/11  | Total:  1h 36m | Avg:  8m 48s | Max: 13m 02s | Hits:  99%/21054 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 57m 44s | Avg: 28m 52s | Max: 29m 56s | Hits:  99%/3812  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  2h 23m | Avg: 28m 41s | Max: 34m 17s | Hits:  99%/9530  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 51s | Max: 34m 57s | Hits:  99%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 57m | Avg:  6m 09s | Max:  9m 54s | Hits: 100%/36347 
      🟩 GCC                Pass: 100%/22  | Total:  3h 00m | Avg:  8m 10s | Max: 13m 02s | Hits:  99%/42108 
      🟩 MSVC               Pass: 100%/7   | Total:  3h 21m | Avg: 28m 44s | Max: 34m 17s | Hits:  99%/13342 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 51s | Max: 34m 57s | Hits:  99%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 47s | Avg:  8m 53s | Max: 12m 07s | Hits:  99%/3828  
      🟩 rtx2080            Pass: 100%/38  | Total:  6h 47m | Avg: 10m 44s | Max: 34m 57s | Hits:  99%/72672 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 18m | Avg: 13m 49s | Max: 34m 17s | Hits:  99%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  7h 44m | Avg: 10m 47s | Max: 34m 57s | Hits:  99%/82233 
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 41s | Avg: 17m 13s | Max: 34m 17s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 47m 55s | Avg: 11m 58s | Max: 13m 02s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 17m 47s | Avg:  8m 53s | Max: 12m 07s | Hits:  99%/3828  
      🟩 90;90a             Pass: 100%/2   | Total: 30m 32s | Avg: 15m 16s | Max: 24m 02s | Hits:  99%/3820  
      🟩 100;120            Pass: 100%/2   | Total: 32m 10s | Avg: 16m 05s | Max: 25m 19s | Hits:  99%/3820  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 58m | Avg: 11m 22s | Max: 34m 57s | Hits:  99%/40160 
      🟩 20                 Pass: 100%/27  | Total:  5h 04m | Avg: 11m 16s | Max: 34m 17s | Hits:  99%/51633 
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 8h 54m | Avg: 11m 07s | Max: 33m 14s | Hits: 95%/164034

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  8h 45m | Avg: 11m 25s | Max: 33m 14s | Hits:  95%/156697
      🟩 arm64              Pass: 100%/2   | Total:  8m 55s | Avg:  4m 27s | Max:  4m 31s | Hits:  99%/7337  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 44m 25s | Avg:  8m 53s | Max: 27m 02s | Hits:  99%/17977 
      🟩 12.9               Pass: 100%/43  | Total:  8h 09m | Avg: 11m 23s | Max: 33m 14s | Hits:  95%/146057
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 53m 36s | Avg: 26m 48s | Max: 28m 07s | Hits:  28%/7301  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 44m 25s | Avg:  8m 53s | Max: 27m 02s | Hits:  99%/17977 
      🟩 nvcc12.9           Pass: 100%/41  | Total:  7h 16m | Avg: 10m 38s | Max: 33m 14s | Hits:  98%/138756
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 36s | Avg: 26m 48s | Max: 28m 07s | Hits:  28%/7301  
      🟩 nvcc               Pass: 100%/46  | Total:  8h 00m | Avg: 10m 26s | Max: 33m 14s | Hits:  98%/156733
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 18m 23s | Avg:  4m 35s | Max:  4m 57s | Hits:  99%/14558 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 09s | Avg:  5m 04s | Max:  5m 09s | Hits:  99%/7297  
      🟩 Clang16            Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  5m 02s | Hits:  99%/7297  
      🟩 Clang17            Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  4m 57s | Hits:  99%/7297  
      🟩 Clang18            Pass: 100%/2   | Total: 21m 30s | Avg: 10m 45s | Max: 16m 25s | Hits:  88%/7297  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 29m | Avg: 14m 51s | Max: 28m 07s | Hits:  75%/21934 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 41s | Avg:  4m 20s | Max:  4m 27s | Hits:  99%/7233  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s | Hits:  99%/3627  
      🟩 GCC9               Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  4m 58s | Hits:  99%/7245  
      🟩 GCC10              Pass: 100%/2   | Total:  9m 21s | Avg:  4m 40s | Max:  4m 49s | Hits:  99%/7299  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  5m 09s | Hits:  99%/7295  
      🟩 GCC12              Pass: 100%/2   | Total:  9m 34s | Avg:  4m 47s | Max:  4m 49s | Hits:  99%/7299  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 04m | Avg: 11m 19s | Max: 24m 01s | Hits:  99%/29687 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 59m 31s | Avg: 29m 45s | Max: 32m 29s | Hits:  99%/6969  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  1h 59m | Avg: 29m 49s | Max: 33m 14s | Hits:  99%/14417 
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 20m 40s | Avg: 10m 20s | Max: 10m 24s | Hits:  98%/7283  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 38m | Avg:  8m 48s | Max: 28m 07s | Hits:  90%/65680 
      🟩 GCC                Pass: 100%/22  | Total:  2h 55m | Avg:  7m 59s | Max: 24m 01s | Hits:  99%/69685 
      🟩 MSVC               Pass: 100%/6   | Total:  2h 58m | Avg: 29m 48s | Max: 33m 14s | Hits:  99%/21386 
      🟩 NVHPC              Pass: 100%/2   | Total: 20m 40s | Avg: 10m 20s | Max: 10m 24s | Hits:  98%/7283  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 26m 34s | Avg: 13m 17s | Max: 21m 52s | Hits:  99%/7504  
      🟩 rtx2080            Pass: 100%/46  | Total:  8h 27m | Avg: 11m 02s | Max: 33m 14s | Hits:  95%/156530
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  6h 58m | Avg:  9m 57s | Max: 33m 14s | Hits:  95%/152903
      🟩 NVRTC              Pass: 100%/2   | Total: 47m 02s | Avg: 23m 31s | Max: 24m 01s | Hits:  90%/42    
      🟩 Test               Pass: 100%/3   | Total:  1h 06m | Avg: 22m 16s | Max: 23m 52s | Hits:  99%/11089 
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 47m 02s | Avg: 23m 31s | Max: 24m 01s | Hits:  90%/42    
      🟩 90                 Pass: 100%/2   | Total: 26m 34s | Avg: 13m 17s | Max: 21m 52s | Hits:  99%/7504  
      🟩 90;90a             Pass: 100%/2   | Total: 35m 11s | Avg: 17m 35s | Max: 30m 07s | Hits:  99%/7450  
      🟩 100;120            Pass: 100%/2   | Total: 32m 28s | Avg: 16m 14s | Max: 27m 08s | Hits:  99%/7450  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  3h 55m | Avg: 10m 41s | Max: 32m 29s | Hits:  95%/75742 
      🟩 20                 Pass: 100%/25  | Total:  4h 56m | Avg: 11m 52s | Max: 33m 14s | Hits:  96%/88292 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 2h 36m | Avg: 5m 34s | Max: 11m 29s | Hits: 99%/16246

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  2h 23m | Avg:  5m 58s | Max: 11m 29s | Hits:  98%/13754 
      🟩 arm64              Pass: 100%/4   | Total: 12m 53s | Avg:  3m 13s | Max:  3m 32s | Hits:  99%/2492  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 18m 01s | Avg:  6m 00s | Max: 11m 29s | Hits:  99%/1568  
      🟩 12.9               Pass: 100%/25  | Total:  2h 18m | Avg:  5m 31s | Max: 11m 02s | Hits:  99%/14678 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 18m 01s | Avg:  6m 00s | Max: 11m 29s | Hits:  99%/1568  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  2h 18m | Avg:  5m 31s | Max: 11m 02s | Hits:  99%/14678 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  2h 36m | Avg:  5m 34s | Max: 11m 29s | Hits:  99%/16246 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  6m 24s | Avg:  3m 12s | Max:  3m 22s | Hits: 100%/1248  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s | Hits: 100%/623   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s | Hits: 100%/623   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s | Hits: 100%/623   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s | Hits: 100%/623   
      🟩 Clang19            Pass: 100%/4   | Total: 19m 04s | Avg:  4m 46s | Max:  9m 46s | Hits: 100%/2492  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 09s | Avg:  3m 34s | Max:  3m 39s | Hits:  99%/1248  
      🟩 GCC11              Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s | Hits:  99%/623   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 30s | Avg:  6m 30s | Max:  6m 30s | Hits:  90%/623   
      🟩 GCC13              Pass: 100%/8   | Total: 41m 13s | Avg:  5m 09s | Max:  9m 54s | Hits:  99%/4984  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s | Hits:  95%/322   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 32m 41s | Avg: 10m 53s | Max: 11m 02s | Hits:  95%/972   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 14m 14s | Avg:  7m 07s | Max:  7m 10s | Hits:  97%/1242  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 39m 05s | Avg:  3m 54s | Max:  9m 46s | Hits: 100%/6232  
      🟩 GCC                Pass: 100%/12  | Total: 58m 40s | Avg:  4m 53s | Max:  9m 54s | Hits:  98%/7478  
      🟩 MSVC               Pass: 100%/4   | Total: 44m 10s | Avg: 11m 02s | Max: 11m 29s | Hits:  95%/1294  
      🟩 NVHPC              Pass: 100%/2   | Total: 14m 14s | Avg:  7m 07s | Max:  7m 10s | Hits:  97%/1242  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max:  9m 14s | Hits:  99%/1246  
      🟩 rtx2080            Pass: 100%/26  | Total:  2h 23m | Avg:  5m 31s | Max: 11m 29s | Hits:  98%/15000 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  2h 07m | Avg:  5m 05s | Max: 11m 29s | Hits:  98%/14377 
      🟩 Test               Pass: 100%/3   | Total: 28m 54s | Avg:  9m 38s | Max:  9m 54s | Hits:  99%/1869  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max:  9m 14s | Hits:  99%/1246  
      🟩 90;90a             Pass: 100%/2   | Total: 14m 27s | Avg:  7m 13s | Max: 10m 42s | Hits:  98%/947   
      🟩 100;120            Pass: 100%/2   | Total: 14m 52s | Avg:  7m 26s | Max: 11m 02s | Hits:  98%/947   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 13m 38s | Avg:  4m 32s | Max:  7m 10s | Hits:  99%/1867  
      🟩 20                 Pass: 100%/25  | Total:  2h 22m | Avg:  5m 42s | Max: 11m 29s | Hits:  99%/14379 
    
  • 🟩 python: Pass: 100%/18 | Total: 3h 14m | Avg: 10m 48s | Max: 20m 28s

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  3h 14m | Avg: 10m 48s | Max: 20m 28s
    🟩 ctk
      🟩 12.9               Pass: 100%/18  | Total:  3h 14m | Avg: 10m 48s | Max: 20m 28s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/18  | Total:  3h 14m | Avg: 10m 48s | Max: 20m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/18  | Total:  3h 14m | Avg: 10m 48s | Max: 20m 28s
    🟩 cxx
      🟩 GCC13              Pass: 100%/18  | Total:  3h 14m | Avg: 10m 48s | Max: 20m 28s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/18  | Total:  3h 14m | Avg: 10m 48s | Max: 20m 28s
    🟩 gpu
      🟩 h100               Pass: 100%/8   | Total:  1h 18m | Avg:  9m 50s | Max: 16m 43s
      🟩 rtxa6000           Pass: 100%/10  | Total:  1h 55m | Avg: 11m 34s | Max: 20m 28s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total: 19m 31s | Avg:  9m 45s | Max:  9m 48s
      🟩 Test cuda.cccl.cooperative Pass: 100%/4   | Total:  1h 08m | Avg: 17m 12s | Max: 20m 28s
      🟩 Test cuda.cccl.examples Pass: 100%/4   | Total: 20m 22s | Avg:  5m 05s | Max:  6m 28s
      🟩 Test cuda.cccl.headers Pass: 100%/4   | Total: 18m 16s | Avg:  4m 34s | Max:  5m 25s
      🟩 Test cuda.cccl.parallel Pass: 100%/4   | Total:  1h 07m | Avg: 16m 53s | Max: 18m 10s
    🟩 py_version
      🟩 3.10               Pass: 100%/9   | Total:  1h 41m | Avg: 11m 14s | Max: 20m 22s
      🟩 3.13               Pass: 100%/9   | Total:  1h 33m | Avg: 10m 22s | Max: 20m 28s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 13m 20s | Avg: 3m 20s | Max: 3m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 13m 20s | Avg:  3m 20s | Max:  3m 33s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 25s
      🟩 12.9               Pass: 100%/2   | Total:  6m 32s | Avg:  3m 16s | Max:  3m 33s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 25s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  6m 32s | Avg:  3m 16s | Max:  3m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 13m 20s | Avg:  3m 20s | Max:  3m 33s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 25s | Avg:  3m 25s | Max:  3m 25s
      🟩 Clang19            Pass: 100%/1   | Total:  2m 59s | Avg:  2m 59s | Max:  2m 59s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
      🟩 GCC13              Pass: 100%/1   | Total:  3m 33s | Avg:  3m 33s | Max:  3m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  6m 24s | Avg:  3m 12s | Max:  3m 25s
      🟩 GCC                Pass: 100%/2   | Total:  6m 56s | Avg:  3m 28s | Max:  3m 33s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 13m 20s | Avg:  3m 20s | Max:  3m 33s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 13m 20s | Avg:  3m 20s | Max:  3m 33s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 23s | Avg: 4m 05s | Max: 4m 18s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 03s | Avg:  4m 01s | Max:  4m 05s
      🟩 arm64              Pass: 100%/2   | Total:  8m 20s | Avg:  4m 10s | Max:  4m 18s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 18s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 18s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 18s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 18s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 18s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 18s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 18s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 16s | Avg:  4m 08s | Max:  4m 18s
      🟩 20                 Pass: 100%/2   | Total:  8m 07s | Avg:  4m 03s | Max:  4m 05s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 26m 59s | Avg: 8m 59s | Max: 14m 07s | Hits: 98%/495

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 26m 59s | Avg:  8m 59s | Max: 14m 07s | Hits:  98%/495   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 26m 59s | Avg:  8m 59s | Max: 14m 07s | Hits:  98%/495   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 26m 59s | Avg:  8m 59s | Max: 14m 07s | Hits:  98%/495   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 26m 59s | Avg:  8m 59s | Max: 14m 07s | Hits:  98%/495   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 26m 59s | Avg:  8m 59s | Max: 14m 07s | Hits:  98%/495   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 26m 59s | Avg:  8m 59s | Max: 14m 07s | Hits:  98%/495   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 14m 07s | Avg: 14m 07s | Max: 14m 07s | Hits:  98%/165   
      🟩 rtx2080            Pass: 100%/2   | Total: 12m 52s | Avg:  6m 26s | Max: 10m 39s | Hits:  98%/330   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 13s | Avg:  2m 13s | Max:  2m 13s | Hits:  98%/165   
      🟩 Test               Pass: 100%/2   | Total: 24m 46s | Avg: 12m 23s | Max: 14m 07s | Hits:  98%/330   
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
CCCL Packaging
libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@bernhardmgruber bernhardmgruber merged commit 182da25 into NVIDIA:main Jul 14, 2025
217 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Jul 14, 2025
@bernhardmgruber bernhardmgruber deleted the native_docs branch July 14, 2025 08:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

2 participants