Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add BabelStream flavors as thrust::transform benchmarks #1921

Merged
merged 1 commit into from
Jul 4, 2024

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Jun 27, 2024

The Thrust implementation of BabelStream uses a few versions of thrust::transform. Given the importance of this public benchmark, we should add these uses of thrust::transform (mul, add, triad and nstream) to your benchmarks as well. The copy and dot benchmarks are covered by existing thrust benchmarks.

@bernhardmgruber bernhardmgruber added the thrust For all items related to Thrust. label Jun 27, 2024
Copy link
Contributor

🟨 CI finished in 1h 31m: Pass: 99%/249 | Total: 1d 07h | Avg: 7m 30s | Max: 58m 28s | Hits: 99%/247587
  • 🟨 cub: Pass: 99%/131 | Total: 20h 13m | Avg: 9m 15s | Max: 58m 28s | Hits: 99%/108321

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/123 | Total: 19h 36m | Avg:  9m 34s | Max: 58m 28s | Hits:  99%/101505
      🟩 arm64              Pass: 100%/8   | Total: 36m 49s | Avg:  4m 36s | Max:  5m 17s | Hits:  99%/6816  
    🔍 ctk: 12.4 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 06m | Avg:  4m 27s | Max: 14m 21s | Hits:  99%/11568 
      🟩 11.8               Pass: 100%/3   | Total: 13m 42s | Avg:  4m 34s | Max:  4m 40s | Hits:  99%/2556  
      🔍 12.4               Pass:  99%/113 | Total: 18h 53m | Avg: 10m 01s | Max: 58m 28s | Hits:  99%/94197 
    🔍 cudacxx: nvcc12.4 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 27s | Avg:  3m 43s | Max:  3m 48s | Hits: 100%/1408  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 06m | Avg:  4m 27s | Max: 14m 21s | Hits:  99%/11568 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 13m 42s | Avg:  4m 34s | Max:  4m 40s | Hits:  99%/2556  
      🔍 nvcc12.4           Pass:  99%/111 | Total: 18h 45m | Avg: 10m 08s | Max: 58m 28s | Hits:  99%/92789 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 27s | Avg:  3m 43s | Max:  3m 48s | Hits: 100%/1408  
      🔍 nvcc               Pass:  99%/129 | Total: 20h 06m | Avg:  9m 21s | Max: 58m 28s | Hits:  99%/106913
    🔍 cxx: Clang17 🔍
      🟩 Clang9             Pass: 100%/6   | Total: 27m 58s | Avg:  4m 39s | Max:  5m 20s | Hits: 100%/4890  
      🟩 Clang10            Pass: 100%/3   | Total: 15m 26s | Avg:  5m 08s | Max:  5m 20s | Hits: 100%/2562  
      🟩 Clang11            Pass: 100%/4   | Total: 17m 25s | Avg:  4m 21s | Max:  4m 28s | Hits: 100%/3416  
      🟩 Clang12            Pass: 100%/4   | Total: 17m 52s | Avg:  4m 28s | Max:  4m 33s | Hits: 100%/3416  
      🟩 Clang13            Pass: 100%/4   | Total: 17m 36s | Avg:  4m 24s | Max:  4m 33s | Hits: 100%/3416  
      🟩 Clang14            Pass: 100%/4   | Total: 18m 25s | Avg:  4m 36s | Max:  4m 45s | Hits: 100%/3416  
      🟩 Clang15            Pass: 100%/4   | Total: 18m 43s | Avg:  4m 40s | Max:  4m 52s | Hits: 100%/3408  
      🟩 Clang16            Pass: 100%/4   | Total: 18m 05s | Avg:  4m 31s | Max:  4m 35s | Hits: 100%/3408  
      🔍 Clang17            Pass:  96%/26  | Total:  7h 42m | Avg: 17m 48s | Max: 58m 28s | Hits: 100%/21004 
      🟩 GCC6               Pass: 100%/2   | Total:  7m 32s | Avg:  3m 46s | Max:  3m 53s | Hits:  99%/1552  
      🟩 GCC7               Pass: 100%/6   | Total: 23m 38s | Avg:  3m 56s | Max:  4m 22s | Hits:  99%/4893  
      🟩 GCC8               Pass: 100%/6   | Total: 24m 25s | Avg:  4m 04s | Max:  4m 37s | Hits:  99%/4893  
      🟩 GCC9               Pass: 100%/6   | Total: 24m 09s | Avg:  4m 01s | Max:  4m 35s | Hits:  99%/4893  
      🟩 GCC10              Pass: 100%/4   | Total: 17m 52s | Avg:  4m 28s | Max:  4m 38s | Hits:  99%/3416  
      🟩 GCC11              Pass: 100%/7   | Total: 31m 21s | Avg:  4m 28s | Max:  4m 40s | Hits:  99%/5964  
      🟩 GCC12              Pass: 100%/4   | Total: 18m 24s | Avg:  4m 36s | Max:  4m 44s | Hits:  99%/3408  
      🟩 GCC13              Pass: 100%/28  | Total:  5h 59m | Avg: 12m 50s | Max: 27m 08s | Hits:  99%/23856 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 13s | Avg:  5m 04s | Max:  5m 30s | Hits: 100%/2340  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 21s | Avg: 14m 21s | Max: 14m 21s | Hits:  98%/695   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 24m 23s | Avg: 12m 11s | Max: 12m 23s | Hits:  98%/1390  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 38m 31s | Avg: 12m 50s | Max: 13m 02s | Hits:  98%/2085  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  98%/59  | Total: 10h 14m | Avg: 10m 24s | Max: 58m 28s | Hits: 100%/48936 
      🟩 GCC                Pass: 100%/63  | Total:  8h 26m | Avg:  8m 02s | Max: 27m 08s | Hits:  99%/52875 
      🟩 Intel              Pass: 100%/3   | Total: 15m 13s | Avg:  5m 04s | Max:  5m 30s | Hits: 100%/2340  
      🟩 MSVC               Pass: 100%/6   | Total:  1h 17m | Avg: 12m 52s | Max: 14m 21s | Hits:  98%/4170  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/99  | Total:  8h 06m | Avg:  4m 55s | Max: 14m 21s | Hits:  99%/81909 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 24m | Avg: 18m 06s | Max: 21m 00s | Hits:  99%/6816  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 42m | Avg: 20m 21s | Max: 31m 27s | Hits:  99%/6816  
      🔍 HostLaunch         Pass:  87%/8   | Total:  3h 04m | Avg: 23m 01s | Max: 45m 40s | Hits:  99%/5964  
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 54m | Avg: 29m 22s | Max: 58m 28s | Hits:  99%/6816  
    🔍 std: 17 🔍
      🟩 11                 Pass: 100%/34  | Total:  5h 26m | Avg:  9m 36s | Max: 58m 28s | Hits:  99%/28539 
      🟩 14                 Pass: 100%/37  | Total:  5h 48m | Avg:  9m 25s | Max: 42m 15s | Hits:  99%/30624 
      🔍 17                 Pass:  97%/36  | Total:  4h 56m | Avg:  8m 14s | Max: 31m 27s | Hits:  99%/29005 
      🟩 20                 Pass: 100%/24  | Total:  4h 01m | Avg: 10m 04s | Max: 27m 08s | Hits:  99%/20153 
    🟨 gpu
      🟨 v100               Pass:  99%/131 | Total: 20h 13m | Avg:  9m 15s | Max: 58m 28s | Hits:  99%/108321
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 13m 42s | Avg:  4m 34s | Max:  4m 40s | Hits:  99%/2556  
      🟩 90a                Pass: 100%/4   | Total: 14m 41s | Avg:  3m 40s | Max:  3m 50s | Hits:  99%/3408  
    
  • 🟩 thrust: Pass: 100%/118 | Total: 10h 55m | Avg: 5m 33s | Max: 21m 04s | Hits: 99%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 10h 25m | Avg:  5m 41s | Max: 21m 04s | Hits:  99%/129822
      🟩 arm64              Pass: 100%/8   | Total: 29m 42s | Avg:  3m 42s | Max:  4m 15s | Hits:  99%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 00m | Avg:  4m 03s | Max: 14m 50s | Hits:  99%/17705 
      🟩 11.8               Pass: 100%/3   | Total: 11m 14s | Avg:  3m 44s | Max:  4m 09s | Hits:  99%/3543  
      🟩 12.4               Pass: 100%/100 | Total:  9h 43m | Avg:  5m 50s | Max: 21m 04s | Hits:  99%/118018
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 54s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 00m | Avg:  4m 03s | Max: 14m 50s | Hits:  99%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 11m 14s | Avg:  3m 44s | Max:  4m 09s | Hits:  99%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total:  9h 35m | Avg:  5m 52s | Max: 21m 04s | Hits:  99%/115658
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 54s | Hits: 100%/2360  
      🟩 nvcc               Pass: 100%/116 | Total: 10h 47m | Avg:  5m 35s | Max: 21m 04s | Hits:  99%/136906
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 23m 03s | Avg:  3m 50s | Max:  4m 21s | Hits: 100%/7080  
      🟩 Clang10            Pass: 100%/3   | Total: 13m 12s | Avg:  4m 24s | Max:  4m 46s | Hits: 100%/3540  
      🟩 Clang11            Pass: 100%/4   | Total: 15m 14s | Avg:  3m 48s | Max:  4m 01s | Hits: 100%/4720  
      🟩 Clang12            Pass: 100%/4   | Total: 15m 02s | Avg:  3m 45s | Max:  3m 57s | Hits: 100%/4720  
      🟩 Clang13            Pass: 100%/4   | Total: 14m 31s | Avg:  3m 37s | Max:  3m 50s | Hits: 100%/4720  
      🟩 Clang14            Pass: 100%/4   | Total: 14m 45s | Avg:  3m 41s | Max:  3m 53s | Hits: 100%/4720  
      🟩 Clang15            Pass: 100%/4   | Total: 15m 47s | Avg:  3m 56s | Max:  4m 24s | Hits: 100%/4720  
      🟩 Clang16            Pass: 100%/4   | Total: 15m 16s | Avg:  3m 49s | Max:  3m 54s | Hits: 100%/4720  
      🟩 Clang17            Pass: 100%/18  | Total:  1h 57m | Avg:  6m 30s | Max: 20m 00s | Hits: 100%/21240 
      🟩 GCC6               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 05s | Hits:  99%/2360  
      🟩 GCC7               Pass: 100%/6   | Total: 20m 19s | Avg:  3m 23s | Max:  3m 51s | Hits:  99%/7086  
      🟩 GCC8               Pass: 100%/6   | Total: 20m 51s | Avg:  3m 28s | Max:  3m 49s | Hits:  99%/7086  
      🟩 GCC9               Pass: 100%/6   | Total: 21m 20s | Avg:  3m 33s | Max:  3m 57s | Hits:  99%/7086  
      🟩 GCC10              Pass: 100%/4   | Total: 15m 07s | Avg:  3m 46s | Max:  3m 57s | Hits:  99%/4724  
      🟩 GCC11              Pass: 100%/7   | Total: 26m 49s | Avg:  3m 49s | Max:  4m 09s | Hits:  99%/8267  
      🟩 GCC12              Pass: 100%/4   | Total: 16m 00s | Avg:  4m 00s | Max:  4m 12s | Hits:  99%/4724  
      🟩 GCC13              Pass: 100%/20  | Total:  2h 08m | Avg:  6m 24s | Max: 16m 55s | Hits:  99%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 37s | Hits: 100%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 50s | Avg: 14m 50s | Max: 14m 50s | Hits:  98%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 00s | Avg: 12m 30s | Max: 12m 33s | Hits:  98%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 43m | Avg: 17m 17s | Max: 21m 04s | Hits:  98%/7056  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  4h 03m | Avg:  4m 46s | Max: 20m 00s | Hits: 100%/60180 
      🟩 GCC                Pass: 100%/55  | Total:  4h 14m | Avg:  4m 37s | Max: 16m 55s | Hits:  99%/64953 
      🟩 Intel              Pass: 100%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 37s | Hits: 100%/3549  
      🟩 MSVC               Pass: 100%/9   | Total:  2h 23m | Avg: 15m 57s | Max: 21m 04s | Hits:  98%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 10h 55m | Avg:  5m 33s | Max: 21m 04s | Hits:  99%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  7h 11m | Avg:  4m 21s | Max: 14m 50s | Hits:  99%/116850
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 51m | Avg: 10m 06s | Max: 21m 04s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 53m | Avg: 14m 09s | Max: 20m 00s | Hits:  99%/9444  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 11m 14s | Avg:  3m 44s | Max:  4m 09s | Hits:  99%/3543  
      🟩 90a                Pass: 100%/4   | Total: 13m 43s | Avg:  3m 25s | Max:  3m 34s | Hits:  99%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 07m | Avg:  4m 14s | Max: 12m 55s | Hits:  99%/35418 
      🟩 14                 Pass: 100%/34  | Total:  3h 22m | Avg:  5m 57s | Max: 20m 31s | Hits:  99%/40122 
      🟩 17                 Pass: 100%/33  | Total:  3h 15m | Avg:  5m 54s | Max: 21m 04s | Hits:  99%/38946 
      🟩 20                 Pass: 100%/21  | Total:  2h 10m | Avg:  6m 13s | Max: 20m 41s | Hits:  99%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

@alliepiper
Copy link
Collaborator

@bernhardmgruber @gevtushenko idea: what do you think about adding a bench/downstream/ (or something like that) to put these in, e.g. bench/downstream/babelstream/transform_zips.cu?

We may want to have multiple benchmarks for a downstream that cover different algorithms, and keeping those in one place might make maintenance easier, especially if some downstream benchmarks end up mixing multiple algorithms with no_par or similar.

@bernhardmgruber
Copy link
Contributor Author

@bernhardmgruber @gevtushenko idea: what do you think about adding a bench/downstream/ (or something like that) to put these in, e.g. bench/downstream/babelstream/transform_zips.cu?

I would like that! I would then add the full set of BabelStream benchmarks, which also contain thrust::copy and thrust::inner_product. The reason I don't want to just use upstream BabelStream is that I want to benefit from nvbench and the tuning infrastructure. I also think BabelStream is important enough that it should be included in our tuning effort.

@bernhardmgruber
Copy link
Contributor Author

We discussed this PR with @gevtushenko and concluded that the babelstream kernels are just variants of thrust::transform and therefore belong into basic.cu for the tuning to work correctly.

Copy link
Contributor

github-actions bot commented Jul 1, 2024

🟨 CI finished in 3h 51m: Pass: 93%/249 | Total: 3d 00h | Avg: 17m 35s | Max: 46m 43s | Hits: 87%/231914
  • 🟨 thrust: Pass: 92%/118 | Total: 11h 35m | Avg: 5m 53s | Max: 31m 03s | Hits: 98%/128639

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  91%/110 | Total: 10h 42m | Avg:  5m 50s | Max: 31m 03s | Hits:  99%/119195
      🟩 arm64              Pass: 100%/8   | Total: 53m 08s | Avg:  6m 38s | Max: 27m 52s | Hits:  95%/9444  
    🔍 ctk: 12.4 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 00m | Avg:  4m 00s | Max: 15m 02s | Hits:  99%/17705 
      🟩 11.8               Pass: 100%/3   | Total: 38m 13s | Avg: 12m 44s | Max: 31m 03s | Hits:  88%/3543  
      🔍 12.4               Pass:  91%/100 | Total:  9h 57m | Avg:  5m 58s | Max: 27m 52s | Hits:  98%/107391
    🔍 cudacxx: nvcc12.4 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 55s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 00m | Avg:  4m 00s | Max: 15m 02s | Hits:  99%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 38m 13s | Avg: 12m 44s | Max: 31m 03s | Hits:  88%/3543  
      🔍 nvcc12.4           Pass:  90%/98  | Total:  9h 49m | Avg:  6m 01s | Max: 27m 52s | Hits:  98%/105031
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 55s | Hits: 100%/2360  
      🔍 nvcc               Pass:  92%/116 | Total: 11h 28m | Avg:  5m 55s | Max: 31m 03s | Hits:  98%/126279
    🔍 sm: 90a 🔍
      🟩 60;70;80;90        Pass: 100%/3   | Total: 38m 13s | Avg: 12m 44s | Max: 31m 03s | Hits:  88%/3543  
      🔍 90a                Pass:  75%/4   | Total: 12m 59s | Avg:  3m 14s | Max:  3m 19s | Hits:  99%/3543  
    🟨 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 22m 47s | Avg:  3m 47s | Max:  4m 15s | Hits: 100%/7080  
      🟨 Clang10            Pass:  66%/3   | Total: 12m 47s | Avg:  4m 15s | Max:  4m 22s | Hits: 100%/2360  
      🟨 Clang11            Pass:  75%/4   | Total: 14m 14s | Avg:  3m 33s | Max:  3m 47s | Hits: 100%/3540  
      🟩 Clang12            Pass: 100%/4   | Total: 14m 37s | Avg:  3m 39s | Max:  3m 46s | Hits: 100%/4720  
      🟩 Clang13            Pass: 100%/4   | Total: 14m 53s | Avg:  3m 43s | Max:  3m 50s | Hits:  99%/4720  
      🟩 Clang14            Pass: 100%/4   | Total: 14m 48s | Avg:  3m 42s | Max:  3m 52s | Hits: 100%/4720  
      🟩 Clang15            Pass: 100%/4   | Total: 14m 40s | Avg:  3m 40s | Max:  3m 45s | Hits: 100%/4720  
      🟩 Clang16            Pass: 100%/4   | Total: 14m 58s | Avg:  3m 44s | Max:  3m 58s | Hits: 100%/4720  
      🟨 Clang17            Pass:  88%/18  | Total:  1h 57m | Avg:  6m 32s | Max: 20m 09s | Hits: 100%/18880 
      🟩 GCC6               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 04s | Hits:  99%/2360  
      🟩 GCC7               Pass: 100%/6   | Total: 20m 30s | Avg:  3m 25s | Max:  3m 47s | Hits:  99%/7086  
      🟩 GCC8               Pass: 100%/6   | Total: 20m 07s | Avg:  3m 21s | Max:  3m 45s | Hits:  99%/7086  
      🟨 GCC9               Pass:  83%/6   | Total: 20m 30s | Avg:  3m 25s | Max:  3m 41s | Hits:  99%/5905  
      🟨 GCC10              Pass:  75%/4   | Total: 14m 51s | Avg:  3m 42s | Max:  3m 52s | Hits:  99%/3543  
      🟩 GCC11              Pass: 100%/7   | Total: 53m 49s | Avg:  7m 41s | Max: 31m 03s | Hits:  95%/8267  
      🟩 GCC12              Pass: 100%/4   | Total: 15m 12s | Avg:  3m 48s | Max:  3m 57s | Hits:  99%/4724  
      🟨 GCC13              Pass:  90%/20  | Total:  2h 29m | Avg:  7m 28s | Max: 27m 52s | Hits:  95%/21258 
      🟨 Intel2023.2.0      Pass:  66%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 39s | Hits: 100%/2366  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s | Hits:  98%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 12s | Avg: 12m 36s | Max: 12m 39s | Hits:  98%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 39m | Avg: 16m 39s | Max: 20m 00s | Hits:  98%/7056  
    🟨 cxx_family
      🟨 Clang              Pass:  92%/51  | Total:  4h 01m | Avg:  4m 43s | Max: 20m 09s | Hits:  99%/55460 
      🟨 GCC                Pass:  92%/55  | Total:  5h 00m | Avg:  5m 27s | Max: 31m 03s | Hits:  97%/60229 
      🟨 Intel              Pass:  66%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 39s | Hits: 100%/2366  
      🟩 MSVC               Pass: 100%/9   | Total:  2h 20m | Avg: 15m 34s | Max: 20m 00s | Hits:  98%/10584 
    🟨 gpu
      🟨 v100               Pass:  92%/118 | Total: 11h 35m | Avg:  5m 53s | Max: 31m 03s | Hits:  98%/128639
    🟨 jobs
      🟨 Build              Pass:  92%/99  | Total:  7h 59m | Avg:  4m 50s | Max: 31m 03s | Hits:  98%/108584
      🟨 TestCPU            Pass:  90%/11  | Total:  1h 45m | Avg:  9m 37s | Max: 20m 00s | Hits:  99%/11792 
      🟨 TestGPU            Pass:  87%/8   | Total:  1h 50m | Avg: 13m 50s | Max: 20m 09s | Hits:  99%/8263  
    🟨 std
      🟨 11                 Pass:  93%/30  | Total:  2h 35m | Avg:  5m 11s | Max: 31m 03s | Hits:  97%/33058 
      🟨 14                 Pass:  85%/34  | Total:  3h 15m | Avg:  5m 45s | Max: 18m 42s | Hits:  99%/34216 
      🟨 17                 Pass:  96%/33  | Total:  3h 28m | Avg:  6m 19s | Max: 27m 52s | Hits:  98%/37766 
      🟨 20                 Pass:  95%/21  | Total:  2h 15m | Avg:  6m 27s | Max: 20m 09s | Hits:  99%/23599 
    
  • 🟨 cub: Pass: 94%/131 | Total: 2d 13h | Avg: 28m 07s | Max: 46m 43s | Hits: 73%/103275

    🔍 ctk: 12.4 🔍
      🟩 11.1               Pass: 100%/15  | Total:  7h 31m | Avg: 30m 04s | Max: 46m 43s | Hits:  61%/11568 
      🟩 11.8               Pass: 100%/3   | Total:  2h 00m | Avg: 40m 16s | Max: 42m 01s | Hits:  64%/2556  
      🔍 12.4               Pass:  93%/113 | Total:  2d 03h | Avg: 27m 32s | Max: 46m 40s | Hits:  75%/89151 
    🔍 cudacxx: nvcc12.4 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 40m 34s | Avg: 20m 17s | Max: 20m 41s | Hits:  67%/1408  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 31m | Avg: 30m 04s | Max: 46m 43s | Hits:  61%/11568 
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 00m | Avg: 40m 16s | Max: 42m 01s | Hits:  64%/2556  
      🔍 nvcc12.4           Pass:  93%/111 | Total:  2d 03h | Avg: 27m 40s | Max: 46m 40s | Hits:  75%/87743 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 40m 34s | Avg: 20m 17s | Max: 20m 41s | Hits:  67%/1408  
      🔍 nvcc               Pass:  94%/129 | Total:  2d 12h | Avg: 28m 14s | Max: 46m 43s | Hits:  73%/101867
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  92%/99  | Total:  2d 02h | Avg: 30m 47s | Max: 46m 43s | Hits:  64%/76011 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 24m | Avg: 18m 07s | Max: 29m 07s | Hits:  99%/6816  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 10m | Avg: 16m 16s | Max: 18m 30s | Hits:  99%/6816  
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 32m | Avg: 19m 00s | Max: 29m 49s | Hits:  99%/6816  
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 29m | Avg: 26m 07s | Max: 33m 58s | Hits:  99%/6816  
    🔍 sm: 90a 🔍
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 00m | Avg: 40m 16s | Max: 42m 01s | Hits:  64%/2556  
      🔍 90a                Pass:  75%/4   | Total:  1h 06m | Avg: 16m 31s | Max: 16m 47s | Hits:  64%/2556  
    🟨 cxx
      🟨 Clang9             Pass:  83%/6   | Total:  3h 00m | Avg: 30m 00s | Max: 31m 54s | Hits:  63%/4036  
      🟩 Clang10            Pass: 100%/3   | Total:  1h 31m | Avg: 30m 25s | Max: 32m 09s | Hits:  65%/2562  
      🟨 Clang11            Pass:  75%/4   | Total:  2h 00m | Avg: 30m 00s | Max: 30m 17s | Hits:  65%/2562  
      🟩 Clang12            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 51s | Max: 34m 44s | Hits:  65%/3416  
      🟩 Clang13            Pass: 100%/4   | Total:  1h 55m | Avg: 28m 47s | Max: 31m 39s | Hits:  65%/3416  
      🟩 Clang14            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 28s | Max: 32m 17s | Hits:  65%/3416  
      🟩 Clang15            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 00s | Max: 30m 55s | Hits:  65%/3408  
      🟩 Clang16            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 15s | Max: 32m 36s | Hits:  65%/3408  
      🟨 Clang17            Pass:  96%/26  | Total:  9h 49m | Avg: 22m 41s | Max: 34m 28s | Hits:  87%/21004 
      🟩 GCC6               Pass: 100%/2   | Total: 58m 52s | Avg: 29m 26s | Max: 30m 14s | Hits:  61%/1552  
      🟩 GCC7               Pass: 100%/6   | Total:  2h 56m | Avg: 29m 27s | Max: 31m 42s | Hits:  63%/4893  
      🟩 GCC8               Pass: 100%/6   | Total:  2h 53m | Avg: 28m 50s | Max: 30m 19s | Hits:  63%/4893  
      🟩 GCC9               Pass: 100%/6   | Total:  2h 59m | Avg: 29m 53s | Max: 31m 01s | Hits:  63%/4893  
      🟨 GCC10              Pass:  75%/4   | Total:  2h 05m | Avg: 31m 29s | Max: 34m 06s | Hits:  64%/2562  
      🟨 GCC11              Pass:  85%/7   | Total:  4h 00m | Avg: 34m 20s | Max: 42m 01s | Hits:  64%/5112  
      🟩 GCC12              Pass: 100%/4   | Total:  2h 03m | Avg: 30m 58s | Max: 31m 10s | Hits:  64%/3408  
      🟨 GCC13              Pass:  96%/28  | Total: 10h 46m | Avg: 23m 04s | Max: 33m 58s | Hits:  85%/23004 
      🟨 Intel2023.2.0      Pass:  66%/3   | Total:  1h 51m | Avg: 37m 07s | Max: 37m 37s | Hits:  62%/1560  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 46m 43s | Avg: 46m 43s | Max: 46m 43s | Hits:  66%/695   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 16s | Max: 43m 05s | Hits:  66%/1390  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 14m | Avg: 44m 46s | Max: 46m 40s | Hits:  66%/2085  
    🟨 cxx_family
      🟨 Clang              Pass:  94%/59  | Total:  1d 02h | Avg: 26m 49s | Max: 34m 44s | Hits:  75%/47228 
      🟨 GCC                Pass:  95%/63  | Total:  1d 04h | Avg: 27m 22s | Max: 42m 01s | Hits:  73%/50317 
      🟨 Intel              Pass:  66%/3   | Total:  1h 51m | Avg: 37m 07s | Max: 37m 37s | Hits:  62%/1560  
      🟩 MSVC               Pass: 100%/6   | Total:  4h 25m | Avg: 44m 16s | Max: 46m 43s | Hits:  66%/4170  
    🟨 std
      🟨 11                 Pass:  91%/34  | Total: 15h 42m | Avg: 27m 43s | Max: 42m 01s | Hits:  73%/25979 
      🟨 14                 Pass:  91%/37  | Total: 17h 52m | Avg: 28m 59s | Max: 46m 43s | Hits:  73%/28138 
      🟩 17                 Pass: 100%/36  | Total: 16h 36m | Avg: 27m 40s | Max: 42m 34s | Hits:  72%/29857 
      🟨 20                 Pass:  95%/24  | Total: 11h 12m | Avg: 28m 01s | Max: 45m 06s | Hits:  77%/19301 
    🟨 gpu
      🟨 v100               Pass:  94%/131 | Total:  2d 13h | Avg: 28m 07s | Max: 46m 43s | Hits:  73%/103275
    🟨 cpu
      🟨 amd64              Pass:  95%/123 | Total:  2d 09h | Avg: 27m 56s | Max: 46m 43s | Hits:  74%/97311 
      🟨 arm64              Pass:  87%/8   | Total:  4h 06m | Avg: 30m 46s | Max: 33m 24s | Hits:  64%/5964  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

Copy link
Contributor

github-actions bot commented Jul 2, 2024

🟩 CI finished in 3h 51m: Pass: 100%/249 | Total: 3d 00h | Avg: 17m 35s | Max: 46m 43s | Hits: 87%/248439
  • 🟩 cub: Pass: 100%/131 | Total: 2d 13h | Avg: 28m 07s | Max: 46m 43s | Hits: 73%/109173

    🟩 cpu
      🟩 amd64              Pass: 100%/123 | Total:  2d 09h | Avg: 27m 56s | Max: 46m 43s | Hits:  73%/102357
      🟩 arm64              Pass: 100%/8   | Total:  4h 06m | Avg: 30m 46s | Max: 33m 24s | Hits:  65%/6816  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  7h 31m | Avg: 30m 04s | Max: 46m 43s | Hits:  61%/11568 
      🟩 11.8               Pass: 100%/3   | Total:  2h 00m | Avg: 40m 16s | Max: 42m 01s | Hits:  64%/2556  
      🟩 12.4               Pass: 100%/113 | Total:  2d 03h | Avg: 27m 32s | Max: 46m 40s | Hits:  75%/95049 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total: 40m 34s | Avg: 20m 17s | Max: 20m 41s | Hits:  67%/1408  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  7h 31m | Avg: 30m 04s | Max: 46m 43s | Hits:  61%/11568 
      🟩 nvcc11.8           Pass: 100%/3   | Total:  2h 00m | Avg: 40m 16s | Max: 42m 01s | Hits:  64%/2556  
      🟩 nvcc12.4           Pass: 100%/111 | Total:  2d 03h | Avg: 27m 40s | Max: 46m 40s | Hits:  75%/93641 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 40m 34s | Avg: 20m 17s | Max: 20m 41s | Hits:  67%/1408  
      🟩 nvcc               Pass: 100%/129 | Total:  2d 12h | Avg: 28m 14s | Max: 46m 43s | Hits:  73%/107765
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 00m | Avg: 30m 00s | Max: 31m 54s | Hits:  63%/4890  
      🟩 Clang10            Pass: 100%/3   | Total:  1h 31m | Avg: 30m 25s | Max: 32m 09s | Hits:  65%/2562  
      🟩 Clang11            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 00s | Max: 30m 17s | Hits:  65%/3416  
      🟩 Clang12            Pass: 100%/4   | Total:  2h 03m | Avg: 30m 51s | Max: 34m 44s | Hits:  65%/3416  
      🟩 Clang13            Pass: 100%/4   | Total:  1h 55m | Avg: 28m 47s | Max: 31m 39s | Hits:  65%/3416  
      🟩 Clang14            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 28s | Max: 32m 17s | Hits:  65%/3416  
      🟩 Clang15            Pass: 100%/4   | Total:  2h 00m | Avg: 30m 00s | Max: 30m 55s | Hits:  65%/3408  
      🟩 Clang16            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 15s | Max: 32m 36s | Hits:  65%/3408  
      🟩 Clang17            Pass: 100%/26  | Total:  9h 49m | Avg: 22m 41s | Max: 34m 28s | Hits:  87%/21856 
      🟩 GCC6               Pass: 100%/2   | Total: 58m 52s | Avg: 29m 26s | Max: 30m 14s | Hits:  61%/1552  
      🟩 GCC7               Pass: 100%/6   | Total:  2h 56m | Avg: 29m 27s | Max: 31m 42s | Hits:  63%/4893  
      🟩 GCC8               Pass: 100%/6   | Total:  2h 53m | Avg: 28m 50s | Max: 30m 19s | Hits:  63%/4893  
      🟩 GCC9               Pass: 100%/6   | Total:  2h 59m | Avg: 29m 53s | Max: 31m 01s | Hits:  63%/4893  
      🟩 GCC10              Pass: 100%/4   | Total:  2h 05m | Avg: 31m 29s | Max: 34m 06s | Hits:  64%/3416  
      🟩 GCC11              Pass: 100%/7   | Total:  4h 00m | Avg: 34m 20s | Max: 42m 01s | Hits:  64%/5964  
      🟩 GCC12              Pass: 100%/4   | Total:  2h 03m | Avg: 30m 58s | Max: 31m 10s | Hits:  64%/3408  
      🟩 GCC13              Pass: 100%/28  | Total: 10h 46m | Avg: 23m 04s | Max: 33m 58s | Hits:  84%/23856 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 51m | Avg: 37m 07s | Max: 37m 37s | Hits:  62%/2340  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 46m 43s | Avg: 46m 43s | Max: 46m 43s | Hits:  66%/695   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 16s | Max: 43m 05s | Hits:  66%/1390  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 14m | Avg: 44m 46s | Max: 46m 40s | Hits:  66%/2085  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  1d 02h | Avg: 26m 49s | Max: 34m 44s | Hits:  74%/49788 
      🟩 GCC                Pass: 100%/63  | Total:  1d 04h | Avg: 27m 22s | Max: 42m 01s | Hits:  73%/52875 
      🟩 Intel              Pass: 100%/3   | Total:  1h 51m | Avg: 37m 07s | Max: 37m 37s | Hits:  62%/2340  
      🟩 MSVC               Pass: 100%/6   | Total:  4h 25m | Avg: 44m 16s | Max: 46m 43s | Hits:  66%/4170  
    🟩 gpu
      🟩 v100               Pass: 100%/131 | Total:  2d 13h | Avg: 28m 07s | Max: 46m 43s | Hits:  73%/109173
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  2d 02h | Avg: 30m 47s | Max: 46m 43s | Hits:  64%/81909 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 24m | Avg: 18m 07s | Max: 29m 07s | Hits:  99%/6816  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 10m | Avg: 16m 16s | Max: 18m 30s | Hits:  99%/6816  
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 32m | Avg: 19m 00s | Max: 29m 49s | Hits:  99%/6816  
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 29m | Avg: 26m 07s | Max: 33m 58s | Hits:  99%/6816  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  2h 00m | Avg: 40m 16s | Max: 42m 01s | Hits:  64%/2556  
      🟩 90a                Pass: 100%/4   | Total:  1h 06m | Avg: 16m 31s | Max: 16m 47s | Hits:  64%/3408  
    🟩 std
      🟩 11                 Pass: 100%/34  | Total: 15h 42m | Avg: 27m 43s | Max: 42m 01s | Hits:  72%/28539 
      🟩 14                 Pass: 100%/37  | Total: 17h 52m | Avg: 28m 59s | Max: 46m 43s | Hits:  72%/30624 
      🟩 17                 Pass: 100%/36  | Total: 16h 36m | Avg: 27m 40s | Max: 42m 34s | Hits:  72%/29857 
      🟩 20                 Pass: 100%/24  | Total: 11h 12m | Avg: 28m 01s | Max: 45m 06s | Hits:  76%/20153 
    
  • 🟩 thrust: Pass: 100%/118 | Total: 11h 35m | Avg: 5m 53s | Max: 31m 03s | Hits: 98%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 10h 42m | Avg:  5m 50s | Max: 31m 03s | Hits:  99%/129822
      🟩 arm64              Pass: 100%/8   | Total: 53m 08s | Avg:  6m 38s | Max: 27m 52s | Hits:  95%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 00m | Avg:  4m 00s | Max: 15m 02s | Hits:  99%/17705 
      🟩 11.8               Pass: 100%/3   | Total: 38m 13s | Avg: 12m 44s | Max: 31m 03s | Hits:  88%/3543  
      🟩 12.4               Pass: 100%/100 | Total:  9h 57m | Avg:  5m 58s | Max: 27m 52s | Hits:  99%/118018
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 55s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 00m | Avg:  4m 00s | Max: 15m 02s | Hits:  99%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 38m 13s | Avg: 12m 44s | Max: 31m 03s | Hits:  88%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total:  9h 49m | Avg:  6m 01s | Max: 27m 52s | Hits:  99%/115658
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 39s | Avg:  3m 49s | Max:  3m 55s | Hits: 100%/2360  
      🟩 nvcc               Pass: 100%/116 | Total: 11h 28m | Avg:  5m 55s | Max: 31m 03s | Hits:  98%/136906
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 22m 47s | Avg:  3m 47s | Max:  4m 15s | Hits: 100%/7080  
      🟩 Clang10            Pass: 100%/3   | Total: 12m 47s | Avg:  4m 15s | Max:  4m 22s | Hits: 100%/3540  
      🟩 Clang11            Pass: 100%/4   | Total: 14m 14s | Avg:  3m 33s | Max:  3m 47s | Hits: 100%/4720  
      🟩 Clang12            Pass: 100%/4   | Total: 14m 37s | Avg:  3m 39s | Max:  3m 46s | Hits: 100%/4720  
      🟩 Clang13            Pass: 100%/4   | Total: 14m 53s | Avg:  3m 43s | Max:  3m 50s | Hits:  99%/4720  
      🟩 Clang14            Pass: 100%/4   | Total: 14m 48s | Avg:  3m 42s | Max:  3m 52s | Hits: 100%/4720  
      🟩 Clang15            Pass: 100%/4   | Total: 14m 40s | Avg:  3m 40s | Max:  3m 45s | Hits: 100%/4720  
      🟩 Clang16            Pass: 100%/4   | Total: 14m 58s | Avg:  3m 44s | Max:  3m 58s | Hits: 100%/4720  
      🟩 Clang17            Pass: 100%/18  | Total:  1h 57m | Avg:  6m 32s | Max: 20m 09s | Hits: 100%/21240 
      🟩 GCC6               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 04s | Hits:  99%/2360  
      🟩 GCC7               Pass: 100%/6   | Total: 20m 30s | Avg:  3m 25s | Max:  3m 47s | Hits:  99%/7086  
      🟩 GCC8               Pass: 100%/6   | Total: 20m 07s | Avg:  3m 21s | Max:  3m 45s | Hits:  99%/7086  
      🟩 GCC9               Pass: 100%/6   | Total: 20m 30s | Avg:  3m 25s | Max:  3m 41s | Hits:  99%/7086  
      🟩 GCC10              Pass: 100%/4   | Total: 14m 51s | Avg:  3m 42s | Max:  3m 52s | Hits:  99%/4724  
      🟩 GCC11              Pass: 100%/7   | Total: 53m 49s | Avg:  7m 41s | Max: 31m 03s | Hits:  95%/8267  
      🟩 GCC12              Pass: 100%/4   | Total: 15m 12s | Avg:  3m 48s | Max:  3m 57s | Hits:  99%/4724  
      🟩 GCC13              Pass: 100%/20  | Total:  2h 29m | Avg:  7m 28s | Max: 27m 52s | Hits:  95%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 39s | Hits: 100%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 02s | Avg: 15m 02s | Max: 15m 02s | Hits:  98%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 12s | Avg: 12m 36s | Max: 12m 39s | Hits:  98%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 39m | Avg: 16m 39s | Max: 20m 00s | Hits:  98%/7056  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  4h 01m | Avg:  4m 43s | Max: 20m 09s | Hits:  99%/60180 
      🟩 GCC                Pass: 100%/55  | Total:  5h 00m | Avg:  5m 27s | Max: 31m 03s | Hits:  97%/64953 
      🟩 Intel              Pass: 100%/3   | Total: 13m 35s | Avg:  4m 31s | Max:  4m 39s | Hits: 100%/3549  
      🟩 MSVC               Pass: 100%/9   | Total:  2h 20m | Avg: 15m 34s | Max: 20m 00s | Hits:  98%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 11h 35m | Avg:  5m 53s | Max: 31m 03s | Hits:  98%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  7h 59m | Avg:  4m 50s | Max: 31m 03s | Hits:  98%/116850
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 45m | Avg:  9m 37s | Max: 20m 00s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 50m | Avg: 13m 50s | Max: 20m 09s | Hits:  99%/9444  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 38m 13s | Avg: 12m 44s | Max: 31m 03s | Hits:  88%/3543  
      🟩 90a                Pass: 100%/4   | Total: 12m 59s | Avg:  3m 14s | Max:  3m 19s | Hits:  99%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 35m | Avg:  5m 11s | Max: 31m 03s | Hits:  97%/35418 
      🟩 14                 Pass: 100%/34  | Total:  3h 15m | Avg:  5m 45s | Max: 18m 42s | Hits:  99%/40122 
      🟩 17                 Pass: 100%/33  | Total:  3h 28m | Avg:  6m 19s | Max: 27m 52s | Hits:  98%/38946 
      🟩 20                 Pass: 100%/21  | Total:  2h 15m | Avg:  6m 27s | Max: 20m 09s | Hits:  99%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

@bernhardmgruber
Copy link
Contributor Author

bernhardmgruber commented Jul 3, 2024

Regarding the tested element types, BabelStream uses double by default and float if requested. We may want to test additional data types of especially different sizes though.

@ahendriksen uses int8 -> int128 in his measurements. According to him, the reduced throughput of integer arithmetic should not matter since the benchmark is highly memory-bound.

Alternatively, we could use int8, int16 or half, float, double and int128, so we cover all sizes and adhere to BabelStream's types.

@jrhemstad
Copy link
Collaborator

Alternatively, we could use int8, int16 or half, float, double and int128, so we cover all sizes and adhere to BabelStream's types.

I like this idea.

thrust/benchmarks/CMakeLists.txt Outdated Show resolved Hide resolved
thrust/benchmarks/bench/transform/basic.cu Outdated Show resolved Hide resolved
thrust/benchmarks/bench/transform/basic.cu Outdated Show resolved Hide resolved
See BabelStream Thrust implementation: https://github.com/UoB-HPC/BabelStream/blob/main/src/thrust/ThrustStream.cu

Co-authored-by: Georgii Evtushenko <evtushenko.georgy@gmail.com>
Copy link
Contributor

github-actions bot commented Jul 3, 2024

🟨 CI finished in 3h 58m: Pass: 99%/249 | Total: 1d 10h | Avg: 8m 23s | Max: 28m 57s | Hits: 98%/247587
  • 🟨 cub: Pass: 99%/131 | Total: 23h 39m | Avg: 10m 50s | Max: 28m 57s | Hits: 97%/108321

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/123 | Total: 22h 13m | Avg: 10m 50s | Max: 28m 57s | Hits:  97%/101505
      🟩 arm64              Pass: 100%/8   | Total:  1h 25m | Avg: 10m 43s | Max: 11m 28s | Hits:  96%/6816  
    🔍 ctk: 12.4 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 05m | Avg:  4m 22s | Max: 14m 28s | Hits:  99%/11568 
      🟩 11.8               Pass: 100%/3   | Total: 32m 01s | Avg: 10m 40s | Max: 11m 16s | Hits:  96%/2556  
      🔍 12.4               Pass:  99%/113 | Total: 22h 01m | Avg: 11m 41s | Max: 28m 57s | Hits:  97%/94197 
    🔍 cudacxx: nvcc12.4 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  3m 51s | Hits: 100%/1408  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 05m | Avg:  4m 22s | Max: 14m 28s | Hits:  99%/11568 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 32m 01s | Avg: 10m 40s | Max: 11m 16s | Hits:  96%/2556  
      🔍 nvcc12.4           Pass:  99%/111 | Total: 21h 54m | Avg: 11m 50s | Max: 28m 57s | Hits:  97%/92789 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  3m 51s | Hits: 100%/1408  
      🔍 nvcc               Pass:  99%/129 | Total: 23h 32m | Avg: 10m 56s | Max: 28m 57s | Hits:  97%/106913
    🔍 cxx: Clang17 🔍
      🟩 Clang9             Pass: 100%/6   | Total: 38m 43s | Avg:  6m 27s | Max:  9m 07s | Hits:  98%/4890  
      🟩 Clang10            Pass: 100%/3   | Total: 28m 41s | Avg:  9m 33s | Max: 10m 04s | Hits:  96%/2562  
      🟩 Clang11            Pass: 100%/4   | Total: 35m 35s | Avg:  8m 53s | Max:  9m 40s | Hits:  96%/3416  
      🟩 Clang12            Pass: 100%/4   | Total: 34m 34s | Avg:  8m 38s | Max:  9m 10s | Hits:  96%/3416  
      🟩 Clang13            Pass: 100%/4   | Total: 35m 30s | Avg:  8m 52s | Max:  9m 19s | Hits:  96%/3416  
      🟩 Clang14            Pass: 100%/4   | Total: 36m 32s | Avg:  9m 08s | Max:  9m 22s | Hits:  96%/3416  
      🟩 Clang15            Pass: 100%/4   | Total: 35m 31s | Avg:  8m 52s | Max:  9m 02s | Hits:  96%/3408  
      🟩 Clang16            Pass: 100%/4   | Total: 35m 16s | Avg:  8m 49s | Max:  9m 00s | Hits:  96%/3408  
      🔍 Clang17            Pass:  96%/26  | Total:  6h 12m | Avg: 14m 18s | Max: 26m 15s | Hits:  98%/21004 
      🟩 GCC6               Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 31s | Hits:  99%/1552  
      🟩 GCC7               Pass: 100%/6   | Total: 37m 34s | Avg:  6m 15s | Max:  9m 46s | Hits:  97%/4893  
      🟩 GCC8               Pass: 100%/6   | Total: 37m 43s | Avg:  6m 17s | Max:  9m 12s | Hits:  97%/4893  
      🟩 GCC9               Pass: 100%/6   | Total: 38m 22s | Avg:  6m 23s | Max:  9m 43s | Hits:  97%/4893  
      🟩 GCC10              Pass: 100%/4   | Total: 37m 04s | Avg:  9m 16s | Max: 10m 00s | Hits:  96%/3416  
      🟩 GCC11              Pass: 100%/7   | Total:  1h 07m | Avg:  9m 39s | Max: 11m 16s | Hits:  96%/5964  
      🟩 GCC12              Pass: 100%/4   | Total: 36m 19s | Avg:  9m 04s | Max:  9m 08s | Hits:  96%/3408  
      🟩 GCC13              Pass: 100%/28  | Total:  6h 53m | Avg: 14m 47s | Max: 28m 57s | Hits:  98%/23856 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 18s | Avg:  5m 06s | Max:  5m 13s | Hits: 100%/2340  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 28s | Avg: 14m 28s | Max: 14m 28s | Hits:  98%/695   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 23m 35s | Avg: 11m 47s | Max: 11m 58s | Hits:  98%/1390  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 38m 16s | Avg: 12m 45s | Max: 13m 11s | Hits:  98%/2085  
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  98%/59  | Total: 10h 52m | Avg: 11m 03s | Max: 26m 15s | Hits:  97%/48936 
      🟩 GCC                Pass: 100%/63  | Total: 11h 15m | Avg: 10m 43s | Max: 28m 57s | Hits:  97%/52875 
      🟩 Intel              Pass: 100%/3   | Total: 15m 18s | Avg:  5m 06s | Max:  5m 13s | Hits: 100%/2340  
      🟩 MSVC               Pass: 100%/6   | Total:  1h 16m | Avg: 12m 43s | Max: 14m 28s | Hits:  98%/4170  
    🔍 jobs: TestGPU 🔍
      🟩 Build              Pass: 100%/99  | Total: 13h 44m | Avg:  8m 19s | Max: 14m 28s | Hits:  97%/81909 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 28m | Avg: 18m 31s | Max: 22m 43s | Hits:  99%/6816  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 06m | Avg: 15m 46s | Max: 17m 15s | Hits:  99%/6816  
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 20m | Avg: 17m 33s | Max: 19m 21s | Hits:  99%/6816  
      🔍 TestGPU            Pass:  87%/8   | Total:  3h 00m | Avg: 22m 33s | Max: 28m 57s | Hits:  99%/5964  
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/34  | Total:  5h 48m | Avg: 10m 14s | Max: 24m 22s | Hits:  97%/28539 
      🟩 14                 Pass: 100%/37  | Total:  6h 33m | Avg: 10m 38s | Max: 28m 57s | Hits:  97%/30624 
      🟩 17                 Pass: 100%/36  | Total:  6h 31m | Avg: 10m 52s | Max: 26m 15s | Hits:  97%/29857 
      🔍 20                 Pass:  95%/24  | Total:  4h 46m | Avg: 11m 56s | Max: 25m 57s | Hits:  97%/19301 
    🟨 gpu
      🟨 v100               Pass:  99%/131 | Total: 23h 39m | Avg: 10m 50s | Max: 28m 57s | Hits:  97%/108321
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 32m 01s | Avg: 10m 40s | Max: 11m 16s | Hits:  96%/2556  
      🟩 90a                Pass: 100%/4   | Total: 23m 21s | Avg:  5m 50s | Max:  5m 59s | Hits:  96%/3408  
    
  • 🟩 thrust: Pass: 100%/118 | Total: 11h 11m | Avg: 5m 41s | Max: 26m 59s | Hits: 99%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 10h 17m | Avg:  5m 36s | Max: 20m 10s | Hits:  99%/129822
      🟩 arm64              Pass: 100%/8   | Total: 54m 13s | Avg:  6m 46s | Max: 26m 59s | Hits:  90%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 01m | Avg:  4m 05s | Max: 15m 26s | Hits:  99%/17705 
      🟩 11.8               Pass: 100%/3   | Total: 11m 04s | Avg:  3m 41s | Max:  3m 56s | Hits:  99%/3543  
      🟩 12.4               Pass: 100%/100 | Total:  9h 59m | Avg:  5m 59s | Max: 26m 59s | Hits:  99%/118018
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 24s | Avg:  3m 42s | Max:  3m 49s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 01m | Avg:  4m 05s | Max: 15m 26s | Hits:  99%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 11m 04s | Avg:  3m 41s | Max:  3m 56s | Hits:  99%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total:  9h 51m | Avg:  6m 02s | Max: 26m 59s | Hits:  99%/115658
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 24s | Avg:  3m 42s | Max:  3m 49s | Hits: 100%/2360  
      🟩 nvcc               Pass: 100%/116 | Total: 11h 04m | Avg:  5m 43s | Max: 26m 59s | Hits:  99%/136906
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 22m 53s | Avg:  3m 48s | Max:  4m 20s | Hits: 100%/7080  
      🟩 Clang10            Pass: 100%/3   | Total: 12m 53s | Avg:  4m 17s | Max:  4m 27s | Hits: 100%/3540  
      🟩 Clang11            Pass: 100%/4   | Total: 15m 00s | Avg:  3m 45s | Max:  4m 08s | Hits: 100%/4720  
      🟩 Clang12            Pass: 100%/4   | Total: 14m 30s | Avg:  3m 37s | Max:  3m 42s | Hits: 100%/4720  
      🟩 Clang13            Pass: 100%/4   | Total: 15m 07s | Avg:  3m 46s | Max:  4m 11s | Hits: 100%/4720  
      🟩 Clang14            Pass: 100%/4   | Total: 14m 40s | Avg:  3m 40s | Max:  3m 45s | Hits: 100%/4720  
      🟩 Clang15            Pass: 100%/4   | Total: 14m 59s | Avg:  3m 44s | Max:  3m 54s | Hits: 100%/4720  
      🟩 Clang16            Pass: 100%/4   | Total: 15m 04s | Avg:  3m 46s | Max:  3m 56s | Hits: 100%/4720  
      🟩 Clang17            Pass: 100%/18  | Total:  1h 55m | Avg:  6m 24s | Max: 14m 22s | Hits: 100%/21240 
      🟩 GCC6               Pass: 100%/2   | Total:  6m 46s | Avg:  3m 23s | Max:  3m 49s | Hits:  99%/2360  
      🟩 GCC7               Pass: 100%/6   | Total: 20m 23s | Avg:  3m 23s | Max:  3m 53s | Hits:  99%/7086  
      🟩 GCC8               Pass: 100%/6   | Total: 19m 37s | Avg:  3m 16s | Max:  3m 33s | Hits:  99%/7086  
      🟩 GCC9               Pass: 100%/6   | Total: 21m 33s | Avg:  3m 35s | Max:  4m 13s | Hits:  99%/7086  
      🟩 GCC10              Pass: 100%/4   | Total: 15m 07s | Avg:  3m 46s | Max:  4m 06s | Hits:  99%/4724  
      🟩 GCC11              Pass: 100%/7   | Total: 26m 44s | Avg:  3m 49s | Max:  4m 08s | Hits:  99%/8267  
      🟩 GCC12              Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 03s | Hits:  99%/4724  
      🟩 GCC13              Pass: 100%/20  | Total:  2h 31m | Avg:  7m 33s | Max: 26m 59s | Hits:  95%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 13m 52s | Avg:  4m 37s | Max:  4m 44s | Hits: 100%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 26s | Avg: 15m 26s | Max: 15m 26s | Hits:  98%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 26s | Avg: 12m 43s | Max: 12m 45s | Hits:  98%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 39m | Avg: 16m 38s | Max: 20m 10s | Hits:  98%/7056  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  4h 00m | Avg:  4m 42s | Max: 14m 22s | Hits: 100%/60180 
      🟩 GCC                Pass: 100%/55  | Total:  4h 36m | Avg:  5m 01s | Max: 26m 59s | Hits:  98%/64953 
      🟩 Intel              Pass: 100%/3   | Total: 13m 52s | Avg:  4m 37s | Max:  4m 44s | Hits: 100%/3549  
      🟩 MSVC               Pass: 100%/9   | Total:  2h 20m | Avg: 15m 37s | Max: 20m 10s | Hits:  98%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 11h 11m | Avg:  5m 41s | Max: 26m 59s | Hits:  99%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  7h 32m | Avg:  4m 34s | Max: 26m 59s | Hits:  99%/116850
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 46m | Avg:  9m 38s | Max: 20m 10s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 53m | Avg: 14m 10s | Max: 15m 35s | Hits:  99%/9444  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 11m 04s | Avg:  3m 41s | Max:  3m 56s | Hits:  99%/3543  
      🟩 90a                Pass: 100%/4   | Total: 13m 08s | Avg:  3m 17s | Max:  3m 26s | Hits:  99%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 34m | Avg:  5m 08s | Max: 26m 59s | Hits:  97%/35418 
      🟩 14                 Pass: 100%/34  | Total:  3h 16m | Avg:  5m 47s | Max: 18m 38s | Hits:  99%/40122 
      🟩 17                 Pass: 100%/33  | Total:  3h 06m | Avg:  5m 39s | Max: 19m 19s | Hits:  99%/38946 
      🟩 20                 Pass: 100%/21  | Total:  2h 13m | Avg:  6m 21s | Max: 20m 10s | Hits:  99%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

Copy link
Contributor

github-actions bot commented Jul 3, 2024

🟩 CI finished in 5h 28m: Pass: 100%/249 | Total: 1d 11h | Avg: 8m 31s | Max: 36m 34s | Hits: 98%/248439
  • 🟩 cub: Pass: 100%/131 | Total: 1d 00h | Avg: 11m 05s | Max: 36m 34s | Hits: 97%/109173

    🟩 cpu
      🟩 amd64              Pass: 100%/123 | Total: 22h 46m | Avg: 11m 06s | Max: 36m 34s | Hits:  97%/102357
      🟩 arm64              Pass: 100%/8   | Total:  1h 25m | Avg: 10m 43s | Max: 11m 28s | Hits:  96%/6816  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 05m | Avg:  4m 22s | Max: 14m 28s | Hits:  99%/11568 
      🟩 11.8               Pass: 100%/3   | Total: 32m 01s | Avg: 10m 40s | Max: 11m 16s | Hits:  96%/2556  
      🟩 12.4               Pass: 100%/113 | Total: 22h 34m | Avg: 11m 59s | Max: 36m 34s | Hits:  97%/95049 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  3m 51s | Hits: 100%/1408  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 05m | Avg:  4m 22s | Max: 14m 28s | Hits:  99%/11568 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 32m 01s | Avg: 10m 40s | Max: 11m 16s | Hits:  96%/2556  
      🟩 nvcc12.4           Pass: 100%/111 | Total: 22h 26m | Avg: 12m 08s | Max: 36m 34s | Hits:  97%/93641 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 20s | Avg:  3m 40s | Max:  3m 51s | Hits: 100%/1408  
      🟩 nvcc               Pass: 100%/129 | Total:  1d 00h | Avg: 11m 11s | Max: 36m 34s | Hits:  97%/107765
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 38m 43s | Avg:  6m 27s | Max:  9m 07s | Hits:  98%/4890  
      🟩 Clang10            Pass: 100%/3   | Total: 28m 41s | Avg:  9m 33s | Max: 10m 04s | Hits:  96%/2562  
      🟩 Clang11            Pass: 100%/4   | Total: 35m 35s | Avg:  8m 53s | Max:  9m 40s | Hits:  96%/3416  
      🟩 Clang12            Pass: 100%/4   | Total: 34m 34s | Avg:  8m 38s | Max:  9m 10s | Hits:  96%/3416  
      🟩 Clang13            Pass: 100%/4   | Total: 35m 30s | Avg:  8m 52s | Max:  9m 19s | Hits:  96%/3416  
      🟩 Clang14            Pass: 100%/4   | Total: 36m 32s | Avg:  9m 08s | Max:  9m 22s | Hits:  96%/3416  
      🟩 Clang15            Pass: 100%/4   | Total: 35m 31s | Avg:  8m 52s | Max:  9m 02s | Hits:  96%/3408  
      🟩 Clang16            Pass: 100%/4   | Total: 35m 16s | Avg:  8m 49s | Max:  9m 00s | Hits:  96%/3408  
      🟩 Clang17            Pass: 100%/26  | Total:  6h 44m | Avg: 15m 33s | Max: 36m 34s | Hits:  98%/21856 
      🟩 GCC6               Pass: 100%/2   | Total:  6m 48s | Avg:  3m 24s | Max:  3m 31s | Hits:  99%/1552  
      🟩 GCC7               Pass: 100%/6   | Total: 37m 34s | Avg:  6m 15s | Max:  9m 46s | Hits:  97%/4893  
      🟩 GCC8               Pass: 100%/6   | Total: 37m 43s | Avg:  6m 17s | Max:  9m 12s | Hits:  97%/4893  
      🟩 GCC9               Pass: 100%/6   | Total: 38m 22s | Avg:  6m 23s | Max:  9m 43s | Hits:  97%/4893  
      🟩 GCC10              Pass: 100%/4   | Total: 37m 04s | Avg:  9m 16s | Max: 10m 00s | Hits:  96%/3416  
      🟩 GCC11              Pass: 100%/7   | Total:  1h 07m | Avg:  9m 39s | Max: 11m 16s | Hits:  96%/5964  
      🟩 GCC12              Pass: 100%/4   | Total: 36m 19s | Avg:  9m 04s | Max:  9m 08s | Hits:  96%/3408  
      🟩 GCC13              Pass: 100%/28  | Total:  6h 53m | Avg: 14m 47s | Max: 28m 57s | Hits:  98%/23856 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 18s | Avg:  5m 06s | Max:  5m 13s | Hits: 100%/2340  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 28s | Avg: 14m 28s | Max: 14m 28s | Hits:  98%/695   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 23m 35s | Avg: 11m 47s | Max: 11m 58s | Hits:  98%/1390  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 38m 16s | Avg: 12m 45s | Max: 13m 11s | Hits:  98%/2085  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/59  | Total: 11h 25m | Avg: 11m 36s | Max: 36m 34s | Hits:  97%/49788 
      🟩 GCC                Pass: 100%/63  | Total: 11h 15m | Avg: 10m 43s | Max: 28m 57s | Hits:  97%/52875 
      🟩 Intel              Pass: 100%/3   | Total: 15m 18s | Avg:  5m 06s | Max:  5m 13s | Hits: 100%/2340  
      🟩 MSVC               Pass: 100%/6   | Total:  1h 16m | Avg: 12m 43s | Max: 14m 28s | Hits:  98%/4170  
    🟩 gpu
      🟩 v100               Pass: 100%/131 | Total:  1d 00h | Avg: 11m 05s | Max: 36m 34s | Hits:  97%/109173
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total: 13h 44m | Avg:  8m 19s | Max: 14m 28s | Hits:  97%/81909 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 28m | Avg: 18m 31s | Max: 22m 43s | Hits:  99%/6816  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 06m | Avg: 15m 46s | Max: 17m 15s | Hits:  99%/6816  
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 20m | Avg: 17m 33s | Max: 19m 21s | Hits:  99%/6816  
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 33m | Avg: 26m 37s | Max: 36m 34s | Hits:  99%/6816  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 32m 01s | Avg: 10m 40s | Max: 11m 16s | Hits:  96%/2556  
      🟩 90a                Pass: 100%/4   | Total: 23m 21s | Avg:  5m 50s | Max:  5m 59s | Hits:  96%/3408  
    🟩 std
      🟩 11                 Pass: 100%/34  | Total:  5h 48m | Avg: 10m 14s | Max: 24m 22s | Hits:  97%/28539 
      🟩 14                 Pass: 100%/37  | Total:  6h 33m | Avg: 10m 38s | Max: 28m 57s | Hits:  97%/30624 
      🟩 17                 Pass: 100%/36  | Total:  6h 31m | Avg: 10m 52s | Max: 26m 15s | Hits:  97%/29857 
      🟩 20                 Pass: 100%/24  | Total:  5h 18m | Avg: 13m 17s | Max: 36m 34s | Hits:  97%/20153 
    
  • 🟩 thrust: Pass: 100%/118 | Total: 11h 11m | Avg: 5m 41s | Max: 26m 59s | Hits: 99%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 10h 17m | Avg:  5m 36s | Max: 20m 10s | Hits:  99%/129822
      🟩 arm64              Pass: 100%/8   | Total: 54m 13s | Avg:  6m 46s | Max: 26m 59s | Hits:  90%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 01m | Avg:  4m 05s | Max: 15m 26s | Hits:  99%/17705 
      🟩 11.8               Pass: 100%/3   | Total: 11m 04s | Avg:  3m 41s | Max:  3m 56s | Hits:  99%/3543  
      🟩 12.4               Pass: 100%/100 | Total:  9h 59m | Avg:  5m 59s | Max: 26m 59s | Hits:  99%/118018
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 24s | Avg:  3m 42s | Max:  3m 49s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 01m | Avg:  4m 05s | Max: 15m 26s | Hits:  99%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 11m 04s | Avg:  3m 41s | Max:  3m 56s | Hits:  99%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total:  9h 51m | Avg:  6m 02s | Max: 26m 59s | Hits:  99%/115658
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 24s | Avg:  3m 42s | Max:  3m 49s | Hits: 100%/2360  
      🟩 nvcc               Pass: 100%/116 | Total: 11h 04m | Avg:  5m 43s | Max: 26m 59s | Hits:  99%/136906
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 22m 53s | Avg:  3m 48s | Max:  4m 20s | Hits: 100%/7080  
      🟩 Clang10            Pass: 100%/3   | Total: 12m 53s | Avg:  4m 17s | Max:  4m 27s | Hits: 100%/3540  
      🟩 Clang11            Pass: 100%/4   | Total: 15m 00s | Avg:  3m 45s | Max:  4m 08s | Hits: 100%/4720  
      🟩 Clang12            Pass: 100%/4   | Total: 14m 30s | Avg:  3m 37s | Max:  3m 42s | Hits: 100%/4720  
      🟩 Clang13            Pass: 100%/4   | Total: 15m 07s | Avg:  3m 46s | Max:  4m 11s | Hits: 100%/4720  
      🟩 Clang14            Pass: 100%/4   | Total: 14m 40s | Avg:  3m 40s | Max:  3m 45s | Hits: 100%/4720  
      🟩 Clang15            Pass: 100%/4   | Total: 14m 59s | Avg:  3m 44s | Max:  3m 54s | Hits: 100%/4720  
      🟩 Clang16            Pass: 100%/4   | Total: 15m 04s | Avg:  3m 46s | Max:  3m 56s | Hits: 100%/4720  
      🟩 Clang17            Pass: 100%/18  | Total:  1h 55m | Avg:  6m 24s | Max: 14m 22s | Hits: 100%/21240 
      🟩 GCC6               Pass: 100%/2   | Total:  6m 46s | Avg:  3m 23s | Max:  3m 49s | Hits:  99%/2360  
      🟩 GCC7               Pass: 100%/6   | Total: 20m 23s | Avg:  3m 23s | Max:  3m 53s | Hits:  99%/7086  
      🟩 GCC8               Pass: 100%/6   | Total: 19m 37s | Avg:  3m 16s | Max:  3m 33s | Hits:  99%/7086  
      🟩 GCC9               Pass: 100%/6   | Total: 21m 33s | Avg:  3m 35s | Max:  4m 13s | Hits:  99%/7086  
      🟩 GCC10              Pass: 100%/4   | Total: 15m 07s | Avg:  3m 46s | Max:  4m 06s | Hits:  99%/4724  
      🟩 GCC11              Pass: 100%/7   | Total: 26m 44s | Avg:  3m 49s | Max:  4m 08s | Hits:  99%/8267  
      🟩 GCC12              Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 03s | Hits:  99%/4724  
      🟩 GCC13              Pass: 100%/20  | Total:  2h 31m | Avg:  7m 33s | Max: 26m 59s | Hits:  95%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 13m 52s | Avg:  4m 37s | Max:  4m 44s | Hits: 100%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 26s | Avg: 15m 26s | Max: 15m 26s | Hits:  98%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 26s | Avg: 12m 43s | Max: 12m 45s | Hits:  98%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 39m | Avg: 16m 38s | Max: 20m 10s | Hits:  98%/7056  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  4h 00m | Avg:  4m 42s | Max: 14m 22s | Hits: 100%/60180 
      🟩 GCC                Pass: 100%/55  | Total:  4h 36m | Avg:  5m 01s | Max: 26m 59s | Hits:  98%/64953 
      🟩 Intel              Pass: 100%/3   | Total: 13m 52s | Avg:  4m 37s | Max:  4m 44s | Hits: 100%/3549  
      🟩 MSVC               Pass: 100%/9   | Total:  2h 20m | Avg: 15m 37s | Max: 20m 10s | Hits:  98%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 11h 11m | Avg:  5m 41s | Max: 26m 59s | Hits:  99%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  7h 32m | Avg:  4m 34s | Max: 26m 59s | Hits:  99%/116850
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 46m | Avg:  9m 38s | Max: 20m 10s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 53m | Avg: 14m 10s | Max: 15m 35s | Hits:  99%/9444  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 11m 04s | Avg:  3m 41s | Max:  3m 56s | Hits:  99%/3543  
      🟩 90a                Pass: 100%/4   | Total: 13m 08s | Avg:  3m 17s | Max:  3m 26s | Hits:  99%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 34m | Avg:  5m 08s | Max: 26m 59s | Hits:  97%/35418 
      🟩 14                 Pass: 100%/34  | Total:  3h 16m | Avg:  5m 47s | Max: 18m 38s | Hits:  99%/40122 
      🟩 17                 Pass: 100%/33  | Total:  3h 06m | Avg:  5m 39s | Max: 19m 19s | Hits:  99%/38946 
      🟩 20                 Pass: 100%/21  | Total:  2h 13m | Avg:  6m 21s | Max: 20m 10s | Hits:  99%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber merged commit c64da31 into NVIDIA:main Jul 4, 2024
261 of 264 checks passed
@bernhardmgruber bernhardmgruber deleted the babelstream branch July 4, 2024 00:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
thrust For all items related to Thrust.
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

4 participants