Skip to content

Implement cuda::overflow_cast#4151

Merged
miscco merged 6 commits intoNVIDIA:mainfrom
davebayer:overflow_cast
Mar 18, 2025
Merged

Implement cuda::overflow_cast#4151
miscco merged 6 commits intoNVIDIA:mainfrom
davebayer:overflow_cast

Conversation

@davebayer
Copy link
Contributor

This PR implements a cuda::overflow_cast utility for casting integers with overflow detection.

It is a similar utility to std::saturate_cast which resides in <numeric> header, so I've put it in the <cuda/numeric> header, too.

@davebayer davebayer requested a review from a team as a code owner March 15, 2025 08:42
@davebayer davebayer requested a review from griwes March 15, 2025 08:42
@github-project-automation github-project-automation bot moved this to Todo in CCCL Mar 15, 2025
@copy-pr-bot
Copy link
Contributor

copy-pr-bot bot commented Mar 15, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Mar 15, 2025
@davebayer davebayer changed the title implement cuda::overflow_cast Implement cuda::overflow_cast Mar 15, 2025
@miscco
Copy link
Contributor

miscco commented Mar 15, 2025

/ok to test

}

// 4. Test structured bindings
#if __cpp_structured_bindings >= 201606L
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is that failing somewhere?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is not, I just thought I cannot count on having structured bindings

@github-actions
Copy link
Contributor

🟨 CI finished in 1h 09m: Pass: 74%/162 | Total: 21h 28m | Avg: 7m 57s | Max: 1h 08m | Hits: 98%/146285
  • 🟨 libcudacxx: Pass: 2%/43 | Total: 2h 39m | Avg: 3m 43s | Max: 21m 32s

    🟨 jobs
      🟥 Build              Pass:   0%/37  | Total:  2h 01m | Avg:  3m 16s | Max: 13m 18s
      🟥 NVRTC              Pass:   0%/2   | Total: 36m 43s | Avg: 18m 21s | Max: 21m 32s
      🟥 Test               Pass:   0%/3  
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟨 cpu
      🟨 amd64              Pass:   2%/41  | Total:  2h 36m | Avg:  3m 48s | Max: 21m 32s
      🟥 arm64              Pass:   0%/2   | Total:  3m 38s | Avg:  1m 49s | Max:  1m 57s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 18m 24s | Avg:  3m 40s | Max: 10m 24s
      🟥 12.6               Pass:   0%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 03s
      🟨 12.8               Pass:   2%/36  | Total:  2h 11m | Avg:  3m 39s | Max: 21m 32s
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  4m 39s | Avg:  2m 19s | Max:  2m 22s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 18m 24s | Avg:  3m 40s | Max: 10m 24s
      🟥 nvcc12.6           Pass:   0%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 03s
      🟨 nvcc12.8           Pass:   2%/34  | Total:  2h 06m | Avg:  3m 43s | Max: 21m 32s
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  4m 39s | Avg:  2m 19s | Max:  2m 22s
      🟨 nvcc               Pass:   2%/41  | Total:  2h 35m | Avg:  3m 47s | Max: 21m 32s
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total:  8m 45s | Avg:  2m 11s | Max:  2m 17s
      🟥 Clang15            Pass:   0%/2   | Total:  4m 23s | Avg:  2m 11s | Max:  2m 12s
      🟥 Clang16            Pass:   0%/2   | Total:  4m 34s | Avg:  2m 17s | Max:  2m 18s
      🟥 Clang17            Pass:   0%/2   | Total:  4m 42s | Avg:  2m 21s | Max:  2m 24s
      🟥 Clang18            Pass:   0%/6   | Total: 11m 14s | Avg:  1m 52s | Max:  2m 22s
      🟥 GCC7               Pass:   0%/2   | Total:  4m 06s | Avg:  2m 03s | Max:  2m 12s
      🟥 GCC8               Pass:   0%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
      🟥 GCC9               Pass:   0%/2   | Total:  3m 48s | Avg:  1m 54s | Max:  1m 59s
      🟥 GCC10              Pass:   0%/2   | Total:  3m 58s | Avg:  1m 59s | Max:  2m 02s
      🟥 GCC11              Pass:   0%/2   | Total:  4m 09s | Avg:  2m 04s | Max:  2m 08s
      🟥 GCC12              Pass:   0%/2   | Total:  4m 05s | Avg:  2m 02s | Max:  2m 03s
      🟨 GCC13              Pass:  10%/10  | Total: 48m 55s | Avg:  4m 53s | Max: 21m 32s
      🟥 MSVC14.29          Pass:   0%/2   | Total: 20m 51s | Avg: 10m 25s | Max: 10m 27s
      🟥 MSVC14.42          Pass:   0%/2   | Total: 24m 27s | Avg: 12m 13s | Max: 13m 18s
      🟥 NVHPC25.1          Pass:   0%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 03s
    🟨 cxx_family
      🟥 Clang              Pass:   0%/16  | Total: 33m 38s | Avg:  2m 06s | Max:  2m 24s
      🟨 GCC                Pass:   4%/21  | Total:  1h 11m | Avg:  3m 22s | Max: 21m 32s
      🟥 MSVC               Pass:   0%/4   | Total: 45m 18s | Avg: 11m 19s | Max: 13m 18s
      🟥 NVHPC              Pass:   0%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 03s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  2m 04s | Avg:  1m 02s | Max:  2m 04s
      🟨 rtx2080            Pass:   2%/41  | Total:  2h 37m | Avg:  3m 51s | Max: 21m 32s
    🟥 sm
      🟥 75                 Pass:   0%/2   | Total: 36m 43s | Avg: 18m 21s | Max: 21m 32s
      🟥 90                 Pass:   0%/2   | Total:  2m 04s | Avg:  1m 02s | Max:  2m 04s
      🟥 90;90a;100         Pass:   0%/1   | Total:  2m 13s | Avg:  2m 13s | Max:  2m 13s
    🟥 std
      🟥 17                 Pass:   0%/21  | Total:  1h 26m | Avg:  4m 06s | Max: 15m 11s
      🟥 20                 Pass:   0%/21  | Total:  1h 11m | Avg:  3m 24s | Max: 21m 32s
    
  • 🟩 cub: Pass: 100%/45 | Total: 8h 36m | Avg: 11m 28s | Max: 49m 06s | Hits: 98%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  8h 24m | Avg: 11m 44s | Max: 49m 06s | Hits:  98%/51178 
      🟩 arm64              Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 56s | Hits:  99%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 29s | Avg:  8m 17s | Max: 17m 42s | Hits:  99%/5922  
      🟩 12.6               Pass: 100%/2   | Total: 21m 16s | Avg: 10m 38s | Max: 10m 40s | Hits:  98%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  7h 33m | Avg: 11m 55s | Max: 49m 06s | Hits:  98%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 40s | Avg:  4m 50s | Max:  4m 55s | Hits:  99%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 29s | Avg:  8m 17s | Max: 17m 42s | Hits:  99%/5922  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 21m 16s | Avg: 10m 38s | Max: 10m 40s | Hits:  98%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  7h 23m | Avg: 12m 19s | Max: 49m 06s | Hits:  98%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 40s | Avg:  4m 50s | Max:  4m 55s | Hits:  99%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  8h 26m | Avg: 11m 46s | Max: 49m 06s | Hits:  98%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 24m 21s | Avg:  6m 05s | Max:  6m 32s | Hits: 100%/4880  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 48s | Avg:  6m 24s | Max:  6m 36s | Hits: 100%/2436  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 51s | Avg:  6m 25s | Max:  6m 38s | Hits: 100%/2436  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 27s | Hits: 100%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 15m | Avg: 10m 46s | Max: 24m 56s | Hits:  99%/8194  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 06s | Avg:  6m 03s | Max:  6m 17s | Hits:  99%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 17s | Avg:  6m 17s | Max:  6m 17s | Hits:  99%/1220  
      🟩 GCC9               Pass: 100%/2   | Total: 13m 04s | Avg:  6m 32s | Max:  6m 38s | Hits:  99%/2440  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 22s | Avg:  6m 41s | Max:  6m 46s | Hits:  99%/2440  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 32s | Avg:  6m 46s | Max:  6m 52s | Hits:  99%/2436  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 29s | Avg:  7m 14s | Max:  7m 25s | Hits:  99%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 30m | Avg: 19m 08s | Max: 49m 06s | Hits:  96%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 36m 09s | Avg: 18m 04s | Max: 18m 27s | Hits:  99%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 37m 15s | Avg: 18m 37s | Max: 18m 38s | Hits:  99%/2084  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 21m 16s | Avg: 10m 38s | Max: 10m 40s | Hits:  98%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 17m | Avg:  8m 07s | Max: 24m 56s | Hits:  99%/20382 
      🟩 GCC                Pass: 100%/22  | Total:  4h 43m | Avg: 12m 53s | Max: 49m 06s | Hits:  98%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 13m | Avg: 18m 21s | Max: 18m 38s | Hits:  99%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 16s | Avg: 10m 38s | Max: 10m 40s | Hits:  98%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 53m 01s | Avg: 17m 40s | Max: 23m 52s | Hits:  99%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  5h 12m | Avg:  9m 12s | Max: 49m 06s | Hits:  98%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 30m | Avg: 18m 45s | Max: 24m 56s | Hits:  99%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 32m | Avg:  8m 59s | Max: 49m 06s | Hits:  98%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 32s | Avg: 21m 32s | Max: 21m 32s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 51s | Avg: 18m 51s | Max: 18m 51s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 01s | Max: 24m 56s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 10m | Avg: 23m 39s | Max: 24m 44s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 53m 01s | Avg: 17m 40s | Max: 23m 52s | Hits:  99%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total: 49m 06s | Avg: 49m 06s | Max: 49m 06s | Hits:  69%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 46m | Avg:  8m 19s | Max: 18m 37s | Hits:  99%/23591 
      🟩 20                 Pass: 100%/25  | Total:  5h 49m | Avg: 13m 58s | Max: 49m 06s | Hits:  98%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 07m | Avg: 8m 09s | Max: 27m 05s | Hits: 99%/80541

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 16m 31s | Avg:  8m 15s | Max: 10m 59s | Hits:  99%/3582  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  5h 57m | Avg:  8m 18s | Max: 27m 05s | Hits:  99%/76960 
      🟩 arm64              Pass: 100%/2   | Total:  9m 46s | Avg:  4m 53s | Max:  5m 08s | Hits:  99%/3581  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 38m 28s | Avg:  7m 41s | Max: 19m 09s | Hits:  99%/8946  
      🟩 12.6               Pass: 100%/2   | Total: 30m 10s | Avg: 15m 05s | Max: 16m 16s | Hits:  99%/3580  
      🟩 12.8               Pass: 100%/38  | Total:  4h 58m | Avg:  7m 51s | Max: 27m 05s | Hits:  99%/68015 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  4m 56s | Hits: 100%/3580  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 38m 28s | Avg:  7m 41s | Max: 19m 09s | Hits:  99%/8946  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 30m 10s | Avg: 15m 05s | Max: 16m 16s | Hits:  99%/3580  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  4h 48m | Avg:  8m 01s | Max: 27m 05s | Hits:  99%/64435 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  4m 56s | Hits: 100%/3580  
      🟩 nvcc               Pass: 100%/43  | Total:  5h 57m | Avg:  8m 18s | Max: 27m 05s | Hits:  99%/76961 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 22s | Avg:  5m 05s | Max:  5m 24s | Hits: 100%/7160  
      🟩 Clang15            Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 31s | Hits: 100%/3580  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 47s | Hits: 100%/3580  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 34s | Hits: 100%/3580  
      🟩 Clang18            Pass: 100%/7   | Total: 43m 04s | Avg:  6m 09s | Max: 10m 13s | Hits: 100%/12530 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 10s | Avg:  5m 05s | Max:  5m 24s | Hits:  99%/3582  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 15s | Avg:  5m 15s | Max:  5m 15s | Hits:  99%/1791  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  5m 41s | Hits:  99%/3582  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 15s | Avg:  5m 37s | Max:  5m 38s | Hits:  99%/3582  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 01s | Avg:  5m 30s | Max:  5m 37s | Hits:  99%/3582  
      🟩 GCC12              Pass: 100%/2   | Total: 11m 51s | Avg:  5m 55s | Max:  5m 58s | Hits:  99%/3582  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 15m | Avg:  7m 30s | Max: 11m 25s | Hits:  99%/17910 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 37m 40s | Avg: 18m 50s | Max: 19m 09s | Hits:  99%/3568  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 08m | Avg: 22m 43s | Max: 27m 05s | Hits:  99%/5352  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 30m 10s | Avg: 15m 05s | Max: 16m 16s | Hits:  99%/3580  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 36m | Avg:  5m 39s | Max: 10m 13s | Hits: 100%/30430 
      🟩 GCC                Pass: 100%/21  | Total:  2h 15m | Avg:  6m 26s | Max: 11m 25s | Hits:  99%/37611 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 45m | Avg: 21m 10s | Max: 27m 05s | Hits:  99%/8920  
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 10s | Avg: 15m 05s | Max: 16m 16s | Hits:  99%/3580  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 15m 59s | Avg:  7m 59s | Max: 11m 16s | Hits:  99%/3582  
      🟩 rtx2080            Pass: 100%/33  | Total:  3h 57m | Avg:  7m 11s | Max: 19m 33s | Hits:  99%/59066 
      🟩 rtx4090            Pass: 100%/10  | Total:  1h 54m | Avg: 11m 24s | Max: 27m 05s | Hits:  99%/17893 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  4h 40m | Avg:  7m 22s | Max: 21m 32s | Hits:  99%/68013 
      🟩 TestCPU            Pass: 100%/3   | Total: 43m 15s | Avg: 14m 25s | Max: 27m 05s | Hits:  99%/5365  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 53s | Avg: 10m 58s | Max: 11m 25s | Hits:  99%/7163  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 15m 59s | Avg:  7m 59s | Max: 11m 16s | Hits:  99%/3582  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 46s | Avg:  5m 46s | Max:  5m 46s | Hits:  99%/1791  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 39m | Avg:  7m 58s | Max: 19m 33s | Hits:  99%/35791 
      🟩 20                 Pass: 100%/23  | Total:  3h 11m | Avg:  8m 19s | Max: 27m 05s | Hits:  99%/41168 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 21m | Avg: 6m 25s | Max: 14m 17s | Hits: 92%/11810

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 48m | Avg:  6m 01s | Max: 14m 17s | Hits:  98%/9478  
      🟩 arm64              Pass: 100%/4   | Total: 32m 56s | Avg:  8m 14s | Max: 13m 56s | Hits:  70%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  8m 31s | Avg:  8m 31s | Max:  8m 31s | Hits:  95%/281   
      🟩 12.6               Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max:  6m 09s | Hits:  96%/750   
      🟩 12.8               Pass: 100%/19  | Total:  2h 00m | Avg:  6m 20s | Max: 14m 17s | Hits:  92%/10779 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  8m 31s | Avg:  8m 31s | Max:  8m 31s | Hits:  95%/281   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max:  6m 09s | Hits:  96%/750   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  2h 00m | Avg:  6m 20s | Max: 14m 17s | Hits:  92%/10779 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 21m | Avg:  6m 25s | Max: 14m 17s | Hits:  92%/11810 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits: 100%/585   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s | Hits: 100%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s | Hits: 100%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s | Hits: 100%/583   
      🟩 Clang18            Pass: 100%/4   | Total: 43m 30s | Avg: 10m 52s | Max: 13m 56s | Hits:  70%/2332  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s | Hits:  99%/585   
      🟩 GCC11              Pass: 100%/1   | Total:  4m 18s | Avg:  4m 18s | Max:  4m 18s | Hits:  95%/583   
      🟩 GCC12              Pass: 100%/2   | Total: 16m 47s | Avg:  8m 23s | Max: 13m 31s | Hits:  99%/1166  
      🟩 GCC13              Pass: 100%/6   | Total: 29m 59s | Avg:  4m 59s | Max: 14m 17s | Hits:  97%/3498  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 31s | Avg:  8m 31s | Max:  8m 31s | Hits:  95%/281   
      🟩 MSVC14.42          Pass: 100%/1   | Total:  9m 00s | Avg:  9m 00s | Max:  9m 00s | Hits:  95%/281   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max:  6m 09s | Hits:  96%/750   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 57m 22s | Avg:  7m 10s | Max: 13m 56s | Hits:  85%/4666  
      🟩 GCC                Pass: 100%/10  | Total: 54m 13s | Avg:  5m 25s | Max: 14m 17s | Hits:  98%/5832  
      🟩 MSVC               Pass: 100%/2   | Total: 17m 31s | Avg:  8m 45s | Max:  9m 00s | Hits:  95%/562   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max:  6m 09s | Hits:  96%/750   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 52s | Avg:  8m 56s | Max: 14m 17s | Hits:  97%/1166  
      🟩 rtx2080            Pass: 100%/20  | Total:  2h 03m | Avg:  6m 10s | Max: 13m 56s | Hits:  92%/10644 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 40m | Avg:  5m 18s | Max: 13m 56s | Hits:  91%/10061 
      🟩 Test               Pass: 100%/3   | Total: 40m 41s | Avg: 13m 33s | Max: 14m 17s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 21m 26s | Avg:  7m 08s | Max: 14m 17s | Hits:  96%/1749  
      🟩 90a                Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s | Hits:  99%/583   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 25m 48s | Avg:  6m 27s | Max: 13m 19s | Hits:  81%/2124  
      🟩 20                 Pass: 100%/18  | Total:  1h 55m | Avg:  6m 25s | Max: 14m 17s | Hits:  95%/9686  
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 23s | Avg: 4m 20s | Max: 4m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 32s | Avg:  4m 46s | Max:  4m 47s
      🟩 arm64              Pass: 100%/2   | Total:  7m 51s | Avg:  3m 55s | Max:  4m 08s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  4m 47s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 55s | Avg:  4m 27s | Max:  4m 47s
      🟩 20                 Pass: 100%/2   | Total:  8m 28s | Avg:  4m 14s | Max:  4m 45s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 36s | Avg: 8m 48s | Max: 15m 32s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 15m 32s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 15m 32s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 15m 32s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 15m 32s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 15m 32s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 15m 32s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 15m 32s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 15m 32s | Avg: 15m 32s | Max: 15m 32s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 08m | Avg: 1h 08m | Max: 1h 08m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@bernhardmgruber
Copy link
Contributor

FYI: This seems related to #2278. The API proposed here seems to not automatically error out if an overflow is detected (which is alright), but I definitely want #2278 as well, a narrowing cast that will trap/throw/assert etc. It's too easy to forget not to check a flag, if not done automatically.

@miscco
Copy link
Contributor

miscco commented Mar 17, 2025

/ok to test

@github-actions
Copy link
Contributor

🟨 CI finished in 1h 16m: Pass: 98%/162 | Total: 1d 00h | Avg: 9m 07s | Max: 1h 05m | Hits: 94%/252319
  • 🟨 libcudacxx: Pass: 95%/43 | Total: 6h 53m | Avg: 9m 37s | Max: 32m 00s | Hits: 88%/106034

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/41  | Total:  6h 46m | Avg:  9m 54s | Max: 32m 00s | Hits:  87%/100233
      🟩 arm64              Pass: 100%/2   | Total:  7m 32s | Avg:  3m 46s | Max:  4m 00s | Hits:  99%/5801  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total: 37m 03s | Avg:  7m 24s | Max: 21m 27s | Hits:  99%/14134 
      🟩 12.6               Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 32m 00s | Hits:  65%/5748  
      🔍 12.8               Pass:  94%/36  | Total:  5h 36m | Avg:  9m 20s | Max: 27m 40s | Hits:  88%/86152 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 42m 43s | Avg: 21m 21s | Max: 22m 34s | Hits:  27%/5762  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 37m 03s | Avg:  7m 24s | Max: 21m 27s | Hits:  99%/14134 
      🟩 nvcc12.6           Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 32m 00s | Hits:  65%/5748  
      🔍 nvcc12.8           Pass:  94%/34  | Total:  4h 53m | Avg:  8m 37s | Max: 27m 40s | Hits:  92%/80390 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 42m 43s | Avg: 21m 21s | Max: 22m 34s | Hits:  27%/5762  
      🔍 nvcc               Pass:  95%/41  | Total:  6h 11m | Avg:  9m 03s | Max: 32m 00s | Hits:  91%/100272
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 34m 05s | Avg:  8m 31s | Max: 21m 07s | Hits:  83%/11492 
      🟩 Clang15            Pass: 100%/2   | Total:  9m 14s | Avg:  4m 37s | Max:  4m 58s | Hits:  99%/5758  
      🟩 Clang16            Pass: 100%/2   | Total: 29m 47s | Avg: 14m 53s | Max: 24m 53s | Hits:  71%/5758  
      🟩 Clang17            Pass: 100%/2   | Total:  9m 04s | Avg:  4m 32s | Max:  4m 39s | Hits:  99%/5758  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 04m | Avg: 10m 45s | Max: 22m 34s | Hits:  70%/14420 
      🟩 GCC7               Pass: 100%/2   | Total:  7m 00s | Avg:  3m 30s | Max:  3m 41s | Hits:  99%/5696  
      🟩 GCC8               Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s | Hits:  98%/2858  
      🟩 GCC9               Pass: 100%/2   | Total:  7m 31s | Avg:  3m 45s | Max:  3m 55s | Hits:  99%/5708  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 59s | Avg:  3m 59s | Max:  4m 03s | Hits:  98%/5764  
      🟩 GCC11              Pass: 100%/2   | Total:  7m 52s | Avg:  3m 56s | Max:  4m 02s | Hits:  99%/5760  
      🟩 GCC12              Pass: 100%/2   | Total:  8m 29s | Avg:  4m 14s | Max:  4m 34s | Hits:  98%/5760  
      🔍 GCC13              Pass:  80%/10  | Total:  1h 41m | Avg: 10m 10s | Max: 27m 40s | Hits:  85%/14641 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 40m 51s | Avg: 20m 25s | Max: 21m 27s | Hits:  98%/5424  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 40m 58s | Avg: 20m 29s | Max: 21m 01s | Hits:  98%/5489  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 32m 00s | Hits:  65%/5748  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/16  | Total:  2h 26m | Avg:  9m 10s | Max: 24m 53s | Hits:  81%/43186 
      🔍 GCC                Pass:  90%/21  | Total:  2h 24m | Avg:  6m 53s | Max: 27m 40s | Hits:  94%/46187 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 21m | Avg: 20m 27s | Max: 21m 27s | Hits:  98%/10913 
      🟩 NVHPC              Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 32m 00s | Hits:  65%/5748  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 17m 28s | Avg:  8m 44s | Max: 13m 31s | Hits:  99%/2990  
      🔍 rtx2080            Pass:  95%/41  | Total:  6h 36m | Avg:  9m 39s | Max: 32m 00s | Hits:  88%/103044
    🚨 jobs: NVRTC 🚨
      🟩 Build              Pass: 100%/37  | Total:  5h 46m | Avg:  9m 22s | Max: 32m 00s | Hits:  88%/106034
      🔥 NVRTC              Pass:   0%/2   | Total: 34m 09s | Avg: 17m 04s | Max: 18m 14s
      🟩 Test               Pass: 100%/3   | Total: 31m 02s | Avg: 10m 20s | Max: 13m 31s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 00s | Avg:  2m 00s | Max:  2m 00s
    🚨 sm: 75 🚨
      🔥 75                 Pass:   0%/2   | Total: 34m 09s | Avg: 17m 04s | Max: 18m 14s
      🟩 90                 Pass: 100%/2   | Total: 17m 28s | Avg:  8m 44s | Max: 13m 31s | Hits:  99%/2990  
      🟩 90;90a;100         Pass: 100%/1   | Total: 27m 40s | Avg: 27m 40s | Max: 27m 40s | Hits:  34%/2990  
    🟨 std
      🟨 17                 Pass:  95%/21  | Total:  3h 10m | Avg:  9m 05s | Max: 32m 00s | Hits:  92%/56691 
      🟨 20                 Pass:  95%/21  | Total:  3h 40m | Avg: 10m 30s | Max: 27m 40s | Hits:  84%/49343 
    
  • 🟩 cub: Pass: 100%/45 | Total: 7h 50m | Avg: 10m 27s | Max: 25m 00s | Hits: 99%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  7h 39m | Avg: 10m 40s | Max: 25m 00s | Hits:  99%/51178 
      🟩 arm64              Pass: 100%/2   | Total: 11m 20s | Avg:  5m 40s | Max:  5m 56s | Hits:  99%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 31s | Avg:  8m 18s | Max: 18m 22s | Hits:  99%/5922  
      🟩 12.6               Pass: 100%/2   | Total: 21m 29s | Avg: 10m 44s | Max: 10m 45s | Hits:  98%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  6h 47m | Avg: 10m 43s | Max: 25m 00s | Hits:  99%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  5m 04s | Hits: 100%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 31s | Avg:  8m 18s | Max: 18m 22s | Hits:  99%/5922  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 21m 29s | Avg: 10m 44s | Max: 10m 45s | Hits:  98%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 37m | Avg: 11m 02s | Max: 25m 00s | Hits:  99%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  5m 04s | Hits: 100%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  7h 40m | Avg: 10m 42s | Max: 25m 00s | Hits:  99%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 51s | Avg:  5m 57s | Max:  6m 20s | Hits: 100%/4880  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 53s | Avg:  6m 26s | Max:  6m 28s | Hits: 100%/2436  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 43s | Avg:  6m 21s | Max:  6m 24s | Hits: 100%/2436  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 27s | Avg:  6m 13s | Max:  6m 18s | Hits: 100%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 13m | Avg: 10m 26s | Max: 23m 52s | Hits: 100%/8194  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 26s | Avg:  6m 13s | Max:  6m 50s | Hits:  99%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 05s | Avg:  6m 05s | Max:  6m 05s | Hits:  99%/1220  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 49s | Avg:  6m 24s | Max:  6m 33s | Hits:  99%/2440  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 03s | Avg:  7m 01s | Max:  7m 02s | Hits:  99%/2440  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 03s | Avg:  6m 31s | Max:  6m 44s | Hits:  99%/2436  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 24s | Avg:  7m 12s | Max:  7m 24s | Hits:  99%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 45m | Avg: 15m 01s | Max: 25m 00s | Hits:  99%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 37m 11s | Avg: 18m 35s | Max: 18m 49s | Hits:  99%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 38m 33s | Avg: 19m 16s | Max: 19m 19s | Hits:  99%/2084  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 21m 29s | Avg: 10m 44s | Max: 10m 45s | Hits:  98%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 15m | Avg:  7m 56s | Max: 23m 52s | Hits: 100%/20382 
      🟩 GCC                Pass: 100%/22  | Total:  3h 58m | Avg: 10m 49s | Max: 25m 00s | Hits:  99%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 15m | Avg: 18m 56s | Max: 19m 19s | Hits:  99%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 29s | Avg: 10m 44s | Max: 10m 45s | Hits:  98%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 49m 22s | Avg: 16m 27s | Max: 23m 33s | Hits:  99%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  4h 33m | Avg:  8m 03s | Max: 19m 19s | Hits:  99%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 27m | Avg: 18m 23s | Max: 25m 00s | Hits:  99%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 51m | Avg:  7m 53s | Max: 19m 19s | Hits:  99%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 32s | Avg: 21m 32s | Max: 21m 32s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 30s | Avg: 18m 30s | Max: 18m 30s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 10m | Avg: 23m 37s | Max: 23m 52s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 07m | Avg: 22m 32s | Max: 25m 00s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 49m 22s | Avg: 16m 27s | Max: 23m 33s | Hits:  99%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 15s | Avg:  7m 15s | Max:  7m 15s | Hits:  99%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 49m | Avg:  8m 27s | Max: 19m 14s | Hits:  99%/23591 
      🟩 20                 Pass: 100%/25  | Total:  5h 01m | Avg: 12m 02s | Max: 25m 00s | Hits:  99%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 14m | Avg: 8m 19s | Max: 30m 12s | Hits: 99%/80541

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 28s | Avg:  8m 44s | Max: 11m 05s | Hits:  99%/3582  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 04m | Avg:  8m 29s | Max: 30m 12s | Hits:  99%/76960 
      🟩 arm64              Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  5m 06s | Hits:  99%/3581  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 40m 32s | Avg:  8m 06s | Max: 20m 55s | Hits:  99%/8946  
      🟩 12.6               Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 14m 58s | Hits:  99%/3580  
      🟩 12.8               Pass: 100%/38  | Total:  5h 04m | Avg:  8m 01s | Max: 30m 12s | Hits:  99%/68015 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  4m 59s | Hits: 100%/3580  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 40m 32s | Avg:  8m 06s | Max: 20m 55s | Hits:  99%/8946  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 14m 58s | Hits:  99%/3580  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  4h 54m | Avg:  8m 11s | Max: 30m 12s | Hits:  99%/64435 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  4m 59s | Hits: 100%/3580  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 04m | Avg:  8m 28s | Max: 30m 12s | Hits:  99%/76961 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 18s | Avg:  5m 04s | Max:  5m 21s | Hits: 100%/7160  
      🟩 Clang15            Pass: 100%/2   | Total: 10m 59s | Avg:  5m 29s | Max:  5m 36s | Hits: 100%/3580  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 47s | Hits: 100%/3580  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 46s | Hits: 100%/3580  
      🟩 Clang18            Pass: 100%/7   | Total: 43m 21s | Avg:  6m 11s | Max: 10m 18s | Hits: 100%/12530 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 31s | Avg:  5m 15s | Max:  5m 36s | Hits:  99%/3582  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 09s | Avg:  5m 09s | Max:  5m 09s | Hits:  99%/1791  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 57s | Avg:  5m 28s | Max:  5m 52s | Hits:  99%/3582  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 16s | Avg:  5m 38s | Max:  5m 51s | Hits:  99%/3582  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 30s | Avg:  5m 45s | Max:  5m 47s | Hits:  99%/3582  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 27s | Avg:  6m 13s | Max:  6m 19s | Hits:  99%/3582  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 16m | Avg:  7m 36s | Max: 11m 18s | Hits:  99%/17910 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 40m 22s | Avg: 20m 11s | Max: 20m 55s | Hits:  99%/3568  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 09m | Avg: 23m 17s | Max: 30m 12s | Hits:  99%/5352  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 14m 58s | Hits:  99%/3580  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 37m | Avg:  5m 43s | Max: 10m 18s | Hits: 100%/30430 
      🟩 GCC                Pass: 100%/21  | Total:  2h 17m | Avg:  6m 33s | Max: 11m 18s | Hits:  99%/37611 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 50m | Avg: 22m 02s | Max: 30m 12s | Hits:  99%/8920  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 14m 58s | Hits:  99%/3580  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 05s | Avg:  8m 02s | Max: 11m 04s | Hits:  99%/3582  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 02m | Avg:  7m 20s | Max: 20m 55s | Hits:  99%/59066 
      🟩 rtx4090            Pass: 100%/10  | Total:  1h 56m | Avg: 11m 38s | Max: 30m 12s | Hits:  99%/17893 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  4h 45m | Avg:  7m 30s | Max: 20m 55s | Hits:  99%/68013 
      🟩 TestCPU            Pass: 100%/3   | Total: 45m 21s | Avg: 15m 07s | Max: 30m 12s | Hits:  99%/5365  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 45s | Avg: 10m 56s | Max: 11m 18s | Hits:  99%/7163  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 05s | Avg:  8m 02s | Max: 11m 04s | Hits:  99%/3582  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 54s | Avg:  5m 54s | Max:  5m 54s | Hits:  99%/1791  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 43m | Avg:  8m 09s | Max: 20m 55s | Hits:  99%/35791 
      🟩 20                 Pass: 100%/23  | Total:  3h 13m | Avg:  8m 25s | Max: 30m 12s | Hits:  99%/41168 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 00m | Avg: 5m 28s | Max: 16m 42s | Hits: 99%/11810

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 48m | Avg:  6m 03s | Max: 16m 42s | Hits:  99%/9478  
      🟩 arm64              Pass: 100%/4   | Total: 11m 23s | Avg:  2m 50s | Max:  2m 55s | Hits:  99%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  8m 24s | Avg:  8m 24s | Max:  8m 24s | Hits:  95%/281   
      🟩 12.6               Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 38s | Hits:  96%/750   
      🟩 12.8               Pass: 100%/19  | Total:  1h 40m | Avg:  5m 18s | Max: 16m 42s | Hits:  99%/10779 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  8m 24s | Avg:  8m 24s | Max:  8m 24s | Hits:  95%/281   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 38s | Hits:  96%/750   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 40m | Avg:  5m 18s | Max: 16m 42s | Hits:  99%/10779 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 00m | Avg:  5m 28s | Max: 16m 42s | Hits:  99%/11810 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s | Hits: 100%/585   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 38s | Avg:  3m 38s | Max:  3m 38s | Hits: 100%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s | Hits: 100%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s | Hits: 100%/583   
      🟩 Clang18            Pass: 100%/4   | Total: 26m 00s | Avg:  6m 30s | Max: 16m 42s | Hits: 100%/2332  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 33s | Avg:  3m 33s | Max:  3m 33s | Hits:  99%/585   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s | Hits:  99%/583   
      🟩 GCC12              Pass: 100%/2   | Total: 16m 01s | Avg:  8m 00s | Max: 12m 31s | Hits:  99%/1166  
      🟩 GCC13              Pass: 100%/6   | Total: 28m 29s | Avg:  4m 44s | Max: 14m 01s | Hits:  99%/3498  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 24s | Avg:  8m 24s | Max:  8m 24s | Hits:  95%/281   
      🟩 MSVC14.42          Pass: 100%/1   | Total:  8m 57s | Avg:  8m 57s | Max:  8m 57s | Hits:  95%/281   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 38s | Hits:  96%/750   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 40m 11s | Avg:  5m 01s | Max: 16m 42s | Hits: 100%/4666  
      🟩 GCC                Pass: 100%/10  | Total: 51m 38s | Avg:  5m 09s | Max: 14m 01s | Hits:  99%/5832  
      🟩 MSVC               Pass: 100%/2   | Total: 17m 21s | Avg:  8m 40s | Max:  8m 57s | Hits:  95%/562   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 38s | Hits:  96%/750   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 01s | Hits:  99%/1166  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 43m | Avg:  5m 10s | Max: 16m 42s | Hits:  99%/10644 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 17m | Avg:  4m 03s | Max:  8m 57s | Hits:  99%/10061 
      🟩 Test               Pass: 100%/3   | Total: 43m 14s | Avg: 14m 24s | Max: 16m 42s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 19m 48s | Avg:  6m 36s | Max: 14m 01s | Hits:  99%/1749  
      🟩 90a                Pass: 100%/1   | Total:  3m 07s | Avg:  3m 07s | Max:  3m 07s | Hits:  99%/583   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 14m 01s | Avg:  3m 30s | Max:  5m 33s | Hits:  99%/2124  
      🟩 20                 Pass: 100%/18  | Total:  1h 46m | Avg:  5m 54s | Max: 16m 42s | Hits:  99%/9686  
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 29s | Avg: 4m 07s | Max: 4m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 25s | Avg:  4m 42s | Max:  4m 43s
      🟩 arm64              Pass: 100%/2   | Total:  7m 04s | Avg:  3m 32s | Max:  3m 41s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 43s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 43s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 43s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 43s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 43s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 43s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 29s | Avg:  4m 07s | Max:  4m 43s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 24s | Avg:  4m 12s | Max:  4m 43s
      🟩 20                 Pass: 100%/2   | Total:  8m 05s | Avg:  4m 02s | Max:  4m 42s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 41s | Avg: 8m 20s | Max: 14m 33s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 41s | Avg:  8m 20s | Max: 14m 33s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 41s | Avg:  8m 20s | Max: 14m 33s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 41s | Avg:  8m 20s | Max: 14m 33s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 41s | Avg:  8m 20s | Max: 14m 33s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 41s | Avg:  8m 20s | Max: 14m 33s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 41s | Avg:  8m 20s | Max: 14m 33s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 41s | Avg:  8m 20s | Max: 14m 33s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 08s | Avg:  2m 08s | Max:  2m 08s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 33s | Avg: 14m 33s | Max: 14m 33s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 05m | Avg: 1h 05m | Max: 1h 05m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 05m | Avg:  1h 05m | Max:  1h 05m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco
Copy link
Contributor

miscco commented Mar 17, 2025

/ok to test

@github-actions
Copy link
Contributor

🟨 CI finished in 1h 01m: Pass: 73%/162 | Total: 22h 18m | Avg: 8m 15s | Max: 27m 36s | Hits: 99%/146322
  • 🟨 libcudacxx: Pass: 2%/43 | Total: 5h 08m | Avg: 7m 11s | Max: 23m 09s

    🟨 jobs
      🟥 Build              Pass:   0%/37  | Total:  4h 33m | Avg:  7m 22s | Max: 23m 09s
      🟥 NVRTC              Pass:   0%/2   | Total: 33m 46s | Avg: 16m 53s | Max: 18m 31s
      🟥 Test               Pass:   0%/3  
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 04s | Avg:  2m 04s | Max:  2m 04s
    🟨 cpu
      🟨 amd64              Pass:   2%/41  | Total:  5h 00m | Avg:  7m 19s | Max: 23m 09s
      🟥 arm64              Pass:   0%/2   | Total:  8m 29s | Avg:  4m 14s | Max:  4m 55s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 34m 53s | Avg:  6m 58s | Max: 20m 05s
      🟥 12.6               Pass:   0%/2   | Total: 17m 35s | Avg:  8m 47s | Max:  8m 53s
      🟨 12.8               Pass:   2%/36  | Total:  4h 16m | Avg:  7m 07s | Max: 23m 09s
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total: 44m 41s | Avg: 22m 20s | Max: 23m 09s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 34m 53s | Avg:  6m 58s | Max: 20m 05s
      🟥 nvcc12.6           Pass:   0%/2   | Total: 17m 35s | Avg:  8m 47s | Max:  8m 53s
      🟨 nvcc12.8           Pass:   2%/34  | Total:  3h 31m | Avg:  6m 13s | Max: 21m 01s
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total: 44m 41s | Avg: 22m 20s | Max: 23m 09s
      🟨 nvcc               Pass:   2%/41  | Total:  4h 24m | Avg:  6m 26s | Max: 21m 01s
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 17m 04s | Avg:  4m 16s | Max:  4m 31s
      🟥 Clang15            Pass:   0%/2   | Total: 15m 17s | Avg:  7m 38s | Max: 10m 41s
      🟥 Clang16            Pass:   0%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  4m 55s
      🟥 Clang17            Pass:   0%/2   | Total:  8m 46s | Avg:  4m 23s | Max:  4m 27s
      🟥 Clang18            Pass:   0%/6   | Total: 58m 32s | Avg:  9m 45s | Max: 23m 09s
      🟥 GCC7               Pass:   0%/2   | Total:  7m 05s | Avg:  3m 32s | Max:  3m 46s
      🟥 GCC8               Pass:   0%/1   | Total:  4m 08s | Avg:  4m 08s | Max:  4m 08s
      🟥 GCC9               Pass:   0%/2   | Total:  7m 31s | Avg:  3m 45s | Max:  4m 11s
      🟥 GCC10              Pass:   0%/2   | Total:  8m 22s | Avg:  4m 11s | Max:  4m 13s
      🟥 GCC11              Pass:   0%/2   | Total:  8m 08s | Avg:  4m 04s | Max:  4m 09s
      🟥 GCC12              Pass:   0%/2   | Total:  8m 35s | Avg:  4m 17s | Max:  4m 29s
      🟨 GCC13              Pass:  10%/10  | Total: 56m 45s | Avg:  5m 40s | Max: 18m 31s
      🟥 MSVC14.29          Pass:   0%/2   | Total: 41m 02s | Avg: 20m 31s | Max: 20m 57s
      🟥 MSVC14.42          Pass:   0%/2   | Total: 40m 20s | Avg: 20m 10s | Max: 21m 01s
      🟥 NVHPC25.1          Pass:   0%/2   | Total: 17m 35s | Avg:  8m 47s | Max:  8m 53s
    🟨 cxx_family
      🟥 Clang              Pass:   0%/16  | Total:  1h 49m | Avg:  6m 50s | Max: 23m 09s
      🟨 GCC                Pass:   4%/21  | Total:  1h 40m | Avg:  4m 47s | Max: 18m 31s
      🟥 MSVC               Pass:   0%/4   | Total:  1h 21m | Avg: 20m 20s | Max: 21m 01s
      🟥 NVHPC              Pass:   0%/2   | Total: 17m 35s | Avg:  8m 47s | Max:  8m 53s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  4m 17s | Avg:  2m 08s | Max:  4m 17s
      🟨 rtx2080            Pass:   2%/41  | Total:  5h 04m | Avg:  7m 25s | Max: 23m 09s
    🟥 sm
      🟥 75                 Pass:   0%/2   | Total: 33m 46s | Avg: 16m 53s | Max: 18m 31s
      🟥 90                 Pass:   0%/2   | Total:  4m 17s | Avg:  2m 08s | Max:  4m 17s
      🟥 90;90a;100         Pass:   0%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
    🟥 std
      🟥 17                 Pass:   0%/21  | Total:  2h 53m | Avg:  8m 16s | Max: 21m 32s
      🟥 20                 Pass:   0%/21  | Total:  2h 13m | Avg:  6m 20s | Max: 23m 09s
    
  • 🟥 python: Pass: 0%/1 | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    🟥 ctk
      🟥 12.8               Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    🟥 cudacxx
      🟥 nvcc12.8           Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    🟥 gpu
      🟥 rtx2080            Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total: 10m 04s | Avg: 10m 04s | Max: 10m 04s
    
  • 🟩 cub: Pass: 100%/45 | Total: 8h 00m | Avg: 10m 40s | Max: 25m 21s | Hits: 99%/53651

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  7h 48m | Avg: 10m 54s | Max: 25m 21s | Hits:  99%/51213 
      🟩 arm64              Pass: 100%/2   | Total: 11m 16s | Avg:  5m 38s | Max:  5m 52s | Hits:  99%/2438  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 40s | Avg:  8m 20s | Max: 18m 16s | Hits:  99%/5926  
      🟩 12.6               Pass: 100%/2   | Total: 22m 00s | Avg: 11m 00s | Max: 11m 07s | Hits:  98%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  6h 56m | Avg: 10m 57s | Max: 25m 21s | Hits:  99%/45471 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 11s | Hits: 100%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 40s | Avg:  8m 20s | Max: 18m 16s | Hits:  99%/5926  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 22m 00s | Avg: 11m 00s | Max: 11m 07s | Hits:  98%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 46m | Avg: 11m 16s | Max: 25m 21s | Hits:  99%/43367 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 11s | Hits: 100%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  7h 49m | Avg: 10m 55s | Max: 25m 21s | Hits:  99%/51547 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 27s | Avg:  5m 51s | Max:  6m 27s | Hits: 100%/4884  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 10s | Avg:  6m 35s | Max:  6m 43s | Hits: 100%/2438  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 40s | Avg:  6m 20s | Max:  6m 30s | Hits: 100%/2438  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max:  6m 24s | Hits: 100%/2438  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 13m | Avg: 10m 30s | Max: 22m 50s | Hits: 100%/8199  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 20s | Hits:  99%/2442  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 30s | Avg:  6m 30s | Max:  6m 30s | Hits:  99%/1221  
      🟩 GCC9               Pass: 100%/2   | Total: 13m 02s | Avg:  6m 31s | Max:  6m 51s | Hits:  99%/2442  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 23s | Avg:  6m 41s | Max:  6m 47s | Hits:  99%/2442  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 39s | Avg:  6m 49s | Max:  6m 52s | Hits:  99%/2438  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 10s | Avg:  7m 05s | Max:  7m 18s | Hits:  99%/2438  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 50m | Avg: 15m 32s | Max: 25m 21s | Hits:  99%/13409 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 38m 40s | Avg: 19m 20s | Max: 20m 24s | Hits:  99%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 39m 45s | Avg: 19m 52s | Max: 19m 58s | Hits:  99%/2084  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 22m 00s | Avg: 11m 00s | Max: 11m 07s | Hits:  98%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 15m | Avg:  7m 57s | Max: 22m 50s | Hits: 100%/20397 
      🟩 GCC                Pass: 100%/22  | Total:  4h 04m | Avg: 11m 06s | Max: 25m 21s | Hits:  99%/26832 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 18m | Avg: 19m 36s | Max: 20m 24s | Hits:  99%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total: 22m 00s | Avg: 11m 00s | Max: 11m 07s | Hits:  98%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 51m 53s | Avg: 17m 17s | Max: 23m 39s | Hits:  99%/3657  
      🟩 rtx2080            Pass: 100%/34  | Total:  4h 37m | Avg:  8m 09s | Max: 20m 24s | Hits:  99%/40242 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 30m | Avg: 18m 52s | Max: 25m 21s | Hits:  99%/9752  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 55m | Avg:  7m 59s | Max: 20m 24s | Hits:  99%/43899 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 25m 21s | Avg: 25m 21s | Max: 25m 21s | Hits:  99%/1219  
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 20s | Avg: 19m 20s | Max: 19m 20s | Hits:  99%/1219  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 07s | Max: 23m 39s | Hits:  99%/3657  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 10m | Avg: 23m 24s | Max: 24m 01s | Hits:  99%/3657  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 51m 53s | Avg: 17m 17s | Max: 23m 39s | Hits:  99%/3657  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 09s | Avg:  7m 09s | Max:  7m 09s | Hits:  99%/1219  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 51m | Avg:  8m 33s | Max: 20m 24s | Hits:  99%/23606 
      🟩 20                 Pass: 100%/25  | Total:  5h 08m | Avg: 12m 21s | Max: 25m 21s | Hits:  99%/30045 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 25m | Avg: 8m 34s | Max: 27m 36s | Hits: 99%/80541

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 54s | Avg:  8m 57s | Max: 11m 18s | Hits:  99%/3582  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 15m | Avg:  8m 44s | Max: 27m 36s | Hits:  99%/76960 
      🟩 arm64              Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  5m 08s | Hits:  99%/3581  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 07s | Avg:  8m 13s | Max: 20m 25s | Hits:  99%/8946  
      🟩 12.6               Pass: 100%/2   | Total: 32m 39s | Avg: 16m 19s | Max: 16m 42s | Hits:  99%/3580  
      🟩 12.8               Pass: 100%/38  | Total:  5h 11m | Avg:  8m 12s | Max: 27m 36s | Hits:  99%/68015 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 26s | Hits: 100%/3580  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 07s | Avg:  8m 13s | Max: 20m 25s | Hits:  99%/8946  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 32m 39s | Avg: 16m 19s | Max: 16m 42s | Hits:  99%/3580  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  5h 01m | Avg:  8m 22s | Max: 27m 36s | Hits:  99%/64435 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 26s | Hits: 100%/3580  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 15m | Avg:  8m 43s | Max: 27m 36s | Hits:  99%/76961 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 58s | Avg:  5m 14s | Max:  5m 38s | Hits: 100%/7160  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 31s | Avg:  5m 45s | Max:  5m 57s | Hits: 100%/3580  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 56s | Hits: 100%/3580  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 54s | Avg:  5m 27s | Max:  5m 35s | Hits: 100%/3580  
      🟩 Clang18            Pass: 100%/7   | Total: 44m 50s | Avg:  6m 24s | Max: 10m 25s | Hits: 100%/12530 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 56s | Avg:  5m 28s | Max:  5m 40s | Hits:  99%/3582  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 37s | Avg:  5m 37s | Max:  5m 37s | Hits:  99%/1791  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  5m 42s | Hits:  99%/3582  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 39s | Hits:  99%/3582  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 30s | Avg:  5m 45s | Max:  5m 51s | Hits:  99%/3582  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 22s | Hits:  99%/3582  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 17m | Avg:  7m 47s | Max: 11m 38s | Hits:  99%/17910 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 41m 14s | Avg: 20m 37s | Max: 20m 49s | Hits:  99%/3568  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 11m | Avg: 23m 51s | Max: 27m 36s | Hits:  99%/5352  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 32m 39s | Avg: 16m 19s | Max: 16m 42s | Hits:  99%/3580  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 39m | Avg:  5m 51s | Max: 10m 25s | Hits: 100%/30430 
      🟩 GCC                Pass: 100%/21  | Total:  2h 20m | Avg:  6m 41s | Max: 11m 38s | Hits:  99%/37611 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 52m | Avg: 22m 33s | Max: 27m 36s | Hits:  99%/8920  
      🟩 NVHPC              Pass: 100%/2   | Total: 32m 39s | Avg: 16m 19s | Max: 16m 42s | Hits:  99%/3580  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 32s | Avg:  8m 16s | Max: 11m 38s | Hits:  99%/3582  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 11m | Avg:  7m 37s | Max: 21m 14s | Hits:  99%/59066 
      🟩 rtx4090            Pass: 100%/10  | Total:  1h 57m | Avg: 11m 45s | Max: 27m 36s | Hits:  99%/17893 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  4h 57m | Avg:  7m 49s | Max: 22m 45s | Hits:  99%/68013 
      🟩 TestCPU            Pass: 100%/3   | Total: 43m 45s | Avg: 14m 35s | Max: 27m 36s | Hits:  99%/5365  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 38s | Avg: 11m 09s | Max: 11m 38s | Hits:  99%/7163  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 32s | Avg:  8m 16s | Max: 11m 38s | Hits:  99%/3582  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 29s | Avg:  6m 29s | Max:  6m 29s | Hits:  99%/1791  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 49m | Avg:  8m 27s | Max: 21m 14s | Hits:  99%/35791 
      🟩 20                 Pass: 100%/23  | Total:  3h 18m | Avg:  8m 38s | Max: 27m 36s | Hits:  99%/41168 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 1h 57m | Avg: 5m 19s | Max: 13m 46s | Hits: 99%/11810

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 45m | Avg:  5m 52s | Max: 13m 46s | Hits:  99%/9478  
      🟩 arm64              Pass: 100%/4   | Total: 11m 23s | Avg:  2m 50s | Max:  2m 53s | Hits:  99%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 18s | Avg:  9m 18s | Max:  9m 18s | Hits:  95%/281   
      🟩 12.6               Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 51s | Hits:  96%/750   
      🟩 12.8               Pass: 100%/19  | Total:  1h 36m | Avg:  5m 04s | Max: 13m 46s | Hits:  99%/10779 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 18s | Avg:  9m 18s | Max:  9m 18s | Hits:  95%/281   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 51s | Hits:  96%/750   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 36m | Avg:  5m 04s | Max: 13m 46s | Hits:  99%/10779 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  1h 57m | Avg:  5m 19s | Max: 13m 46s | Hits:  99%/11810 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s | Hits: 100%/585   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s | Hits: 100%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s | Hits: 100%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits: 100%/583   
      🟩 Clang18            Pass: 100%/4   | Total: 20m 47s | Avg:  5m 11s | Max: 11m 41s | Hits: 100%/2332  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits:  99%/585   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s | Hits:  99%/583   
      🟩 GCC12              Pass: 100%/2   | Total: 16m 23s | Avg:  8m 11s | Max: 12m 35s | Hits:  99%/1166  
      🟩 GCC13              Pass: 100%/6   | Total: 28m 35s | Avg:  4m 45s | Max: 13m 46s | Hits:  99%/3498  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 18s | Avg:  9m 18s | Max:  9m 18s | Hits:  95%/281   
      🟩 MSVC14.42          Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s | Hits:  95%/281   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 51s | Hits:  96%/750   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 34m 42s | Avg:  4m 20s | Max: 11m 41s | Hits: 100%/4666  
      🟩 GCC                Pass: 100%/10  | Total: 51m 43s | Avg:  5m 10s | Max: 13m 46s | Hits:  99%/5832  
      🟩 MSVC               Pass: 100%/2   | Total: 19m 13s | Avg:  9m 36s | Max:  9m 55s | Hits:  95%/562   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 51s | Hits:  96%/750   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 52s | Avg:  8m 26s | Max: 13m 46s | Hits:  99%/1166  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 40m | Avg:  5m 00s | Max: 12m 35s | Hits:  99%/10644 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 19m | Avg:  4m 09s | Max:  9m 55s | Hits:  99%/10061 
      🟩 Test               Pass: 100%/3   | Total: 38m 02s | Avg: 12m 40s | Max: 13m 46s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 19m 55s | Avg:  6m 38s | Max: 13m 46s | Hits:  99%/1749  
      🟩 90a                Pass: 100%/1   | Total:  3m 00s | Avg:  3m 00s | Max:  3m 00s | Hits:  99%/583   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 14m 20s | Avg:  3m 35s | Max:  5m 38s | Hits:  99%/2124  
      🟩 20                 Pass: 100%/18  | Total:  1h 42m | Avg:  5m 42s | Max: 13m 46s | Hits:  99%/9686  
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 01s | Avg: 4m 00s | Max: 4m 53s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 32s | Avg:  4m 46s | Max:  4m 53s
      🟩 arm64              Pass: 100%/2   | Total:  6m 29s | Avg:  3m 14s | Max:  3m 16s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 53s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 53s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 53s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 53s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 53s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 53s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 01s | Avg:  4m 00s | Max:  4m 53s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 06s | Avg:  4m 03s | Max:  4m 53s
      🟩 20                 Pass: 100%/2   | Total:  7m 55s | Avg:  3m 57s | Max:  4m 39s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 18m 55s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 08s | Avg:  2m 08s | Max:  2m 08s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 18m 55s | Avg: 18m 55s | Max: 18m 55s | Hits:  98%/160   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco
Copy link
Contributor

miscco commented Mar 17, 2025

/ok to test

_LIBCUDACXX_BEGIN_NAMESPACE_CUDA

template <class _Tp>
struct overflow_result
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

will we use the same structure for other operations with overflow check? e.g. add, mul, etc.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes!

@github-actions
Copy link
Contributor

🟨 CI finished in 1h 07m: Pass: 98%/162 | Total: 1d 00h | Avg: 9m 04s | Max: 1h 07m | Hits: 96%/252566
  • 🟨 libcudacxx: Pass: 95%/43 | Total: 6h 20m | Avg: 8m 51s | Max: 33m 18s | Hits: 91%/106244

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/41  | Total:  6h 13m | Avg:  9m 06s | Max: 33m 18s | Hits:  91%/100431
      🟩 arm64              Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 56s | Hits:  99%/5813  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total: 34m 45s | Avg:  6m 57s | Max: 19m 45s | Hits:  99%/14160 
      🟩 12.6               Pass: 100%/2   | Total: 42m 06s | Avg: 21m 03s | Max: 33m 18s | Hits:  64%/5760  
      🔍 12.8               Pass:  94%/36  | Total:  5h 04m | Avg:  8m 26s | Max: 22m 40s | Hits:  92%/86324 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 43m 05s | Avg: 21m 32s | Max: 22m 24s | Hits:  27%/5774  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 34m 45s | Avg:  6m 57s | Max: 19m 45s | Hits:  99%/14160 
      🟩 nvcc12.6           Pass: 100%/2   | Total: 42m 06s | Avg: 21m 03s | Max: 33m 18s | Hits:  64%/5760  
      🔍 nvcc12.8           Pass:  94%/34  | Total:  4h 21m | Avg:  7m 40s | Max: 22m 40s | Hits:  96%/80550 
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 43m 05s | Avg: 21m 32s | Max: 22m 24s | Hits:  27%/5774  
      🔍 nvcc               Pass:  95%/41  | Total:  5h 37m | Avg:  8m 14s | Max: 33m 18s | Hits:  95%/100470
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 34m 58s | Avg:  8m 44s | Max: 22m 40s | Hits:  82%/11516 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 05s | Avg:  5m 02s | Max:  5m 25s | Hits:  97%/5770  
      🟩 Clang16            Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 55s | Hits:  99%/5770  
      🟩 Clang17            Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 46s | Hits:  99%/5770  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 05m | Avg: 10m 56s | Max: 22m 24s | Hits:  70%/14450 
      🟩 GCC7               Pass: 100%/2   | Total:  7m 44s | Avg:  3m 52s | Max:  4m 00s | Hits:  99%/5708  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s | Hits:  99%/2864  
      🟩 GCC9               Pass: 100%/2   | Total:  7m 37s | Avg:  3m 48s | Max:  4m 10s | Hits:  99%/5720  
      🟩 GCC10              Pass: 100%/2   | Total:  8m 03s | Avg:  4m 01s | Max:  4m 09s | Hits:  99%/5776  
      🟩 GCC11              Pass: 100%/2   | Total:  8m 11s | Avg:  4m 05s | Max:  4m 06s | Hits:  99%/5772  
      🟩 GCC12              Pass: 100%/2   | Total:  8m 01s | Avg:  4m 00s | Max:  4m 04s | Hits:  99%/5772  
      🔍 GCC13              Pass:  80%/10  | Total:  1h 20m | Avg:  8m 02s | Max: 18m 48s | Hits:  99%/14671 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 41m 53s | Avg: 20m 56s | Max: 22m 08s | Hits:  99%/5428  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 43m 39s | Avg: 21m 49s | Max: 22m 02s | Hits:  98%/5497  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 42m 06s | Avg: 21m 03s | Max: 33m 18s | Hits:  64%/5760  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/16  | Total:  2h 09m | Avg:  8m 04s | Max: 22m 40s | Hits:  85%/43276 
      🔍 GCC                Pass:  90%/21  | Total:  2h 04m | Avg:  5m 54s | Max: 18m 48s | Hits:  99%/46283 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 25m | Avg: 21m 23s | Max: 22m 08s | Hits:  98%/10925 
      🟩 NVHPC              Pass: 100%/2   | Total: 42m 06s | Avg: 21m 03s | Max: 33m 18s | Hits:  64%/5760  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 13m 16s | Hits:  99%/2996  
      🔍 rtx2080            Pass:  95%/41  | Total:  6h 03m | Avg:  8m 51s | Max: 33m 18s | Hits:  91%/103248
    🚨 jobs: NVRTC 🚨
      🟩 Build              Pass: 100%/37  | Total:  5h 12m | Avg:  8m 26s | Max: 33m 18s | Hits:  91%/106244
      🔥 NVRTC              Pass:   0%/2   | Total: 35m 13s | Avg: 17m 36s | Max: 18m 48s
      🟩 Test               Pass: 100%/3   | Total: 31m 16s | Avg: 10m 25s | Max: 13m 16s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
    🚨 sm: 75 🚨
      🔥 75                 Pass:   0%/2   | Total: 35m 13s | Avg: 17m 36s | Max: 18m 48s
      🟩 90                 Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max: 13m 16s | Hits:  99%/2996  
      🟩 90;90a;100         Pass: 100%/1   | Total:  4m 17s | Avg:  4m 17s | Max:  4m 17s | Hits:  99%/2996  
    🟨 std
      🟨 17                 Pass:  95%/21  | Total:  3h 10m | Avg:  9m 05s | Max: 22m 40s | Hits:  92%/56799 
      🟨 20                 Pass:  95%/21  | Total:  3h 08m | Avg:  8m 57s | Max: 33m 18s | Hits:  90%/49445 
    
  • 🟩 cub: Pass: 100%/45 | Total: 7h 58m | Avg: 10m 37s | Max: 25m 53s | Hits: 99%/53651

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  7h 46m | Avg: 10m 51s | Max: 25m 53s | Hits:  99%/51213 
      🟩 arm64              Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  5m 58s | Hits:  99%/2438  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 18s | Avg:  8m 15s | Max: 17m 59s | Hits:  99%/5926  
      🟩 12.6               Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 10s | Hits:  98%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  6h 54m | Avg: 10m 55s | Max: 25m 53s | Hits:  99%/45471 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 09s | Hits: 100%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 18s | Avg:  8m 15s | Max: 17m 59s | Hits:  99%/5926  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 10s | Hits:  98%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 44m | Avg: 11m 14s | Max: 25m 53s | Hits:  99%/43367 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 09s | Hits: 100%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  7h 48m | Avg: 10m 53s | Max: 25m 53s | Hits:  99%/51547 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 12s | Avg:  5m 48s | Max:  6m 12s | Hits: 100%/4884  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 18s | Hits: 100%/2438  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max:  6m 20s | Hits: 100%/2438  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 15s | Avg:  6m 07s | Max:  6m 12s | Hits: 100%/2438  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 15m | Avg: 10m 44s | Max: 24m 55s | Hits: 100%/8199  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 01s | Avg:  6m 00s | Max:  6m 14s | Hits:  99%/2442  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 42s | Avg:  6m 42s | Max:  6m 42s | Hits:  99%/1221  
      🟩 GCC9               Pass: 100%/2   | Total: 13m 31s | Avg:  6m 45s | Max:  6m 58s | Hits:  99%/2442  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 55s | Avg:  6m 57s | Max:  7m 04s | Hits:  99%/2442  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 35s | Avg:  6m 47s | Max:  7m 01s | Hits:  99%/2438  
      🟩 GCC12              Pass: 100%/2   | Total: 14m 10s | Avg:  7m 05s | Max:  7m 15s | Hits:  99%/2438  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 50m | Avg: 15m 28s | Max: 25m 53s | Hits:  99%/13409 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 36m 05s | Avg: 18m 02s | Max: 18m 06s | Hits:  99%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 40m 17s | Avg: 20m 08s | Max: 20m 26s | Hits:  99%/2084  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 10s | Hits:  98%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 15m | Avg:  7m 58s | Max: 24m 55s | Hits: 100%/20397 
      🟩 GCC                Pass: 100%/22  | Total:  4h 04m | Avg: 11m 05s | Max: 25m 53s | Hits:  99%/26832 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 16m | Avg: 19m 05s | Max: 20m 26s | Hits:  99%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 10s | Hits:  98%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 51m 06s | Avg: 17m 02s | Max: 25m 06s | Hits:  99%/3657  
      🟩 rtx2080            Pass: 100%/34  | Total:  4h 34m | Avg:  8m 04s | Max: 20m 26s | Hits:  99%/40242 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 32m | Avg: 19m 06s | Max: 25m 53s | Hits:  99%/9752  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 52m | Avg:  7m 54s | Max: 20m 26s | Hits:  99%/43899 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 25m 10s | Avg: 25m 10s | Max: 25m 10s | Hits:  99%/1219  
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 29s | Avg: 19m 29s | Max: 19m 29s | Hits:  99%/1219  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 15m | Avg: 25m 18s | Max: 25m 53s | Hits:  99%/3657  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 05m | Avg: 21m 44s | Max: 22m 53s | Hits:  99%/3657  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 51m 06s | Avg: 17m 02s | Max: 25m 06s | Hits:  99%/3657  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 00s | Avg:  7m 00s | Max:  7m 00s | Hits:  99%/1219  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 48m | Avg:  8m 26s | Max: 20m 26s | Hits:  99%/23606 
      🟩 20                 Pass: 100%/25  | Total:  5h 09m | Avg: 12m 22s | Max: 25m 53s | Hits:  99%/30045 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 24m | Avg: 8m 32s | Max: 26m 11s | Hits: 99%/80541

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 57s | Avg:  8m 58s | Max: 11m 53s | Hits:  99%/3582  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 14m | Avg:  8m 42s | Max: 26m 11s | Hits:  99%/76960 
      🟩 arm64              Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  5m 10s | Hits:  99%/3581  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 40m 57s | Avg:  8m 11s | Max: 21m 05s | Hits:  99%/8946  
      🟩 12.6               Pass: 100%/2   | Total: 32m 35s | Avg: 16m 17s | Max: 16m 58s | Hits:  99%/3580  
      🟩 12.8               Pass: 100%/38  | Total:  5h 10m | Avg:  8m 10s | Max: 26m 11s | Hits:  99%/68015 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 00s | Hits: 100%/3580  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 40m 57s | Avg:  8m 11s | Max: 21m 05s | Hits:  99%/8946  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 32m 35s | Avg: 16m 17s | Max: 16m 58s | Hits:  99%/3580  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  5h 00m | Avg:  8m 21s | Max: 26m 11s | Hits:  99%/64435 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 00s | Avg:  5m 00s | Max:  5m 00s | Hits: 100%/3580  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 14m | Avg:  8m 42s | Max: 26m 11s | Hits:  99%/76961 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 57s | Avg:  5m 14s | Max:  5m 46s | Hits: 100%/7160  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 00s | Hits: 100%/3580  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 48s | Avg:  5m 24s | Max:  5m 32s | Hits: 100%/3580  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 06s | Avg:  5m 33s | Max:  5m 35s | Hits: 100%/3580  
      🟩 Clang18            Pass: 100%/7   | Total: 44m 25s | Avg:  6m 20s | Max: 11m 08s | Hits: 100%/12530 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 33s | Avg:  5m 16s | Max:  5m 22s | Hits:  99%/3582  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s | Hits:  99%/1791  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 39s | Hits:  99%/3582  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 51s | Avg:  5m 25s | Max:  5m 30s | Hits:  99%/3582  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 30s | Avg:  5m 45s | Max:  5m 55s | Hits:  99%/3582  
      🟩 GCC12              Pass: 100%/2   | Total: 12m 25s | Avg:  6m 12s | Max:  6m 21s | Hits:  99%/3582  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 20m | Avg:  8m 02s | Max: 13m 33s | Hits:  99%/17910 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 42m 42s | Avg: 21m 21s | Max: 21m 37s | Hits:  99%/3568  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 08m | Avg: 22m 45s | Max: 26m 11s | Hits:  99%/5352  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 32m 35s | Avg: 16m 17s | Max: 16m 58s | Hits:  99%/3580  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 39m | Avg:  5m 50s | Max: 11m 08s | Hits: 100%/30430 
      🟩 GCC                Pass: 100%/21  | Total:  2h 21m | Avg:  6m 44s | Max: 13m 33s | Hits:  99%/37611 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 50m | Avg: 22m 11s | Max: 26m 11s | Hits:  99%/8920  
      🟩 NVHPC              Pass: 100%/2   | Total: 32m 35s | Avg: 16m 17s | Max: 16m 58s | Hits:  99%/3580  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 13s | Avg:  9m 06s | Max: 13m 33s | Hits:  99%/3582  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 09m | Avg:  7m 33s | Max: 21m 37s | Hits:  99%/59066 
      🟩 rtx4090            Pass: 100%/10  | Total:  1h 56m | Avg: 11m 38s | Max: 26m 11s | Hits:  99%/17893 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  4h 53m | Avg:  7m 43s | Max: 21m 37s | Hits:  99%/68013 
      🟩 TestCPU            Pass: 100%/3   | Total: 41m 42s | Avg: 13m 54s | Max: 26m 11s | Hits:  99%/5365  
      🟩 TestGPU            Pass: 100%/4   | Total: 48m 46s | Avg: 12m 11s | Max: 13m 33s | Hits:  99%/7163  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 18m 13s | Avg:  9m 06s | Max: 13m 33s | Hits:  99%/3582  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 14s | Avg:  6m 14s | Max:  6m 14s | Hits:  99%/1791  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 48m | Avg:  8m 24s | Max: 21m 37s | Hits:  99%/35791 
      🟩 20                 Pass: 100%/23  | Total:  3h 18m | Avg:  8m 37s | Max: 26m 11s | Hits:  99%/41168 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 05m | Avg: 5m 41s | Max: 19m 29s | Hits: 99%/11810

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 53m | Avg:  6m 18s | Max: 19m 29s | Hits:  99%/9478  
      🟩 arm64              Pass: 100%/4   | Total: 11m 28s | Avg:  2m 52s | Max:  3m 01s | Hits:  99%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 43s | Avg:  9m 43s | Max:  9m 43s | Hits:  95%/281   
      🟩 12.6               Pass: 100%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 35s | Hits:  96%/750   
      🟩 12.8               Pass: 100%/19  | Total:  1h 42m | Avg:  5m 25s | Max: 19m 29s | Hits:  99%/10779 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 43s | Avg:  9m 43s | Max:  9m 43s | Hits:  95%/281   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 35s | Hits:  96%/750   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 42m | Avg:  5m 25s | Max: 19m 29s | Hits:  99%/10779 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 05m | Avg:  5m 41s | Max: 19m 29s | Hits:  99%/11810 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s | Hits: 100%/585   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s | Hits: 100%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s | Hits: 100%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s | Hits: 100%/583   
      🟩 Clang18            Pass: 100%/4   | Total: 21m 43s | Avg:  5m 25s | Max: 12m 36s | Hits: 100%/2332  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s | Hits:  99%/585   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s | Hits:  99%/583   
      🟩 GCC12              Pass: 100%/2   | Total: 23m 06s | Avg: 11m 33s | Max: 19m 29s | Hits:  99%/1166  
      🟩 GCC13              Pass: 100%/6   | Total: 28m 50s | Avg:  4m 48s | Max: 14m 08s | Hits:  99%/3498  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 43s | Avg:  9m 43s | Max:  9m 43s | Hits:  95%/281   
      🟩 MSVC14.42          Pass: 100%/1   | Total:  9m 07s | Avg:  9m 07s | Max:  9m 07s | Hits:  95%/281   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 35s | Hits:  96%/750   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 35m 12s | Avg:  4m 24s | Max: 12m 36s | Hits: 100%/4666  
      🟩 GCC                Pass: 100%/10  | Total: 58m 39s | Avg:  5m 51s | Max: 19m 29s | Hits:  99%/5832  
      🟩 MSVC               Pass: 100%/2   | Total: 18m 50s | Avg:  9m 25s | Max:  9m 43s | Hits:  95%/562   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 23s | Avg:  6m 11s | Max:  6m 35s | Hits:  96%/750   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 05s | Avg:  8m 32s | Max: 14m 08s | Hits:  99%/1166  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 47m | Avg:  5m 23s | Max: 19m 29s | Hits:  99%/10644 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 18m | Avg:  4m 09s | Max:  9m 43s | Hits:  99%/10061 
      🟩 Test               Pass: 100%/3   | Total: 46m 13s | Avg: 15m 24s | Max: 19m 29s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 19m 58s | Avg:  6m 39s | Max: 14m 08s | Hits:  99%/1749  
      🟩 90a                Pass: 100%/1   | Total:  3m 05s | Avg:  3m 05s | Max:  3m 05s | Hits:  99%/583   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 15m 08s | Avg:  3m 47s | Max:  6m 35s | Hits:  99%/2124  
      🟩 20                 Pass: 100%/18  | Total:  1h 49m | Avg:  6m 06s | Max: 19m 29s | Hits:  99%/9686  
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 45s | Avg: 4m 11s | Max: 4m 58s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 50s | Avg:  4m 55s | Max:  4m 58s
      🟩 arm64              Pass: 100%/2   | Total:  6m 55s | Avg:  3m 27s | Max:  3m 40s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 58s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 58s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 58s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 58s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 58s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 58s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 45s | Avg:  4m 11s | Max:  4m 58s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 52s
      🟩 20                 Pass: 100%/2   | Total:  8m 13s | Avg:  4m 06s | Max:  4m 58s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 28s | Avg: 8m 14s | Max: 14m 23s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 14m 23s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 14m 23s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 14m 23s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 14m 23s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 14m 23s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 14m 23s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max: 14m 23s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 23s | Avg: 14m 23s | Max: 14m 23s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 07m | Avg: 1h 07m | Max: 1h 07m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Copy link
Contributor

@fbusato fbusato left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good! Please don't forget to add the documentation

@github-project-automation github-project-automation bot moved this from In Review to In Progress in CCCL Mar 17, 2025
@davebayer
Copy link
Contributor Author

looks good! Please don't forget to add the documentation

I wanted to update the documentation once we have all of the overflow functions implemented, but I can update it bit by bit if you want

@fbusato
Copy link
Contributor

fbusato commented Mar 17, 2025

I wanted to update the documentation once we have all of the overflow functions implemented, but I can update it bit by bit if you want

no, this is not critical. What is important is to not forget about it 🙂

@github-project-automation github-project-automation bot moved this from In Progress to In Review in CCCL Mar 17, 2025
@bernhardmgruber
Copy link
Contributor

/ok to test

@github-actions
Copy link
Contributor

🟨 CI finished in 6h 05m: Pass: 99%/162 | Total: 1d 05h | Avg: 10m 56s | Max: 5h 59m | Hits: 97%/252023
  • 🟨 cudax: Pass: 95%/22 | Total: 7h 45m | Avg: 21m 08s | Max: 5h 59m | Hits: 99%/11227

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  94%/18  | Total:  7h 33m | Avg: 25m 12s | Max:  5h 59m | Hits:  99%/8895  
      🟩 arm64              Pass: 100%/4   | Total: 11m 30s | Avg:  2m 52s | Max:  3m 06s | Hits:  99%/2332  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/1   | Total:  9m 16s | Avg:  9m 16s | Max:  9m 16s | Hits:  95%/281   
      🟩 12.6               Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 14s | Hits:  96%/750   
      🔍 12.8               Pass:  94%/19  | Total:  7h 23m | Avg: 23m 21s | Max:  5h 59m | Hits:  99%/10196 
    🔍 cudacxx: nvcc12.8 🔍
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 16s | Avg:  9m 16s | Max:  9m 16s | Hits:  95%/281   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 14s | Hits:  96%/750   
      🔍 nvcc12.8           Pass:  94%/19  | Total:  7h 23m | Avg: 23m 21s | Max:  5h 59m | Hits:  99%/10196 
    🔍 cxx: GCC12 🔍
      🟩 Clang14            Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s | Hits: 100%/585   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s | Hits: 100%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits: 100%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s | Hits: 100%/583   
      🟩 Clang18            Pass: 100%/4   | Total: 21m 30s | Avg:  5m 22s | Max: 12m 13s | Hits: 100%/2332  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s | Hits:  99%/585   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s | Hits:  99%/583   
      🔍 GCC12              Pass:  50%/2   | Total:  6h 03m | Avg:  3h 01m | Max:  5h 59m | Hits:  99%/583   
      🟩 GCC13              Pass: 100%/6   | Total: 28m 26s | Avg:  4m 44s | Max: 13m 41s | Hits:  99%/3498  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 16s | Avg:  9m 16s | Max:  9m 16s | Hits:  95%/281   
      🟩 MSVC14.42          Pass: 100%/1   | Total:  9m 14s | Avg:  9m 14s | Max:  9m 14s | Hits:  95%/281   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 14s | Hits:  96%/750   
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/8   | Total: 35m 37s | Avg:  4m 27s | Max: 12m 13s | Hits: 100%/4666  
      🔍 GCC                Pass:  90%/10  | Total:  6h 38m | Avg: 39m 52s | Max:  5h 59m | Hits:  99%/5249  
      🟩 MSVC               Pass: 100%/2   | Total: 18m 30s | Avg:  9m 15s | Max:  9m 16s | Hits:  95%/562   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 14s | Hits:  96%/750   
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 16m 47s | Avg:  8m 23s | Max: 13m 41s | Hits:  99%/1166  
      🔍 rtx2080            Pass:  95%/20  | Total:  7h 28m | Avg: 22m 25s | Max:  5h 59m | Hits:  99%/10061 
    🔍 jobs: Test 🔍
      🟩 Build              Pass: 100%/19  | Total:  1h 19m | Avg:  4m 10s | Max:  9m 16s | Hits:  99%/10061 
      🔍 Test               Pass:  66%/3   | Total:  6h 25m | Avg:  2h 08m | Max:  5h 59m | Hits:  99%/1166  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/4   | Total: 14m 47s | Avg:  3m 41s | Max:  6m 00s | Hits:  99%/2124  
      🔍 20                 Pass:  94%/18  | Total:  7h 30m | Avg: 25m 01s | Max:  5h 59m | Hits:  99%/9103  
    🟨 cudacxx_family
      🟨 nvcc               Pass:  95%/22  | Total:  7h 45m | Avg: 21m 08s | Max:  5h 59m | Hits:  99%/11227 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 19m 37s | Avg:  6m 32s | Max: 13m 41s | Hits:  99%/1749  
      🟩 90a                Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s | Hits:  99%/583   
    
  • 🟩 cub: Pass: 100%/45 | Total: 7h 45m | Avg: 10m 21s | Max: 24m 06s | Hits: 99%/53651

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  7h 34m | Avg: 10m 34s | Max: 24m 06s | Hits:  99%/51213 
      🟩 arm64              Pass: 100%/2   | Total: 11m 15s | Avg:  5m 37s | Max:  5m 55s | Hits:  99%/2438  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 01s | Avg:  8m 12s | Max: 17m 56s | Hits:  99%/5926  
      🟩 12.6               Pass: 100%/2   | Total: 22m 08s | Avg: 11m 04s | Max: 11m 23s | Hits:  98%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  6h 42m | Avg: 10m 35s | Max: 24m 06s | Hits:  99%/45471 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  4m 55s | Hits: 100%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 01s | Avg:  8m 12s | Max: 17m 56s | Hits:  99%/5926  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 22m 08s | Avg: 11m 04s | Max: 11m 23s | Hits:  98%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 32m | Avg: 10m 54s | Max: 24m 06s | Hits:  99%/43367 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 48s | Avg:  4m 54s | Max:  4m 55s | Hits: 100%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  7h 35m | Avg: 10m 36s | Max: 24m 06s | Hits:  99%/51547 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 25s | Avg:  5m 51s | Max:  6m 09s | Hits: 100%/4884  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 27s | Avg:  6m 13s | Max:  6m 18s | Hits: 100%/2438  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 57s | Avg:  6m 28s | Max:  6m 39s | Hits: 100%/2438  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 31s | Avg:  6m 15s | Max:  6m 19s | Hits: 100%/2438  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 11m | Avg: 10m 09s | Max: 23m 33s | Hits: 100%/8199  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 02s | Avg:  6m 01s | Max:  6m 09s | Hits:  99%/2442  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 12s | Avg:  6m 12s | Max:  6m 12s | Hits:  99%/1221  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 39s | Avg:  6m 19s | Max:  6m 42s | Hits:  99%/2442  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 19s | Avg:  6m 39s | Max:  6m 47s | Hits:  99%/2442  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 17s | Avg:  6m 38s | Max:  6m 43s | Hits:  99%/2438  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 47s | Avg:  6m 53s | Max:  6m 57s | Hits:  99%/2438  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 43m | Avg: 14m 50s | Max: 24m 06s | Hits:  99%/13409 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 36m 50s | Avg: 18m 25s | Max: 18m 54s | Hits:  99%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 39m 52s | Avg: 19m 56s | Max: 20m 23s | Hits:  99%/2084  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 22m 08s | Avg: 11m 04s | Max: 11m 23s | Hits:  98%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 12m | Avg:  7m 47s | Max: 23m 33s | Hits: 100%/20397 
      🟩 GCC                Pass: 100%/22  | Total:  3h 54m | Avg: 10m 39s | Max: 24m 06s | Hits:  99%/26832 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 16m | Avg: 19m 10s | Max: 20m 23s | Hits:  99%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total: 22m 08s | Avg: 11m 04s | Max: 11m 23s | Hits:  98%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 49m 55s | Avg: 16m 38s | Max: 23m 37s | Hits:  99%/3657  
      🟩 rtx2080            Pass: 100%/34  | Total:  4h 32m | Avg:  8m 00s | Max: 20m 23s | Hits:  99%/40242 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 23m | Avg: 17m 55s | Max: 24m 06s | Hits:  99%/9752  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 50m | Avg:  7m 51s | Max: 20m 23s | Hits:  99%/43899 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 34s | Avg: 22m 34s | Max: 22m 34s | Hits:  99%/1219  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 56s | Avg: 16m 56s | Max: 16m 56s | Hits:  99%/1219  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 45s | Max: 24m 06s | Hits:  99%/3657  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 28s | Max: 23m 03s | Hits:  99%/3657  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 49m 55s | Avg: 16m 38s | Max: 23m 37s | Hits:  99%/3657  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 08s | Avg:  7m 08s | Max:  7m 08s | Hits:  99%/1219  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 47m | Avg:  8m 22s | Max: 20m 23s | Hits:  99%/23606 
      🟩 20                 Pass: 100%/25  | Total:  4h 58m | Avg: 11m 55s | Max: 24m 06s | Hits:  99%/30045 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 49m | Avg: 9m 06s | Max: 37m 33s | Hits: 98%/80541

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 31s | Avg:  8m 45s | Max: 11m 07s | Hits:  99%/3582  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 39m | Avg:  9m 17s | Max: 37m 33s | Hits:  98%/76960 
      🟩 arm64              Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  5m 10s | Hits:  99%/3581  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 41m 01s | Avg:  8m 12s | Max: 21m 15s | Hits:  99%/8946  
      🟩 12.6               Pass: 100%/2   | Total: 32m 20s | Avg: 16m 10s | Max: 16m 27s | Hits:  99%/3580  
      🟩 12.8               Pass: 100%/38  | Total:  5h 36m | Avg:  8m 50s | Max: 37m 33s | Hits:  98%/68015 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  5m 15s | Hits: 100%/3580  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 41m 01s | Avg:  8m 12s | Max: 21m 15s | Hits:  99%/8946  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 32m 20s | Avg: 16m 10s | Max: 16m 27s | Hits:  99%/3580  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  5h 25m | Avg:  9m 02s | Max: 37m 33s | Hits:  98%/64435 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 27s | Avg:  5m 13s | Max:  5m 15s | Hits: 100%/3580  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 39m | Avg:  9m 16s | Max: 37m 33s | Hits:  98%/76961 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 17s | Avg:  5m 04s | Max:  5m 28s | Hits: 100%/7160  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  5m 33s | Hits: 100%/3580  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 31s | Hits: 100%/3580  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  5m 44s | Hits: 100%/3580  
      🟩 Clang18            Pass: 100%/7   | Total: 43m 42s | Avg:  6m 14s | Max: 10m 14s | Hits: 100%/12530 
      🟩 GCC7               Pass: 100%/2   | Total: 11m 04s | Avg:  5m 32s | Max:  6m 02s | Hits:  99%/3582  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 11s | Avg:  5m 11s | Max:  5m 11s | Hits:  99%/1791  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 48s | Avg:  5m 24s | Max:  5m 37s | Hits:  99%/3582  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  5m 38s | Hits:  99%/3582  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 46s | Avg:  5m 53s | Max:  5m 56s | Hits:  99%/3582  
      🟩 GCC12              Pass: 100%/2   | Total: 43m 47s | Avg: 21m 53s | Max: 37m 33s | Hits:  74%/3582  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 16m | Avg:  7m 39s | Max: 11m 22s | Hits:  99%/17910 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 42m 48s | Avg: 21m 24s | Max: 21m 33s | Hits:  99%/3568  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 06m | Avg: 22m 17s | Max: 25m 32s | Hits:  99%/5352  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 32m 20s | Avg: 16m 10s | Max: 16m 27s | Hits:  99%/3580  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 37m | Avg:  5m 43s | Max: 10m 14s | Hits: 100%/30430 
      🟩 GCC                Pass: 100%/21  | Total:  2h 50m | Avg:  8m 06s | Max: 37m 33s | Hits:  97%/37611 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 49m | Avg: 21m 56s | Max: 25m 32s | Hits:  99%/8920  
      🟩 NVHPC              Pass: 100%/2   | Total: 32m 20s | Avg: 16m 10s | Max: 16m 27s | Hits:  99%/3580  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 06s | Avg:  8m 03s | Max: 11m 13s | Hits:  99%/3582  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 40m | Avg:  8m 30s | Max: 37m 33s | Hits:  98%/59066 
      🟩 rtx4090            Pass: 100%/10  | Total:  1h 52m | Avg: 11m 17s | Max: 25m 32s | Hits:  99%/17893 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  5h 25m | Avg:  8m 33s | Max: 37m 33s | Hits:  98%/68013 
      🟩 TestCPU            Pass: 100%/3   | Total: 40m 38s | Avg: 13m 32s | Max: 25m 32s | Hits:  99%/5365  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 56s | Avg: 10m 59s | Max: 11m 22s | Hits:  99%/7163  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 06s | Avg:  8m 03s | Max: 11m 13s | Hits:  99%/3582  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 06s | Avg:  6m 06s | Max:  6m 06s | Hits:  99%/1791  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 48m | Avg:  8m 24s | Max: 21m 33s | Hits:  99%/35791 
      🟩 20                 Pass: 100%/23  | Total:  3h 43m | Avg:  9m 44s | Max: 37m 33s | Hits:  97%/41168 
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 5h 31m | Avg: 7m 42s | Max: 22m 37s | Hits: 95%/106284

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  5h 23m | Avg:  7m 53s | Max: 22m 37s | Hits:  95%/100471
      🟩 arm64              Pass: 100%/2   | Total:  7m 24s | Avg:  3m 42s | Max:  3m 51s | Hits:  99%/5813  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 35m 23s | Avg:  7m 04s | Max: 19m 55s | Hits:  99%/14160 
      🟩 12.6               Pass: 100%/2   | Total: 18m 42s | Avg:  9m 21s | Max:  9m 23s | Hits:  98%/5760  
      🟩 12.8               Pass: 100%/36  | Total:  4h 37m | Avg:  7m 41s | Max: 22m 37s | Hits:  94%/86364 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 42m 41s | Avg: 21m 20s | Max: 22m 37s | Hits:  27%/5774  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 35m 23s | Avg:  7m 04s | Max: 19m 55s | Hits:  99%/14160 
      🟩 nvcc12.6           Pass: 100%/2   | Total: 18m 42s | Avg:  9m 21s | Max:  9m 23s | Hits:  98%/5760  
      🟩 nvcc12.8           Pass: 100%/34  | Total:  3h 54m | Avg:  6m 53s | Max: 22m 30s | Hits:  99%/80590 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 42m 41s | Avg: 21m 20s | Max: 22m 37s | Hits:  27%/5774  
      🟩 nvcc               Pass: 100%/41  | Total:  4h 48m | Avg:  7m 02s | Max: 22m 30s | Hits:  99%/100510
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 16m 49s | Avg:  4m 12s | Max:  4m 32s | Hits:  99%/11516 
      🟩 Clang15            Pass: 100%/2   | Total:  9m 01s | Avg:  4m 30s | Max:  4m 31s | Hits:  99%/5770  
      🟩 Clang16            Pass: 100%/2   | Total:  9m 21s | Avg:  4m 40s | Max:  4m 51s | Hits:  99%/5770  
      🟩 Clang17            Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  4m 30s | Hits:  99%/5770  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 04m | Avg: 10m 49s | Max: 22m 37s | Hits:  70%/14450 
      🟩 GCC7               Pass: 100%/2   | Total:  7m 33s | Avg:  3m 46s | Max:  3m 52s | Hits:  99%/5708  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s | Hits:  99%/2864  
      🟩 GCC9               Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 22s | Hits:  99%/5720  
      🟩 GCC10              Pass: 100%/2   | Total:  8m 17s | Avg:  4m 08s | Max:  4m 14s | Hits:  99%/5776  
      🟩 GCC11              Pass: 100%/2   | Total:  8m 21s | Avg:  4m 10s | Max:  4m 16s | Hits:  99%/5772  
      🟩 GCC12              Pass: 100%/2   | Total:  8m 16s | Avg:  4m 08s | Max:  4m 14s | Hits:  99%/5772  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 15m | Avg:  7m 34s | Max: 17m 00s | Hits:  99%/14711 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 39m 40s | Avg: 19m 50s | Max: 19m 55s | Hits:  99%/5428  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 43m 15s | Avg: 21m 37s | Max: 22m 30s | Hits:  98%/5497  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 18m 42s | Avg:  9m 21s | Max:  9m 23s | Hits:  98%/5760  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  1h 49m | Avg:  6m 48s | Max: 22m 37s | Hits:  89%/43276 
      🟩 GCC                Pass: 100%/21  | Total:  2h 00m | Avg:  5m 44s | Max: 17m 00s | Hits:  99%/46323 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 22m | Avg: 20m 43s | Max: 22m 30s | Hits:  99%/10925 
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 42s | Avg:  9m 21s | Max:  9m 23s | Hits:  98%/5760  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 15m 58s | Avg:  7m 59s | Max: 11m 55s | Hits:  99%/2996  
      🟩 rtx2080            Pass: 100%/41  | Total:  5h 15m | Avg:  7m 41s | Max: 22m 37s | Hits:  95%/103288
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 27m | Avg:  7m 13s | Max: 22m 37s | Hits:  95%/106244
      🟩 NVRTC              Pass: 100%/2   | Total: 32m 25s | Avg: 16m 12s | Max: 17m 00s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 29m 35s | Avg:  9m 51s | Max: 11m 55s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 32m 25s | Avg: 16m 12s | Max: 17m 00s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 15m 58s | Avg:  7m 59s | Max: 11m 55s | Hits:  99%/2996  
      🟩 90;90a;100         Pass: 100%/1   | Total:  4m 41s | Avg:  4m 41s | Max:  4m 41s | Hits:  99%/2996  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  2h 49m | Avg:  8m 04s | Max: 20m 45s | Hits:  95%/56819 
      🟩 20                 Pass: 100%/21  | Total:  2h 39m | Avg:  7m 35s | Max: 22m 37s | Hits:  94%/49465 
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 11s | Avg: 4m 02s | Max: 4m 54s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 45s | Avg:  4m 52s | Max:  4m 54s
      🟩 arm64              Pass: 100%/2   | Total:  6m 26s | Avg:  3m 13s | Max:  3m 14s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 54s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 54s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 54s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 54s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 54s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 54s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 11s | Avg:  4m 02s | Max:  4m 54s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 05s | Avg:  4m 02s | Max:  4m 51s
      🟩 20                 Pass: 100%/2   | Total:  8m 06s | Avg:  4m 03s | Max:  4m 54s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 00s | Avg: 8m 30s | Max: 14m 53s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 00s | Avg:  8m 30s | Max: 14m 53s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 17m 00s | Avg:  8m 30s | Max: 14m 53s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 17m 00s | Avg:  8m 30s | Max: 14m 53s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 00s | Avg:  8m 30s | Max: 14m 53s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 00s | Avg:  8m 30s | Max: 14m 53s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 00s | Avg:  8m 30s | Max: 14m 53s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 00s | Avg:  8m 30s | Max: 14m 53s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 53s | Avg: 14m 53s | Max: 14m 53s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 07m | Avg: 1h 07m | Max: 1h 07m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco
Copy link
Contributor

miscco commented Mar 18, 2025

/ok to test

@miscco miscco enabled auto-merge (squash) March 18, 2025 08:40
@github-actions
Copy link
Contributor

🟩 CI finished in 1h 12m: Pass: 100%/162 | Total: 1d 00h | Avg: 8m 56s | Max: 1h 08m | Hits: 96%/252755
  • 🟩 cub: Pass: 100%/45 | Total: 7h 59m | Avg: 10m 39s | Max: 26m 08s | Hits: 99%/53780

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  7h 47m | Avg: 10m 53s | Max: 26m 08s | Hits:  99%/51336 
      🟩 arm64              Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  5m 59s | Hits:  99%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 42m 17s | Avg:  8m 27s | Max: 18m 45s | Hits:  99%/5940  
      🟩 12.6               Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 12s | Hits:  98%/2260  
      🟩 12.8               Pass: 100%/38  | Total:  6h 55m | Avg: 10m 55s | Max: 26m 08s | Hits:  99%/45580 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 05s | Hits: 100%/2108  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 42m 17s | Avg:  8m 27s | Max: 18m 45s | Hits:  99%/5940  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 12s | Hits:  98%/2260  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  6h 45m | Avg: 11m 15s | Max: 26m 08s | Hits:  99%/43472 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 05s | Hits: 100%/2108  
      🟩 nvcc               Pass: 100%/43  | Total:  7h 49m | Avg: 10m 54s | Max: 26m 08s | Hits:  99%/51672 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 24m 27s | Avg:  6m 06s | Max:  6m 42s | Hits: 100%/4896  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 16s | Avg:  6m 38s | Max:  6m 51s | Hits: 100%/2444  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 36s | Avg:  6m 18s | Max:  6m 19s | Hits: 100%/2444  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 20s | Avg:  6m 10s | Max:  6m 12s | Hits: 100%/2444  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 13m | Avg: 10m 30s | Max: 23m 09s | Hits: 100%/8218  
      🟩 GCC7               Pass: 100%/2   | Total: 12m 31s | Avg:  6m 15s | Max:  6m 20s | Hits:  99%/2448  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 06s | Avg:  6m 06s | Max:  6m 06s | Hits:  99%/1224  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 45s | Avg:  6m 22s | Max:  6m 38s | Hits:  99%/2448  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 17s | Avg:  6m 38s | Max:  6m 43s | Hits:  99%/2448  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 22s | Avg:  6m 41s | Max:  6m 44s | Hits:  99%/2444  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 51s | Avg:  6m 55s | Max:  6m 57s | Hits:  99%/2444  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 54m | Avg: 15m 51s | Max: 26m 08s | Hits:  99%/13442 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 37m 11s | Avg: 18m 35s | Max: 18m 45s | Hits:  99%/2088  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 37m 43s | Avg: 18m 51s | Max: 18m 55s | Hits:  99%/2088  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 12s | Hits:  98%/2260  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 16m | Avg:  8m 00s | Max: 23m 09s | Hits: 100%/20446 
      🟩 GCC                Pass: 100%/22  | Total:  4h 06m | Avg: 11m 11s | Max: 26m 08s | Hits:  99%/26898 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 14m | Avg: 18m 43s | Max: 18m 55s | Hits:  99%/4176  
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 12s | Hits:  98%/2260  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 51m 59s | Avg: 17m 19s | Max: 25m 04s | Hits:  99%/3666  
      🟩 rtx2080            Pass: 100%/34  | Total:  4h 32m | Avg:  8m 01s | Max: 18m 55s | Hits:  99%/40338 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 34m | Avg: 19m 20s | Max: 26m 08s | Hits:  99%/9776  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 50m | Avg:  7m 50s | Max: 18m 55s | Hits:  99%/44004 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 35s | Avg: 24m 35s | Max: 24m 35s | Hits:  99%/1222  
      🟩 GraphCapture       Pass: 100%/1   | Total: 20m 14s | Avg: 20m 14s | Max: 20m 14s | Hits:  99%/1222  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 14m | Avg: 24m 47s | Max: 26m 08s | Hits:  99%/3666  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 09m | Avg: 23m 14s | Max: 24m 59s | Hits:  99%/3666  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 51m 59s | Avg: 17m 19s | Max: 25m 04s | Hits:  99%/3666  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 55s | Avg:  6m 55s | Max:  6m 55s | Hits:  99%/1222  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 48m | Avg:  8m 24s | Max: 18m 48s | Hits:  99%/23662 
      🟩 20                 Pass: 100%/25  | Total:  5h 11m | Avg: 12m 26s | Max: 26m 08s | Hits:  99%/30118 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 6h 10m | Avg: 8m 14s | Max: 25m 15s | Hits: 99%/80541

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 03s | Avg:  8m 31s | Max: 11m 12s | Hits:  99%/3582  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  6h 00m | Avg:  8m 23s | Max: 25m 15s | Hits:  99%/76960 
      🟩 arm64              Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 15s | Hits:  99%/3581  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 40m 38s | Avg:  8m 07s | Max: 20m 40s | Hits:  99%/8946  
      🟩 12.6               Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 19s | Hits:  99%/3580  
      🟩 12.8               Pass: 100%/38  | Total:  4h 59m | Avg:  7m 53s | Max: 25m 15s | Hits:  99%/68015 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 06s | Avg:  5m 03s | Max:  5m 12s | Hits: 100%/3580  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 40m 38s | Avg:  8m 07s | Max: 20m 40s | Hits:  99%/8946  
      🟩 nvcc12.6           Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 19s | Hits:  99%/3580  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  4h 49m | Avg:  8m 02s | Max: 25m 15s | Hits:  99%/64435 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 06s | Avg:  5m 03s | Max:  5m 12s | Hits: 100%/3580  
      🟩 nvcc               Pass: 100%/43  | Total:  6h 00m | Avg:  8m 23s | Max: 25m 15s | Hits:  99%/76961 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  5m 50s | Hits: 100%/7160  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 17s | Avg:  5m 38s | Max:  5m 57s | Hits: 100%/3580  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  5m 34s | Hits: 100%/3580  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 32s | Hits: 100%/3580  
      🟩 Clang18            Pass: 100%/7   | Total: 43m 18s | Avg:  6m 11s | Max: 10m 18s | Hits: 100%/12530 
      🟩 GCC7               Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  5m 43s | Hits:  99%/3582  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s | Hits:  99%/1791  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 08s | Avg:  5m 34s | Max:  5m 42s | Hits:  99%/3582  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 03s | Avg:  5m 31s | Max:  5m 41s | Hits:  99%/3582  
      🟩 GCC11              Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  5m 43s | Hits:  99%/3582  
      🟩 GCC12              Pass: 100%/2   | Total: 11m 46s | Avg:  5m 53s | Max:  6m 00s | Hits:  99%/3582  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 15m | Avg:  7m 30s | Max: 11m 44s | Hits:  99%/17910 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 40m 39s | Avg: 20m 19s | Max: 20m 40s | Hits:  99%/3568  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 05m | Avg: 21m 48s | Max: 25m 15s | Hits:  99%/5352  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 19s | Hits:  99%/3580  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 37m | Avg:  5m 44s | Max: 10m 18s | Hits: 100%/30430 
      🟩 GCC                Pass: 100%/21  | Total:  2h 16m | Avg:  6m 30s | Max: 11m 44s | Hits:  99%/37611 
      🟩 MSVC               Pass: 100%/5   | Total:  1h 46m | Avg: 21m 13s | Max: 25m 15s | Hits:  99%/8920  
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 21s | Avg: 15m 10s | Max: 15m 19s | Hits:  99%/3580  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 10m 36s | Hits:  99%/3582  
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 03m | Avg:  7m 23s | Max: 20m 40s | Hits:  99%/59066 
      🟩 rtx4090            Pass: 100%/10  | Total:  1h 51m | Avg: 11m 10s | Max: 25m 15s | Hits:  99%/17893 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  4h 46m | Avg:  7m 32s | Max: 20m 53s | Hits:  99%/68013 
      🟩 TestCPU            Pass: 100%/3   | Total: 40m 24s | Avg: 13m 28s | Max: 25m 15s | Hits:  99%/5365  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 50s | Avg: 10m 57s | Max: 11m 44s | Hits:  99%/7163  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max: 10m 36s | Hits:  99%/3582  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s | Hits:  99%/1791  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 44m | Avg:  8m 13s | Max: 20m 40s | Hits:  99%/35791 
      🟩 20                 Pass: 100%/23  | Total:  3h 09m | Avg:  8m 13s | Max: 25m 15s | Hits:  99%/41168 
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 6h 24m | Avg: 8m 56s | Max: 32m 19s | Hits: 92%/106284

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  6h 16m | Avg:  9m 11s | Max: 32m 19s | Hits:  92%/100471
      🟩 arm64              Pass: 100%/2   | Total:  7m 31s | Avg:  3m 45s | Max:  4m 01s | Hits:  99%/5813  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 35m 24s | Avg:  7m 04s | Max: 20m 11s | Hits:  98%/14160 
      🟩 12.6               Pass: 100%/2   | Total: 42m 13s | Avg: 21m 06s | Max: 32m 19s | Hits:  65%/5760  
      🟩 12.8               Pass: 100%/36  | Total:  5h 06m | Avg:  8m 31s | Max: 24m 08s | Hits:  93%/86364 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 45m 09s | Avg: 22m 34s | Max: 24m 08s | Hits:  27%/5774  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 35m 24s | Avg:  7m 04s | Max: 20m 11s | Hits:  98%/14160 
      🟩 nvcc12.6           Pass: 100%/2   | Total: 42m 13s | Avg: 21m 06s | Max: 32m 19s | Hits:  65%/5760  
      🟩 nvcc12.8           Pass: 100%/34  | Total:  4h 21m | Avg:  7m 41s | Max: 22m 34s | Hits:  97%/80590 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 45m 09s | Avg: 22m 34s | Max: 24m 08s | Hits:  27%/5774  
      🟩 nvcc               Pass: 100%/41  | Total:  5h 39m | Avg:  8m 16s | Max: 32m 19s | Hits:  96%/100510
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 18m 16s | Avg:  4m 34s | Max:  5m 39s | Hits:  98%/11516 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 15s | Avg:  5m 07s | Max:  5m 35s | Hits:  97%/5770  
      🟩 Clang16            Pass: 100%/2   | Total:  9m 33s | Avg:  4m 46s | Max:  4m 48s | Hits:  99%/5770  
      🟩 Clang17            Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  5m 43s | Hits:  97%/5770  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 07m | Avg: 11m 10s | Max: 24m 08s | Hits:  70%/14450 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 05s | Avg:  4m 02s | Max:  4m 45s | Hits:  97%/5708  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 50s | Avg:  4m 50s | Max:  4m 50s | Hits:  96%/2864  
      🟩 GCC9               Pass: 100%/2   | Total:  9m 18s | Avg:  4m 39s | Max:  5m 29s | Hits:  97%/5720  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 02s | Avg:  5m 01s | Max:  5m 04s | Hits:  96%/5776  
      🟩 GCC11              Pass: 100%/2   | Total: 10m 23s | Avg:  5m 11s | Max:  5m 23s | Hits:  96%/5772  
      🟩 GCC12              Pass: 100%/2   | Total:  9m 13s | Avg:  4m 36s | Max:  5m 25s | Hits:  97%/5772  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 31m | Avg:  9m 06s | Max: 20m 53s | Hits:  98%/14711 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 42m 45s | Avg: 21m 22s | Max: 22m 34s | Hits:  96%/5428  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 41m 27s | Avg: 20m 43s | Max: 21m 37s | Hits:  98%/5497  
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 42m 13s | Avg: 21m 06s | Max: 32m 19s | Hits:  65%/5760  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  1h 55m | Avg:  7m 11s | Max: 24m 08s | Hits:  89%/43276 
      🟩 GCC                Pass: 100%/21  | Total:  2h 22m | Avg:  6m 48s | Max: 20m 53s | Hits:  97%/46323 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 24m | Avg: 21m 03s | Max: 22m 34s | Hits:  97%/10925 
      🟩 NVHPC              Pass: 100%/2   | Total: 42m 13s | Avg: 21m 06s | Max: 32m 19s | Hits:  65%/5760  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 13m 00s | Hits:  99%/2996  
      🟩 rtx2080            Pass: 100%/41  | Total:  6h 07m | Avg:  8m 57s | Max: 32m 19s | Hits:  92%/103288
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 07m | Avg:  8m 18s | Max: 32m 19s | Hits:  92%/106244
      🟩 NVRTC              Pass: 100%/2   | Total: 38m 03s | Avg: 19m 01s | Max: 20m 53s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 37m 08s | Avg: 12m 22s | Max: 14m 58s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 38m 03s | Avg: 19m 01s | Max: 20m 53s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 13m 00s | Hits:  99%/2996  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 38s | Avg:  5m 38s | Max:  5m 38s | Hits:  96%/2996  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 21m | Avg:  9m 34s | Max: 32m 19s | Hits:  91%/56819 
      🟩 20                 Pass: 100%/21  | Total:  3h 01m | Avg:  8m 38s | Max: 24m 08s | Hits:  93%/49465 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 1h 52m | Avg: 5m 07s | Max: 12m 20s | Hits: 99%/11830

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 41m | Avg:  5m 39s | Max: 12m 20s | Hits:  99%/9494  
      🟩 arm64              Pass: 100%/4   | Total: 11m 01s | Avg:  2m 45s | Max:  2m 47s | Hits:  99%/2336  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  8m 58s | Avg:  8m 58s | Max:  8m 58s | Hits:  95%/281   
      🟩 12.6               Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 44s | Hits:  96%/752   
      🟩 12.8               Pass: 100%/19  | Total:  1h 32m | Avg:  4m 52s | Max: 12m 20s | Hits:  99%/10797 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  8m 58s | Avg:  8m 58s | Max:  8m 58s | Hits:  95%/281   
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 44s | Hits:  96%/752   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 32m | Avg:  4m 52s | Max: 12m 20s | Hits:  99%/10797 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  1h 52m | Avg:  5m 07s | Max: 12m 20s | Hits:  99%/11830 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits: 100%/586   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s | Hits: 100%/584   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s | Hits: 100%/584   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s | Hits: 100%/584   
      🟩 Clang18            Pass: 100%/4   | Total: 21m 12s | Avg:  5m 18s | Max: 12m 11s | Hits: 100%/2336  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s | Hits:  99%/586   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s | Hits:  99%/584   
      🟩 GCC12              Pass: 100%/2   | Total: 15m 47s | Avg:  7m 53s | Max: 12m 20s | Hits:  99%/1168  
      🟩 GCC13              Pass: 100%/6   | Total: 25m 29s | Avg:  4m 14s | Max: 11m 17s | Hits:  99%/3504  
      🟩 MSVC14.39          Pass: 100%/1   | Total:  8m 58s | Avg:  8m 58s | Max:  8m 58s | Hits:  95%/281   
      🟩 MSVC14.42          Pass: 100%/1   | Total:  9m 47s | Avg:  9m 47s | Max:  9m 47s | Hits:  95%/281   
      🟩 NVHPC25.1          Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 44s | Hits:  96%/752   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 34m 59s | Avg:  4m 22s | Max: 12m 11s | Hits: 100%/4674  
      🟩 GCC                Pass: 100%/10  | Total: 47m 50s | Avg:  4m 47s | Max: 12m 20s | Hits:  99%/5842  
      🟩 MSVC               Pass: 100%/2   | Total: 18m 45s | Avg:  9m 22s | Max:  9m 47s | Hits:  95%/562   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 44s | Hits:  96%/752   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 14m 13s | Avg:  7m 06s | Max: 11m 17s | Hits:  99%/1168  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 38m | Avg:  4m 56s | Max: 12m 20s | Hits:  99%/10662 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 17m | Avg:  4m 03s | Max:  9m 47s | Hits:  99%/10078 
      🟩 Test               Pass: 100%/3   | Total: 35m 48s | Avg: 11m 56s | Max: 12m 20s | Hits:  99%/1752  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 17m 07s | Avg:  5m 42s | Max: 11m 17s | Hits:  99%/1752  
      🟩 90a                Pass: 100%/1   | Total:  2m 51s | Avg:  2m 51s | Max:  2m 51s | Hits:  99%/584   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 14m 01s | Avg:  3m 30s | Max:  5m 37s | Hits:  99%/2128  
      🟩 20                 Pass: 100%/18  | Total:  1h 38m | Avg:  5m 29s | Max: 12m 20s | Hits:  99%/9702  
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 37s | Avg: 4m 24s | Max: 5m 05s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 05s | Avg:  5m 02s | Max:  5m 05s
      🟩 arm64              Pass: 100%/2   | Total:  7m 32s | Avg:  3m 46s | Max:  3m 49s
    🟩 ctk
      🟩 12.6               Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 05s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 05s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 05s
    🟩 cxx
      🟩 NVHPC25.1          Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 05s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 05s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 05s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 37s | Avg:  4m 24s | Max:  5m 05s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 48s | Avg:  4m 24s | Max:  5m 05s
      🟩 20                 Pass: 100%/2   | Total:  8m 49s | Avg:  4m 24s | Max:  5m 00s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 51s | Avg: 8m 25s | Max: 14m 39s | Hits: 98%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 14m 39s | Hits:  98%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 14m 39s | Hits:  98%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 14m 39s | Hits:  98%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 14m 39s | Hits:  98%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 14m 39s | Hits:  98%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 14m 39s | Hits:  98%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 51s | Avg:  8m 25s | Max: 14m 39s | Hits:  98%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s | Hits:  98%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 39s | Avg: 14m 39s | Max: 14m 39s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 08m | Avg: 1h 08m | Max: 1h 08m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 162)

# Runner
113 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco miscco merged commit a89f1eb into NVIDIA:main Mar 18, 2025
175 of 177 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 18, 2025
pciolkosz pushed a commit to pciolkosz/cccl that referenced this pull request Mar 18, 2025
* implement `cuda::overflow_cast`
davebayer added a commit to davebayer/cccl that referenced this pull request Apr 7, 2025
* implement `cuda::overflow_cast`
@fbusato
Copy link
Contributor

fbusato commented Jul 15, 2025

cuda::overflow_cast also misses the documentation

@davebayer
Copy link
Contributor Author

cuda::overflow_cast also misses the documentation

The docs will be a part of #5270 :)

@davebayer davebayer deleted the overflow_cast branch February 17, 2026 16:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

4 participants