Revert Thrust find_if_not implementation to please nvc++ #3901

Merged
bernhardmgruber merged 1 commit into NVIDIA:main from bernhardmgruber:fix_nvhpc2 on Feb 22, 2025

Conversation

@bernhardmgruber
Contributor

Fixes: #3594

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner February 21, 2025 17:52
@github-actions
Contributor

🟨 CI finished in 1h 16m: Pass: 7%/93 | Total: 1d 05h | Avg: 18m 59s | Max: 1h 14m | Hits: 79%/5958
  • 🟨 cub: Pass: 4%/45 | Total: 22h 47m | Avg: 30m 22s | Max: 1h 14m | Hits: 75%/2092

    🚨 cudacxx_family: nvcc 🚨
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 36s | Max: 58m 38s | Hits:  75%/2092  
      🔥 nvcc               Pass:   0%/43  | Total: 20h 49m | Avg: 29m 03s | Max:  1h 14m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 57m | Avg: 58m 36s | Max: 58m 38s | Hits:  75%/2092  
      🟥 nvcc12.0           Pass:   0%/5   | Total:  3h 01m | Avg: 36m 19s | Max: 52m 56s
      🟥 nvcc12.5           Pass:   0%/2   | Total:  1h 09m | Avg: 34m 40s | Max: 35m 46s
      🟥 nvcc12.8           Pass:   0%/36  | Total: 16h 38m | Avg: 27m 44s | Max:  1h 14m
    🟨 cpu
      🟨 amd64              Pass:   4%/43  | Total: 21h 25m | Avg: 29m 54s | Max:  1h 14m | Hits:  75%/2092  
      🟥 arm64              Pass:   0%/2   | Total:  1h 21m | Avg: 40m 31s | Max: 40m 33s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  3h 01m | Avg: 36m 19s | Max: 52m 56s
      🟥 12.5               Pass:   0%/2   | Total:  1h 09m | Avg: 34m 40s | Max: 35m 46s
      🟨 12.8               Pass:   5%/38  | Total: 18h 36m | Avg: 29m 22s | Max:  1h 14m | Hits:  75%/2092  
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total:  2h 04m | Avg: 31m 13s | Max: 33m 23s
      🟥 Clang15            Pass:   0%/2   | Total:  1h 01m | Avg: 30m 39s | Max: 31m 07s
      🟥 Clang16            Pass:   0%/2   | Total:  1h 00m | Avg: 30m 28s | Max: 30m 34s
      🟥 Clang17            Pass:   0%/2   | Total:  1h 02m | Avg: 31m 20s | Max: 32m 23s
      🟨 Clang18            Pass:  28%/7   | Total:  3h 38m | Avg: 31m 17s | Max: 58m 38s | Hits:  75%/2092  
      🟥 GCC7               Pass:   0%/2   | Total:  1h 02m | Avg: 31m 22s | Max: 31m 41s
      🟥 GCC8               Pass:   0%/1   | Total: 33m 13s | Avg: 33m 13s | Max: 33m 13s
      🟥 GCC9               Pass:   0%/2   | Total:  1h 02m | Avg: 31m 02s | Max: 31m 53s
      🟥 GCC10              Pass:   0%/2   | Total:  1h 01m | Avg: 30m 49s | Max: 31m 45s
      🟥 GCC11              Pass:   0%/2   | Total:  1h 00m | Avg: 30m 23s | Max: 30m 37s
      🟥 GCC12              Pass:   0%/2   | Total:  1h 00m | Avg: 30m 23s | Max: 30m 29s
      🟥 GCC13              Pass:   0%/11  | Total:  2h 40m | Avg: 14m 35s | Max: 44m 18s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 10m
      🟥 MSVC14.42          Pass:   0%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 14m
      🟥 NVHPC24.7          Pass:   0%/2   | Total:  1h 09m | Avg: 34m 40s | Max: 35m 46s
    🟨 cxx_family
      🟨 Clang              Pass:  11%/17  | Total:  8h 48m | Avg: 31m 06s | Max: 58m 38s | Hits:  75%/2092  
      🟥 GCC                Pass:   0%/22  | Total:  8h 21m | Avg: 22m 48s | Max: 44m 18s
      🟥 MSVC               Pass:   0%/4   | Total:  4h 27m | Avg:  1h 06m | Max:  1h 14m
      🟥 NVHPC              Pass:   0%/2   | Total:  1h 09m | Avg: 34m 40s | Max: 35m 46s
    🟨 gpu
      🟥 h100               Pass:   0%/3   | Total: 11m 56s | Avg:  3m 58s | Max: 11m 56s
      🟨 rtx2080            Pass:   5%/34  | Total: 21h 32m | Avg: 38m 00s | Max:  1h 14m | Hits:  75%/2092  
      🟥 rtxa6000           Pass:   0%/8   | Total:  1h 02m | Avg:  7m 49s | Max: 32m 55s
    🟨 jobs
      🟨 Build              Pass:   5%/37  | Total: 22h 47m | Avg: 36m 56s | Max:  1h 14m | Hits:  75%/2092  
      🟥 DeviceLaunch       Pass:   0%/1  
      🟥 GraphCapture       Pass:   0%/1  
      🟥 HostLaunch         Pass:   0%/3  
      🟥 TestGPU            Pass:   0%/3  
    🟥 sm
      🟥 90                 Pass:   0%/3   | Total: 11m 56s | Avg:  3m 58s | Max: 11m 56s
      🟥 90;90a;100         Pass:   0%/1   | Total: 44m 18s | Avg: 44m 18s | Max: 44m 18s
    🟨 std
      🟨 17                 Pass:   5%/20  | Total: 12h 35m | Avg: 37m 47s | Max:  1h 14m | Hits:  75%/1046  
      🟨 20                 Pass:   4%/25  | Total: 10h 11m | Avg: 24m 27s | Max:  1h 09m | Hits:  75%/1046  
    
  • 🟨 thrust: Pass: 4%/45 | Total: 5h 37m | Avg: 7m 29s | Max: 51m 53s | Hits: 80%/3562

    🚨 cudacxx_family: nvcc 🚨
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 18s | Avg: 23m 09s | Max: 24m 23s | Hits:  80%/3562  
      🔥 nvcc               Pass:   0%/43  | Total:  4h 50m | Avg:  6m 45s | Max: 51m 53s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 46m 18s | Avg: 23m 09s | Max: 24m 23s | Hits:  80%/3562  
      🟥 nvcc12.0           Pass:   0%/5   | Total: 55m 19s | Avg: 11m 03s | Max: 43m 35s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 40s
      🟥 nvcc12.8           Pass:   0%/36  | Total:  3h 44m | Avg:  6m 13s | Max: 51m 53s
    🟥 cmake_options
      🟥 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:   0%/2   | Total:  3m 09s | Avg:  1m 34s | Max:  3m 09s
    🟨 cpu
      🟨 amd64              Pass:   4%/43  | Total:  5h 31m | Avg:  7m 42s | Max: 51m 53s | Hits:  80%/3562  
      🟥 arm64              Pass:   0%/2   | Total:  5m 30s | Avg:  2m 45s | Max:  2m 46s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 55m 19s | Avg: 11m 03s | Max: 43m 35s
      🟥 12.5               Pass:   0%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 40s
      🟨 12.8               Pass:   5%/38  | Total:  4h 30m | Avg:  7m 07s | Max: 51m 53s | Hits:  80%/3562  
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 12m 17s | Avg:  3m 04s | Max:  3m 10s
      🟥 Clang15            Pass:   0%/2   | Total:  6m 00s | Avg:  3m 00s | Max:  3m 03s
      🟥 Clang16            Pass:   0%/2   | Total:  6m 25s | Avg:  3m 12s | Max:  3m 13s
      🟥 Clang17            Pass:   0%/2   | Total:  6m 18s | Avg:  3m 09s | Max:  3m 14s
      🟨 Clang18            Pass:  28%/7   | Total: 55m 23s | Avg:  7m 54s | Max: 24m 23s | Hits:  80%/3562  
      🟥 GCC7               Pass:   0%/2   | Total:  5m 57s | Avg:  2m 58s | Max:  3m 09s
      🟥 GCC8               Pass:   0%/1   | Total:  2m 55s | Avg:  2m 55s | Max:  2m 55s
      🟥 GCC9               Pass:   0%/2   | Total:  5m 54s | Avg:  2m 57s | Max:  3m 00s
      🟥 GCC10              Pass:   0%/2   | Total:  6m 23s | Avg:  3m 11s | Max:  3m 17s
      🟥 GCC11              Pass:   0%/2   | Total:  5m 57s | Avg:  2m 58s | Max:  3m 01s
      🟥 GCC12              Pass:   0%/2   | Total:  5m 59s | Avg:  2m 59s | Max:  3m 08s
      🟥 GCC13              Pass:   0%/10  | Total: 18m 34s | Avg:  1m 51s | Max:  3m 31s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  1h 30m | Avg: 45m 02s | Max: 46m 30s
      🟥 MSVC14.42          Pass:   0%/3   | Total:  1h 37m | Avg: 32m 37s | Max: 51m 53s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 40s
    🟨 cxx_family
      🟨 Clang              Pass:  11%/17  | Total:  1h 26m | Avg:  5m 04s | Max: 24m 23s | Hits:  80%/3562  
      🟥 GCC                Pass:   0%/21  | Total: 51m 39s | Avg:  2m 27s | Max:  3m 31s
      🟥 MSVC               Pass:   0%/5   | Total:  3h 07m | Avg: 37m 35s | Max: 51m 53s
      🟥 NVHPC              Pass:   0%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 40s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  2m 44s | Avg:  1m 22s | Max:  2m 44s
      🟨 rtx2080            Pass:   6%/33  | Total:  4h 33m | Avg:  8m 16s | Max: 46m 30s | Hits:  80%/3562  
      🟥 rtx4090            Pass:   0%/10  | Total:  1h 01m | Avg:  6m 08s | Max: 51m 53s
    🟨 jobs
      🟨 Build              Pass:   5%/38  | Total:  5h 37m | Avg:  8m 52s | Max: 51m 53s | Hits:  80%/3562  
      🟥 TestCPU            Pass:   0%/3  
      🟥 TestGPU            Pass:   0%/4  
    🟥 sm
      🟥 90                 Pass:   0%/2   | Total:  2m 44s | Avg:  1m 22s | Max:  2m 44s
      🟥 90;90a;100         Pass:   0%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s
    🟨 std
      🟨 17                 Pass:   5%/20  | Total:  3h 29m | Avg: 10m 27s | Max: 46m 30s | Hits:  80%/1781  
      🟨 20                 Pass:   4%/23  | Total:  2h 04m | Avg:  5m 25s | Max: 51m 53s | Hits:  80%/1781  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits: 98%/304

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits:  98%/304   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits:  98%/304   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits:  98%/304   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits:  98%/304   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits:  98%/304   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits:  98%/304   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 19m 04s | Hits:  98%/304   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s | Hits:  98%/152   
      🟩 Test               Pass: 100%/1   | Total: 19m 04s | Avg: 19m 04s | Max: 19m 04s | Hits:  98%/152   
    
  • 🟩 python: Pass: 100%/1 | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@github-actions
Contributor

🟩 CI finished in 1h 25m: Pass: 100%/93 | Total: 2d 13h | Avg: 39m 26s | Max: 1h 23m | Hits: 76%/133745
  • 🟩 cub: Pass: 100%/45 | Total: 1d 15h | Avg: 52m 51s | Max: 1h 23m | Hits: 70%/53305

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 13h | Avg: 52m 32s | Max:  1h 23m | Hits:  71%/50883 
      🟩 arm64              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 36s | Max: 59m 51s | Hits:  69%/2422  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 53m | Avg: 58m 40s | Max:  1h 05m | Hits:  59%/5888  
      🟩 12.5               Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 23m | Hits:  69%/2240  
      🟩 12.8               Pass: 100%/38  | Total:  1d 08h | Avg: 50m 55s | Max:  1h 19m | Hits:  72%/45177 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  75%/2092  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 53m | Avg: 58m 40s | Max:  1h 05m | Hits:  59%/5888  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 23m | Hits:  69%/2240  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 06h | Avg: 50m 24s | Max:  1h 19m | Hits:  72%/43085 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  75%/2092  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 13h | Avg: 52m 31s | Max:  1h 23m | Hits:  70%/51213 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 41m | Avg: 55m 23s | Max: 58m 03s | Hits:  69%/4852  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  69%/2422  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 50m | Avg: 55m 28s | Max: 55m 52s | Hits:  69%/2422  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 54m | Avg: 57m 25s | Max: 59m 48s | Hits:  69%/2422  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 31m | Avg: 47m 19s | Max:  1h 00m | Hits:  80%/8147  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  69%/2426  
      🟩 GCC8               Pass: 100%/1   | Total: 54m 17s | Avg: 54m 17s | Max: 54m 17s | Hits:  69%/1213  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 51m | Avg: 55m 42s | Max: 55m 44s | Hits:  69%/2426  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  69%/2426  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 54m | Avg: 57m 07s | Max: 57m 15s | Hits:  69%/2422  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 56s | Max: 57m 18s | Hits:  69%/2422  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 35m | Avg: 35m 59s | Max:  1h 07m | Hits:  85%/13321 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 19m | Hits:  14%/2072  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 18m | Hits:  14%/2072  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 23m | Hits:  69%/2240  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 14h 58m | Avg: 52m 52s | Max:  1h 00m | Hits:  73%/20265 
      🟩 GCC                Pass: 100%/22  | Total: 17h 15m | Avg: 47m 04s | Max:  1h 07m | Hits:  77%/26656 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 53m | Avg:  1h 13m | Max:  1h 19m | Hits:  14%/4144  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 23m | Hits:  69%/2240  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 09m | Avg: 23m 14s | Max: 24m 52s | Hits:  89%/3633  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 10h | Avg:  1h 00m | Max:  1h 23m | Hits:  64%/39984 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 59m | Avg: 29m 59s | Max: 56m 17s | Hits:  92%/9688  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 12h | Avg: 59m 34s | Max:  1h 23m | Hits:  64%/43617 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 43s | Avg: 23m 43s | Max: 23m 43s | Hits:  99%/1211  
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 39s | Avg: 18m 39s | Max: 18m 39s | Hits:  99%/1211  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 59s | Max: 23m 22s | Hits:  99%/3633  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 02m | Avg: 20m 56s | Max: 21m 57s | Hits:  99%/3633  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 09m | Avg: 23m 14s | Max: 24m 52s | Hits:  89%/3633  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:  69%/1211  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 19h 59m | Avg: 59m 59s | Max:  1h 19m | Hits:  62%/23455 
      🟩 20                 Pass: 100%/25  | Total: 19h 39m | Avg: 47m 09s | Max:  1h 23m | Hits:  77%/29850 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 20h 35m | Avg: 27m 27s | Max: 50m 38s | Hits: 80%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 33m 50s | Avg: 16m 55s | Max: 22m 41s | Hits:  90%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 19h 46m | Avg: 27m 34s | Max: 50m 38s | Hits:  80%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 49m 42s | Avg: 24m 51s | Max: 26m 18s | Hits:  80%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 39m | Avg: 31m 54s | Max: 50m 30s | Hits:  75%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  1h 31m | Avg: 45m 40s | Max: 47m 08s | Hits:  76%/3562  
      🟩 12.8               Pass: 100%/38  | Total: 16h 24m | Avg: 25m 55s | Max: 50m 38s | Hits:  81%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 44m 13s | Avg: 22m 06s | Max: 22m 38s | Hits:  80%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 39m | Avg: 31m 54s | Max: 50m 30s | Hits:  75%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 31m | Avg: 45m 40s | Max: 47m 08s | Hits:  76%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 15h 40m | Avg: 26m 07s | Max: 50m 38s | Hits:  81%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 44m 13s | Avg: 22m 06s | Max: 22m 38s | Hits:  80%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 19h 51m | Avg: 27m 42s | Max: 50m 38s | Hits:  80%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 16s | Max: 28m 40s | Hits:  80%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 57m 11s | Avg: 28m 35s | Max: 28m 39s | Hits:  80%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 54m 00s | Avg: 27m 00s | Max: 27m 39s | Hits:  80%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 54m 25s | Avg: 27m 12s | Max: 27m 40s | Hits:  80%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 19m | Avg: 19m 59s | Max: 28m 32s | Hits:  85%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 55m 46s | Avg: 27m 53s | Max: 29m 11s | Hits:  80%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 27m 51s | Avg: 27m 51s | Max: 27m 51s | Hits:  80%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 56m 06s | Avg: 28m 03s | Max: 28m 35s | Hits:  80%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 56m 23s | Avg: 28m 11s | Max: 28m 51s | Hits:  80%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 57m 34s | Avg: 28m 47s | Max: 29m 47s | Hits:  80%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 57m 47s | Avg: 28m 53s | Max: 30m 14s | Hits:  80%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 09m | Avg: 18m 58s | Max: 28m 27s | Hits:  88%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 41m | Avg: 50m 34s | Max: 50m 38s | Hits:  55%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 07m | Avg: 42m 29s | Max: 50m 15s | Hits:  60%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 31m | Avg: 45m 40s | Max: 47m 08s | Hits:  76%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  6h 54m | Avg: 24m 23s | Max: 28m 40s | Hits:  82%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  8h 21m | Avg: 23m 51s | Max: 30m 14s | Hits:  83%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 48m | Avg: 45m 43s | Max: 50m 38s | Hits:  58%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 31m | Avg: 45m 40s | Max: 47m 08s | Hits:  76%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 07s | Avg: 13m 33s | Max: 16m 16s | Hits:  90%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total: 16h 43m | Avg: 30m 24s | Max: 50m 38s | Hits:  77%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 25m | Avg: 20m 31s | Max: 50m 15s | Hits:  86%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 19h 07m | Avg: 30m 11s | Max: 50m 38s | Hits:  77%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 44m 48s | Avg: 14m 56s | Max: 29m 39s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 43m 45s | Avg: 10m 56s | Max: 11m 25s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 07s | Avg: 13m 33s | Max: 16m 16s | Hits:  90%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 28m 27s | Avg: 28m 27s | Max: 28m 27s | Hits:  80%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 10h 35m | Avg: 31m 47s | Max: 50m 38s | Hits:  76%/35611 
      🟩 20                 Pass: 100%/23  | Total:  9h 26m | Avg: 24m 36s | Max: 50m 15s | Hits:  82%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 14m 56s | Avg: 7m 28s | Max: 12m 40s | Hits: 98%/304

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 12m 40s | Hits:  98%/304   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 12m 40s | Hits:  98%/304   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 12m 40s | Hits:  98%/304   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 12m 40s | Hits:  98%/304   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 12m 40s | Hits:  98%/304   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 12m 40s | Hits:  98%/304   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 12m 40s | Hits:  98%/304   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 16s | Avg:  2m 16s | Max:  2m 16s | Hits:  98%/152   
      🟩 Test               Pass: 100%/1   | Total: 12m 40s | Avg: 12m 40s | Max: 12m 40s | Hits:  98%/152   
    
  • 🟩 python: Pass: 100%/1 | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 38m 45s | Avg: 38m 45s | Max: 38m 45s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber merged commit 90d3aa1 into NVIDIA:main Feb 22, 2025
104 of 107 checks passed
@bernhardmgruber bernhardmgruber deleted the fix_nvhpc2 branch February 23, 2025 21:59
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Mar 5, 2025
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025

Development

Successfully merging this pull request may close these issues.

[BUG]: A few parallel algorithms have started hanging at runtime after commit 35df3a
