
Drop deprecated warp/block algo specializations#4007

Merged
bernhardmgruber merged 1 commit into NVIDIA:main from bernhardmgruber:drop_warp_block_spec
Mar 4, 2025

Conversation

@bernhardmgruber
Contributor

No description provided.

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner March 4, 2025 12:48
@bernhardmgruber bernhardmgruber requested a review from fbusato March 4, 2025 12:49
@github-project-automation github-project-automation bot moved this to Todo in CCCL Mar 4, 2025
@bernhardmgruber bernhardmgruber added the cub (For all items related to CUB) and breaking (Breaking change) labels Mar 4, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Mar 4, 2025
@github-actions
Contributor

github-actions bot commented Mar 4, 2025

🟩 CI finished in 2h 46m: Pass: 100%/93 | Total: 2d 21h | Avg: 44m 46s | Max: 1h 45m | Hits: 65%/133878
  • 🟩 cub: Pass: 100%/45 | Total: 1d 22h | Avg: 1h 02m | Max: 1h 45m | Hits: 46%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 20h | Avg:  1h 02m | Max:  1h 45m | Hits:  46%/51178 
      🟩 arm64              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m | Hits:  41%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 11m | Avg:  1h 02m | Max:  1h 07m | Hits:  36%/5922  
      🟩 12.5               Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 12m | Hits:  17%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  1d 15h | Avg:  1h 02m | Max:  1h 45m | Hits:  49%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 08m | Hits:  43%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 11m | Avg:  1h 02m | Max:  1h 07m | Hits:  36%/5922  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 12m | Hits:  17%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 13h | Avg:  1h 01m | Max:  1h 45m | Hits:  49%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 08m | Hits:  43%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 20h | Avg:  1h 02m | Max:  1h 45m | Hits:  46%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 01m | Avg:  1h 00m | Max:  1h 07m | Hits:  41%/4880  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 32s | Max: 59m 36s | Hits:  41%/2436  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m | Hits:  41%/2436  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 56m | Avg: 58m 21s | Max: 59m 15s | Hits:  41%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 55m | Avg: 50m 46s | Max:  1h 08m | Hits:  59%/8194  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m | Hits:  41%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:  41%/1220  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  41%/2440  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 48s | Max:  1h 00m | Hits:  41%/2440  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 02m | Hits:  41%/2436  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  41%/2436  
      🟩 GCC13              Pass: 100%/11  | Total: 12h 19m | Avg:  1h 07m | Max:  1h 45m | Hits:  63%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 14m | Hits:  12%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 32m | Avg:  1h 16m | Max:  1h 16m | Hits:  12%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 12m | Hits:  17%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 52m | Avg: 56m 02s | Max:  1h 08m | Hits:  48%/20382 
      🟩 GCC                Pass: 100%/22  | Total: 23h 40m | Avg:  1h 04m | Max:  1h 45m | Hits:  52%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 54m | Avg:  1h 13m | Max:  1h 16m | Hits:  12%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 12m | Hits:  17%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 59s | Max: 25m 24s | Hits:  80%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 03m | Max:  1h 16m | Hits:  37%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  9h 38m | Avg:  1h 12m | Max:  1h 45m | Hits:  72%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 14h | Avg:  1h 02m | Max:  1h 16m | Hits:  37%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total:  1h 44m | Avg:  1h 44m | Max:  1h 44m | Hits:  79%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total:  1h 37m | Avg:  1h 37m | Max:  1h 37m | Hits:  69%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  2h 33m | Avg: 51m 02s | Max:  1h 45m | Hits:  92%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  2h 27m | Avg: 49m 05s | Max:  1h 42m | Hits:  89%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 11m | Avg: 23m 59s | Max: 25m 24s | Hits:  80%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m | Hits:  41%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 27m | Avg:  1h 04m | Max:  1h 16m | Hits:  36%/23591 
      🟩 20                 Pass: 100%/25  | Total:  1d 01h | Avg:  1h 01m | Max:  1h 45m | Hits:  54%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 21h 14m | Avg: 28m 18s | Max: 54m 37s | Hits: 78%/79956

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 36m 22s | Avg: 18m 11s | Max: 25m 56s | Hits:  89%/3556  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 20h 22m | Avg: 28m 25s | Max: 54m 37s | Hits:  78%/76401 
      🟩 arm64              Pass: 100%/2   | Total: 51m 44s | Avg: 25m 52s | Max: 27m 32s | Hits:  79%/3555  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 43m | Avg: 32m 46s | Max: 48m 09s | Hits:  74%/8881  
      🟩 12.5               Pass: 100%/2   | Total:  1h 43m | Avg: 51m 57s | Max: 52m 39s | Hits:  65%/3554  
      🟩 12.8               Pass: 100%/38  | Total: 16h 46m | Avg: 26m 28s | Max: 54m 37s | Hits:  80%/67521 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 48m 47s | Avg: 24m 23s | Max: 26m 02s | Hits:  79%/3554  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 43m | Avg: 32m 46s | Max: 48m 09s | Hits:  74%/8881  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 43m | Avg: 51m 57s | Max: 52m 39s | Hits:  65%/3554  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 15h 57m | Avg: 26m 35s | Max: 54m 37s | Hits:  80%/63967 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 48m 47s | Avg: 24m 23s | Max: 26m 02s | Hits:  79%/3554  
      🟩 nvcc               Pass: 100%/43  | Total: 20h 25m | Avg: 28m 29s | Max: 54m 37s | Hits:  78%/76402 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 52m | Avg: 28m 14s | Max: 29m 14s | Hits:  79%/7108  
      🟩 Clang15            Pass: 100%/2   | Total: 53m 43s | Avg: 26m 51s | Max: 27m 23s | Hits:  79%/3554  
      🟩 Clang16            Pass: 100%/2   | Total: 58m 57s | Avg: 29m 28s | Max: 30m 26s | Hits:  79%/3554  
      🟩 Clang17            Pass: 100%/2   | Total: 54m 46s | Avg: 27m 23s | Max: 28m 47s | Hits:  79%/3554  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 23m | Avg: 20m 26s | Max: 26m 51s | Hits:  85%/12439 
      🟩 GCC7               Pass: 100%/2   | Total: 54m 31s | Avg: 27m 15s | Max: 27m 30s | Hits:  79%/3556  
      🟩 GCC8               Pass: 100%/1   | Total: 30m 25s | Avg: 30m 25s | Max: 30m 25s | Hits:  79%/1778  
      🟩 GCC9               Pass: 100%/2   | Total: 59m 46s | Avg: 29m 53s | Max: 31m 14s | Hits:  79%/3556  
      🟩 GCC10              Pass: 100%/2   | Total: 56m 29s | Avg: 28m 14s | Max: 29m 44s | Hits:  79%/3556  
      🟩 GCC11              Pass: 100%/2   | Total: 56m 33s | Avg: 28m 16s | Max: 29m 33s | Hits:  79%/3556  
      🟩 GCC12              Pass: 100%/2   | Total: 59m 13s | Avg: 29m 36s | Max: 31m 08s | Hits:  79%/3556  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 16m | Avg: 19m 39s | Max: 29m 27s | Hits:  87%/17780 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 40m | Avg: 50m 17s | Max: 52m 25s | Hits:  55%/3542  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 12m | Avg: 44m 14s | Max: 54m 37s | Hits:  60%/5313  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 43m | Avg: 51m 57s | Max: 52m 39s | Hits:  65%/3554  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 03m | Avg: 24m 54s | Max: 30m 26s | Hits:  81%/30209 
      🟩 GCC                Pass: 100%/21  | Total:  8h 33m | Avg: 24m 27s | Max: 31m 14s | Hits:  82%/37338 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 53m | Avg: 46m 39s | Max: 54m 37s | Hits:  58%/8855  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 43m | Avg: 51m 57s | Max: 52m 39s | Hits:  65%/3554  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 29s | Avg: 13m 44s | Max: 16m 00s | Hits:  89%/3556  
      🟩 rtx2080            Pass: 100%/33  | Total: 17h 15m | Avg: 31m 22s | Max: 52m 39s | Hits:  76%/58637 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 31m | Avg: 21m 06s | Max: 54m 37s | Hits:  86%/17763 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 19h 46m | Avg: 31m 13s | Max: 54m 37s | Hits:  75%/67519 
      🟩 TestCPU            Pass: 100%/3   | Total: 44m 49s | Avg: 14m 56s | Max: 29m 44s | Hits:  90%/5326  
      🟩 TestGPU            Pass: 100%/4   | Total: 42m 37s | Avg: 10m 39s | Max: 11m 29s | Hits:  99%/7111  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 29s | Avg: 13m 44s | Max: 16m 00s | Hits:  89%/3556  
      🟩 90;90a;100         Pass: 100%/1   | Total: 29m 27s | Avg: 29m 27s | Max: 29m 27s | Hits:  79%/1778  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 10h 57m | Avg: 32m 52s | Max: 52m 25s | Hits:  74%/35531 
      🟩 20                 Pass: 100%/23  | Total:  9h 40m | Avg: 25m 13s | Max: 54m 37s | Hits:  81%/40869 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 34s | Avg: 7m 47s | Max: 12m 59s | Hits: 97%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max: 12m 59s | Hits:  97%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max: 12m 59s | Hits:  97%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max: 12m 59s | Hits:  97%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max: 12m 59s | Hits:  97%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max: 12m 59s | Hits:  97%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max: 12m 59s | Hits:  97%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max: 12m 59s | Hits:  97%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 35s | Avg:  2m 35s | Max:  2m 35s | Hits:  96%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 59s | Avg: 12m 59s | Max: 12m 59s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 01m | Avg: 1h 01m | Max: 1h 01m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber merged commit 7513841 into NVIDIA:main Mar 4, 2025
108 of 110 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 4, 2025
@bernhardmgruber bernhardmgruber deleted the drop_warp_block_spec branch March 4, 2025 17:12
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025

Labels

breaking: Breaking change
cub: For all items related to CUB

Projects

Archived in project

3 participants