Implement more of cmath #3963

Merged
miscco merged 10 commits into NVIDIA:main from miscco:trigonometric_math
Mar 4, 2025

@miscco
Contributor

@miscco miscco commented Feb 28, 2025

This properly implements more of the functions in <cuda/std/cmath>.

I did not port more functions because those conflict with the ongoing work on extended floating-point types.

@miscco miscco requested a review from a team as a code owner February 28, 2025 14:05
@miscco miscco requested a review from griwes February 28, 2025 14:05
@github-project-automation github-project-automation bot moved this to Todo in CCCL Feb 28, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Feb 28, 2025
@miscco miscco added the feature request and libcu++ labels Feb 28, 2025
@miscco miscco force-pushed the trigonometric_math branch from b5b6f6f to 929e6a0 Compare February 28, 2025 15:17
@miscco miscco force-pushed the trigonometric_math branch from 929e6a0 to a2b042e Compare February 28, 2025 16:23
@github-actions
Contributor

🟨 CI finished in 1h 43m: Pass: 84%/158 | Total: 1d 11h | Avg: 13m 38s | Max: 1h 19m | Hits: 83%/206360
  • 🟨 thrust: Pass: 53%/45 | Total: 11h 16m | Avg: 15m 01s | Max: 1h 10m | Hits: 59%/42714

    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 19m 35s | Avg:  9m 47s | Max: 10m 10s | Hits:  65%/3562  
      🔍 nvcc               Pass:  51%/43  | Total: 10h 56m | Avg: 15m 16s | Max:  1h 10m | Hits:  58%/39152 
    🚨 cxx_family: GCC 🚨
      🟩 Clang              Pass: 100%/17  | Total:  2h 45m | Avg:  9m 45s | Max: 10m 59s | Hits:  73%/30277 
      🔥 GCC                Pass:   0%/21  | Total:  2h 03m | Avg:  5m 52s | Max: 22m 11s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 11m | Avg: 50m 18s | Max:  1h 01m | Hits:  34%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m | Hits:   0%/3562  
    🟨 ctk
      🟨 12.0               Pass:  60%/5   | Total:  1h 23m | Avg: 16m 44s | Max: 51m 11s | Hits:  50%/5337  
      🟩 12.5               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m | Hits:   0%/3562  
      🟨 12.8               Pass:  50%/38  | Total:  7h 36m | Avg: 12m 01s | Max:  1h 01m | Hits:  66%/33815 
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 19m 35s | Avg:  9m 47s | Max: 10m 10s | Hits:  65%/3562  
      🟨 nvcc12.0           Pass:  60%/5   | Total:  1h 23m | Avg: 16m 44s | Max: 51m 11s | Hits:  50%/5337  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m | Hits:   0%/3562  
      🟨 nvcc12.8           Pass:  47%/36  | Total:  7h 17m | Avg: 12m 08s | Max:  1h 01m | Hits:  67%/30253 
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 39m 28s | Avg:  9m 52s | Max: 10m 34s | Hits:  76%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 20m 16s | Avg: 10m 08s | Max: 10m 59s | Hits:  65%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 20m 09s | Avg: 10m 04s | Max: 10m 21s | Hits:  65%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 20m 09s | Avg: 10m 04s | Max: 10m 45s | Hits:  65%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 05m | Avg:  9m 23s | Max: 10m 18s | Hits:  78%/12467 
      🟥 GCC7               Pass:   0%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  5m 53s
      🟥 GCC8               Pass:   0%/1   | Total:  6m 07s | Avg:  6m 07s | Max:  6m 07s
      🟥 GCC9               Pass:   0%/2   | Total: 12m 58s | Avg:  6m 29s | Max:  6m 41s
      🟥 GCC10              Pass:   0%/2   | Total: 12m 38s | Avg:  6m 19s | Max:  6m 26s
      🟥 GCC11              Pass:   0%/2   | Total: 13m 14s | Avg:  6m 37s | Max:  6m 43s
      🟥 GCC12              Pass:   0%/2   | Total: 13m 57s | Avg:  6m 58s | Max:  7m 03s
      🟥 GCC13              Pass:   0%/10  | Total: 52m 45s | Avg:  5m 16s | Max: 22m 11s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 42m | Avg: 51m 05s | Max: 51m 11s | Hits:  32%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 29m | Avg: 49m 48s | Max:  1h 01m | Hits:  35%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m | Hits:   0%/3562  
    🟥 cmake_options
      🟥 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:   0%/2   | Total: 22m 11s | Avg: 11m 05s | Max: 22m 11s
    🟨 cpu
      🟨 amd64              Pass:  53%/43  | Total: 11h 01m | Avg: 15m 23s | Max:  1h 10m | Hits:  59%/40933 
      🟨 arm64              Pass:  50%/2   | Total: 14m 30s | Avg:  7m 15s | Max:  8m 30s | Hits:  65%/1781  
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  6m 03s | Avg:  3m 01s | Max:  6m 03s
      🟨 rtx2080            Pass:  57%/33  | Total:  8h 37m | Avg: 15m 40s | Max:  1h 10m | Hits:  55%/33821 
      🟨 rtx4090            Pass:  50%/10  | Total:  2h 32m | Avg: 15m 17s | Max:  1h 01m | Hits:  73%/8893  
    🟨 jobs
      🟨 Build              Pass:  55%/38  | Total: 10h 21m | Avg: 16m 21s | Max:  1h 10m | Hits:  54%/37377 
      🟨 TestCPU            Pass:  66%/3   | Total: 44m 03s | Avg: 14m 41s | Max: 36m 33s | Hits:  85%/3556  
      🟨 TestGPU            Pass:  25%/4   | Total: 10m 18s | Avg:  2m 34s | Max: 10m 18s | Hits: 100%/1781  
    🟥 sm
      🟥 90                 Pass:   0%/2   | Total:  6m 03s | Avg:  3m 01s | Max:  6m 03s
      🟥 90;90a;100         Pass:   0%/1   | Total:  6m 32s | Avg:  6m 32s | Max:  6m 32s
    🟨 std
      🟨 17                 Pass:  55%/20  | Total:  5h 52m | Avg: 17m 38s | Max:  1h 10m | Hits:  52%/19573 
      🟨 20                 Pass:  56%/23  | Total:  5h 01m | Avg: 13m 05s | Max:  1h 05m | Hits:  65%/23141 
    
  • 🟨 libcudacxx: Pass: 93%/43 | Total: 7h 32m | Avg: 10m 30s | Max: 42m 13s | Hits: 89%/98461

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  92%/41  | Total:  7h 22m | Avg: 10m 47s | Max: 42m 13s | Hits:  89%/92748 
      🟩 arm64              Pass: 100%/2   | Total:  9m 51s | Avg:  4m 55s | Max:  5m 00s | Hits:  94%/5713  
    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/5   | Total: 53m 09s | Avg: 10m 37s | Max: 33m 33s | Hits:  93%/13819 
      🟩 12.5               Pass: 100%/2   | Total:  1h 18m | Avg: 39m 28s | Max: 42m 13s | Hits:   3%/5658  
      🔍 12.8               Pass:  91%/36  | Total:  5h 19m | Avg:  8m 53s | Max: 38m 04s | Hits:  95%/78984 
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 16m 05s | Avg:  8m 02s | Max: 12m 00s | Hits:  99%/2946  
      🔍 rtx2080            Pass:  92%/41  | Total:  7h 15m | Avg: 10m 37s | Max: 42m 13s | Hits:  89%/95515 
    🔍 sm: 75 🔍
      🔍 75                 Pass:  50%/2   | Total: 35m 37s | Avg: 17m 48s | Max: 19m 36s | Hits:  90%/20    
      🟩 90                 Pass: 100%/2   | Total: 16m 05s | Avg:  8m 02s | Max: 12m 00s | Hits:  99%/2946  
      🟩 90;90a;100         Pass: 100%/1   | Total:  4m 34s | Avg:  4m 34s | Max:  4m 34s | Hits:  98%/2946  
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  5m 13s | Avg:  2m 36s | Max:  2m 37s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 53m 09s | Avg: 10m 37s | Max: 33m 33s | Hits:  93%/13819 
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 18m | Avg: 39m 28s | Max: 42m 13s | Hits:   3%/5658  
      🟨 nvcc12.8           Pass:  97%/34  | Total:  5h 14m | Avg:  9m 15s | Max: 38m 04s | Hits:  95%/78984 
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 19m 56s | Avg:  4m 59s | Max:  5m 26s | Hits:  95%/11318 
      🟩 Clang15            Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  5m 50s | Hits:  94%/5670  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  5m 52s | Hits:  94%/5670  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 05s | Avg:  6m 02s | Max:  6m 14s | Hits:  94%/5670  
      🟨 Clang18            Pass:  66%/6   | Total: 39m 29s | Avg:  6m 34s | Max: 14m 24s | Hits:  92%/8526  
      🟩 GCC7               Pass: 100%/2   | Total:  8m 06s | Avg:  4m 03s | Max:  4m 09s | Hits:  97%/5608  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s | Hits:  93%/2814  
      🟩 GCC9               Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  6m 21s | Hits:  91%/5620  
      🟩 GCC10              Pass: 100%/2   | Total: 10m 39s | Avg:  5m 19s | Max:  5m 31s | Hits:  93%/5676  
      🟩 GCC11              Pass: 100%/2   | Total:  8m 55s | Avg:  4m 27s | Max:  4m 53s | Hits:  96%/5672  
      🟩 GCC12              Pass: 100%/2   | Total: 11m 34s | Avg:  5m 47s | Max:  6m 30s | Hits:  93%/5672  
      🟨 GCC13              Pass:  90%/10  | Total:  1h 24m | Avg:  8m 27s | Max: 19m 36s | Hits:  97%/14441 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 06m | Avg: 33m 19s | Max: 33m 33s | Hits:  94%/5142  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 30s | Max: 38m 04s | Hits:  93%/5304  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 18m | Avg: 39m 28s | Max: 42m 13s | Hits:   3%/5658  
    🟨 cxx_family
      🟨 Clang              Pass:  87%/16  | Total:  1h 34m | Avg:  5m 55s | Max: 14m 24s | Hits:  94%/36854 
      🟨 GCC                Pass:  95%/21  | Total:  2h 20m | Avg:  6m 41s | Max: 19m 36s | Hits:  95%/45503 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 17m | Avg: 34m 24s | Max: 38m 04s | Hits:  94%/10446 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 18m | Avg: 39m 28s | Max: 42m 13s | Hits:   3%/5658  
    🟨 jobs
      🟨 Build              Pass:  94%/37  | Total:  6h 16m | Avg: 10m 10s | Max: 42m 13s | Hits:  89%/98441 
      🟨 NVRTC              Pass:  50%/2   | Total: 35m 37s | Avg: 17m 48s | Max: 19m 36s | Hits:  90%/20    
      🟩 Test               Pass: 100%/3   | Total: 37m 29s | Avg: 12m 29s | Max: 14m 24s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  5m 13s | Avg:  2m 36s | Max:  2m 37s
      🟨 nvcc               Pass:  97%/41  | Total:  7h 26m | Avg: 10m 53s | Max: 42m 13s | Hits:  89%/98461 
    🟨 std
      🟨 17                 Pass:  95%/21  | Total:  3h 57m | Avg: 11m 19s | Max: 36m 44s | Hits:  89%/52724 
      🟨 20                 Pass:  90%/21  | Total:  3h 31m | Avg: 10m 05s | Max: 42m 13s | Hits:  89%/45737 
    
  • 🟩 cub: Pass: 100%/45 | Total: 13h 56m | Avg: 18m 34s | Max: 1h 19m | Hits: 89%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 13h 44m | Avg: 19m 10s | Max:  1h 19m | Hits:  89%/51055 
      🟩 arm64              Pass: 100%/2   | Total: 11m 52s | Avg:  5m 56s | Max:  6m 04s | Hits:  99%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 27m | Avg: 17m 35s | Max:  1h 04m | Hits:  85%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 17m | Hits:  11%/2248  
      🟩 12.8               Pass: 100%/38  | Total: 10h 02m | Avg: 15m 50s | Max:  1h 19m | Hits:  94%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  5m 08s | Hits: 100%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 27m | Avg: 17m 35s | Max:  1h 04m | Hits:  85%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 17m | Hits:  11%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  9h 52m | Avg: 16m 27s | Max:  1h 19m | Hits:  93%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  5m 08s | Hits: 100%/2100  
      🟩 nvcc               Pass: 100%/43  | Total: 13h 46m | Avg: 19m 13s | Max:  1h 19m | Hits:  89%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 58s | Avg:  5m 59s | Max:  6m 38s | Hits: 100%/4868  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 16s | Avg:  6m 08s | Max:  6m 22s | Hits: 100%/2430  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 30s | Avg:  6m 15s | Max:  6m 22s | Hits: 100%/2430  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 22s | Avg:  6m 11s | Max:  6m 16s | Hits: 100%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 09m | Avg:  9m 58s | Max: 21m 58s | Hits:  99%/8175  
      🟩 GCC7               Pass: 100%/2   | Total: 11m 45s | Avg:  5m 52s | Max:  5m 57s | Hits:  99%/2434  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 05s | Avg:  6m 05s | Max:  6m 05s | Hits:  99%/1217  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 12s | Avg:  6m 06s | Max:  6m 21s | Hits:  99%/2434  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 39s | Avg:  6m 19s | Max:  6m 25s | Hits:  99%/2434  
      🟩 GCC11              Pass: 100%/2   | Total: 12m 38s | Avg:  6m 19s | Max:  6m 24s | Hits:  99%/2430  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 48s | Avg:  6m 54s | Max:  6m 54s | Hits:  99%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  3h 15m | Avg: 17m 48s | Max: 35m 44s | Hits:  99%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 15m | Hits:  15%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 19m | Hits:  15%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 17m | Hits:  11%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 10m | Avg:  7m 41s | Max: 21m 58s | Hits:  99%/20333 
      🟩 GCC                Pass: 100%/22  | Total:  4h 25m | Avg: 12m 02s | Max: 35m 44s | Hits:  99%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 54m | Avg:  1h 13m | Max:  1h 19m | Hits:  15%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 17m | Hits:  11%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 50m 22s | Avg: 16m 47s | Max: 23m 24s | Hits:  99%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total: 10h 12m | Avg: 18m 00s | Max:  1h 19m | Hits:  86%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 53m | Avg: 21m 38s | Max: 35m 44s | Hits:  99%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 10h 59m | Avg: 17m 49s | Max:  1h 19m | Hits:  87%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 31s | Avg: 23m 31s | Max: 23m 31s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 06s | Avg: 19m 06s | Max: 19m 06s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 42s | Max: 23m 24s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 05m | Avg: 21m 58s | Max: 24m 33s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 50m 22s | Avg: 16m 47s | Max: 23m 24s | Hits:  99%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 14s | Avg:  7m 14s | Max:  7m 14s | Hits:  99%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  6h 22m | Avg: 19m 06s | Max:  1h 15m | Hits:  84%/23535 
      🟩 20                 Pass: 100%/25  | Total:  7h 33m | Avg: 18m 09s | Max:  1h 19m | Hits:  93%/29950 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 04m | Avg: 5m 39s | Max: 12m 24s | Hits: 93%/11392

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 52m | Avg:  6m 15s | Max: 12m 24s | Hits:  91%/9136  
      🟩 arm64              Pass: 100%/4   | Total: 11m 47s | Avg:  2m 56s | Max:  3m 02s | Hits:  99%/2256  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 28s | Avg: 11m 28s | Max: 11m 28s | Hits:  49%/262   
      🟩 12.5               Pass: 100%/2   | Total: 18m 35s | Avg:  9m 17s | Max:  9m 53s | Hits:  33%/712   
      🟩 12.8               Pass: 100%/19  | Total:  1h 34m | Avg:  4m 57s | Max: 12m 24s | Hits:  98%/10418 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 28s | Avg: 11m 28s | Max: 11m 28s | Hits:  49%/262   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 35s | Avg:  9m 17s | Max:  9m 53s | Hits:  33%/712   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 34m | Avg:  4m 57s | Max: 12m 24s | Hits:  98%/10418 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 04m | Avg:  5m 39s | Max: 12m 24s | Hits:  93%/11392 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s | Hits:  99%/566   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s | Hits:  99%/564   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s | Hits:  99%/564   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s | Hits:  99%/564   
      🟩 Clang18            Pass: 100%/4   | Total: 20m 40s | Avg:  5m 10s | Max: 11m 28s | Hits:  99%/2256  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 16s | Avg:  3m 16s | Max:  3m 16s | Hits:  99%/566   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s | Hits:  99%/564   
      🟩 GCC12              Pass: 100%/2   | Total: 15m 47s | Avg:  7m 53s | Max: 12m 24s | Hits:  99%/1128  
      🟩 GCC13              Pass: 100%/6   | Total: 26m 45s | Avg:  4m 27s | Max: 11m 25s | Hits:  99%/3384  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 28s | Avg: 11m 28s | Max: 11m 28s | Hits:  49%/262   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 10m 31s | Avg: 10m 31s | Max: 10m 31s | Hits:  49%/262   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 35s | Avg:  9m 17s | Max:  9m 53s | Hits:  33%/712   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 34m 34s | Avg:  4m 19s | Max: 11m 28s | Hits:  99%/4514  
      🟩 GCC                Pass: 100%/10  | Total: 49m 14s | Avg:  4m 55s | Max: 12m 24s | Hits:  99%/5642  
      🟩 MSVC               Pass: 100%/2   | Total: 21m 59s | Avg: 10m 59s | Max: 11m 28s | Hits:  49%/524   
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 35s | Avg:  9m 17s | Max:  9m 53s | Hits:  33%/712   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 14m 23s | Avg:  7m 11s | Max: 11m 25s | Hits:  99%/1128  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 49m | Avg:  5m 29s | Max: 12m 24s | Hits:  92%/10264 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 29m | Avg:  4m 41s | Max: 11m 28s | Hits:  92%/9700  
      🟩 Test               Pass: 100%/3   | Total: 35m 17s | Avg: 11m 45s | Max: 12m 24s | Hits:  99%/1692  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 17m 37s | Avg:  5m 52s | Max: 11m 25s | Hits:  99%/1692  
      🟩 90a                Pass: 100%/1   | Total:  3m 08s | Avg:  3m 08s | Max:  3m 08s | Hits:  99%/564   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 17m 55s | Avg:  4m 28s | Max:  8m 42s | Hits:  88%/2048  
      🟩 20                 Pass: 100%/18  | Total:  1h 46m | Avg:  5m 54s | Max: 12m 24s | Hits:  94%/9344  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 37s | Avg: 7m 48s | Max: 13m 25s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max: 13m 25s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max: 13m 25s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max: 13m 25s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max: 13m 25s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max: 13m 25s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max: 13m 25s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 37s | Avg:  7m 48s | Max: 13m 25s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 13m 25s | Avg: 13m 25s | Max: 13m 25s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 50m 23s | Avg: 50m 23s | Max: 50m 23s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco miscco force-pushed the trigonometric_math branch from 80efbcf to 315fc11 Compare February 28, 2025 20:17
@github-actions
Contributor

🟩 CI finished in 1h 30m: Pass: 100%/158 | Total: 1d 13h | Avg: 14m 24s | Max: 1h 19m | Hits: 85%/249476
  • 🟩 cub: Pass: 100%/45 | Total: 15h 19m | Avg: 20m 26s | Max: 1h 19m | Hits: 86%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 15h 08m | Avg: 21m 08s | Max:  1h 19m | Hits:  85%/51055 
      🟩 arm64              Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  5m 45s | Hits:  99%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 26m | Avg: 17m 23s | Max:  1h 04m | Hits:  85%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 19m | Hits:  10%/2248  
      🟩 12.8               Pass: 100%/38  | Total: 11h 25m | Avg: 18m 01s | Max:  1h 17m | Hits:  90%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  15%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 26m | Avg: 17m 23s | Max:  1h 04m | Hits:  85%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 19m | Hits:  10%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  9h 15m | Avg: 15m 25s | Max:  1h 17m | Hits:  93%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  15%/2100  
      🟩 nvcc               Pass: 100%/43  | Total: 13h 10m | Avg: 18m 22s | Max:  1h 19m | Hits:  89%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 58s | Avg:  5m 44s | Max:  6m 04s | Hits: 100%/4868  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 24s | Avg:  6m 12s | Max:  6m 13s | Hits: 100%/2430  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 27s | Avg:  6m 13s | Max:  6m 16s | Hits: 100%/2430  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 08s | Hits: 100%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  3h 10m | Avg: 27m 15s | Max:  1h 07m | Hits:  78%/8175  
      🟩 GCC7               Pass: 100%/2   | Total: 11m 29s | Avg:  5m 44s | Max:  5m 52s | Hits:  99%/2434  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 06s | Avg:  6m 06s | Max:  6m 06s | Hits:  99%/1217  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 20s | Avg:  6m 10s | Max:  6m 34s | Hits:  99%/2434  
      🟩 GCC10              Pass: 100%/2   | Total: 13m 13s | Avg:  6m 36s | Max:  6m 42s | Hits:  99%/2434  
      🟩 GCC11              Pass: 100%/2   | Total: 12m 22s | Avg:  6m 11s | Max:  6m 11s | Hits:  99%/2430  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 14s | Avg:  6m 37s | Max:  6m 41s | Hits:  99%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 38m | Avg: 14m 26s | Max: 23m 23s | Hits:  99%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 15m | Hits:  16%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 33m | Avg:  1h 16m | Max:  1h 17m | Hits:  15%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 19m | Hits:  10%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  4h 10m | Avg: 14m 44s | Max:  1h 07m | Hits:  91%/20333 
      🟩 GCC                Pass: 100%/22  | Total:  3h 47m | Avg: 10m 20s | Max: 23m 23s | Hits:  99%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  4h 53m | Avg:  1h 13m | Max:  1h 17m | Hits:  15%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 19m | Hits:  10%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 49m 51s | Avg: 16m 37s | Max: 23m 22s | Hits:  99%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total: 12h 10m | Avg: 21m 29s | Max:  1h 19m | Hits:  81%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 19m | Avg: 17m 24s | Max: 23m 23s | Hits:  99%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 12h 28m | Avg: 20m 13s | Max:  1h 19m | Hits:  83%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 38s | Avg: 20m 38s | Max: 20m 38s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 02s | Avg: 17m 02s | Max: 17m 02s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 10m | Avg: 23m 21s | Max: 23m 23s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 03m | Avg: 21m 14s | Max: 21m 50s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 49m 51s | Avg: 16m 37s | Max: 23m 22s | Hits:  99%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 48s | Avg:  6m 48s | Max:  6m 48s | Hits:  99%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  7h 20m | Avg: 22m 01s | Max:  1h 17m | Hits:  80%/23535 
      🟩 20                 Pass: 100%/25  | Total:  7h 59m | Avg: 19m 10s | Max:  1h 19m | Hits:  90%/29950 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 11h 27m | Avg: 15m 16s | Max: 1h 06m | Hits: 82%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 22m 31s | Avg: 11m 15s | Max: 12m 29s | Hits:  85%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 11h 15m | Avg: 15m 42s | Max:  1h 06m | Hits:  81%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 12m 03s | Avg:  6m 01s | Max:  7m 35s | Hits:  92%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 17m | Avg: 15m 30s | Max: 53m 07s | Hits:  77%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:   0%/3562  
      🟩 12.8               Pass: 100%/38  | Total:  7h 57m | Avg: 12m 33s | Max: 53m 40s | Hits:  87%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 46m 43s | Avg: 23m 21s | Max: 23m 23s | Hits:  75%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 17m | Avg: 15m 30s | Max: 53m 07s | Hits:  77%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:   0%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  7h 10m | Avg: 11m 57s | Max: 53m 40s | Hits:  87%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 43s | Avg: 23m 21s | Max: 23m 23s | Hits:  75%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 10h 40m | Avg: 14m 53s | Max:  1h 06m | Hits:  82%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 19m 54s | Avg:  4m 58s | Max:  5m 12s | Hits: 100%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  5m 55s | Hits: 100%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 43s | Avg:  5m 21s | Max:  5m 22s | Hits: 100%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 37s | Hits: 100%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 20m | Avg: 11m 29s | Max: 23m 23s | Hits:  92%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max:  6m 52s | Hits:  89%/3564  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 18s | Avg:  7m 18s | Max:  7m 18s | Hits:  84%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 15m 57s | Avg:  7m 58s | Max:  8m 02s | Hits:  86%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 15m 12s | Avg:  7m 36s | Max:  7m 38s | Hits:  84%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 16m 09s | Avg:  8m 04s | Max:  8m 13s | Hits:  84%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 17m 26s | Avg:  8m 43s | Max:  8m 59s | Hits:  84%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 29m | Avg:  8m 58s | Max: 12m 29s | Hits:  92%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 14s | Max: 53m 22s | Hits:  39%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 19m | Avg: 46m 24s | Max: 53m 40s | Hits:  35%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:   0%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 13m | Avg:  7m 51s | Max: 23m 23s | Hits:  97%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  2h 55m | Avg:  8m 21s | Max: 12m 29s | Hits:  88%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 05m | Avg: 49m 08s | Max: 53m 40s | Hits:  37%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 06m | Hits:   0%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 11m 33s | Hits:  96%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total:  8h 33m | Avg: 15m 33s | Max:  1h 06m | Hits:  80%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 37m | Avg: 15m 43s | Max: 53m 29s | Hits:  85%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  9h 53m | Avg: 15m 37s | Max:  1h 06m | Hits:  79%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 47m 50s | Avg: 15m 56s | Max: 32m 04s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 45m 50s | Avg: 11m 27s | Max: 12m 29s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 49s | Avg:  8m 24s | Max: 11m 33s | Hits:  96%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total:  8m 58s | Avg:  8m 58s | Max:  8m 58s | Hits:  87%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  5h 51m | Avg: 17m 34s | Max:  1h 06m | Hits:  77%/35611 
      🟩 20                 Pass: 100%/23  | Total:  5h 13m | Avg: 13m 37s | Max:  1h 06m | Hits:  86%/40961 
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 7h 54m | Avg: 11m 01s | Max: 39m 37s | Hits: 86%/104155

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  7h 44m | Avg: 11m 19s | Max: 39m 37s | Hits:  85%/98442 
      🟩 arm64              Pass: 100%/2   | Total: 10m 05s | Avg:  5m 02s | Max:  5m 03s | Hits:  94%/5713  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 53m 32s | Avg: 10m 42s | Max: 31m 37s | Hits:  92%/13819 
      🟩 12.5               Pass: 100%/2   | Total:  1h 12m | Avg: 36m 16s | Max: 37m 30s | Hits:   3%/5658  
      🟩 12.8               Pass: 100%/36  | Total:  5h 48m | Avg:  9m 40s | Max: 39m 37s | Hits:  90%/84678 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 41m 42s | Avg: 20m 51s | Max: 22m 13s | Hits:  26%/5674  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 53m 32s | Avg: 10m 42s | Max: 31m 37s | Hits:  92%/13819 
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 12m | Avg: 36m 16s | Max: 37m 30s | Hits:   3%/5658  
      🟩 nvcc12.8           Pass: 100%/34  | Total:  5h 06m | Avg:  9m 00s | Max: 39m 37s | Hits:  95%/79004 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 41m 42s | Avg: 20m 51s | Max: 22m 13s | Hits:  26%/5674  
      🟩 nvcc               Pass: 100%/41  | Total:  7h 12m | Avg: 10m 32s | Max: 39m 37s | Hits:  89%/98481 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 46s | Avg:  5m 41s | Max:  7m 22s | Hits:  93%/11318 
      🟩 Clang15            Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 52s | Hits:  95%/5670  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 59s | Avg:  5m 29s | Max:  5m 51s | Hits:  96%/5670  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 42s | Hits:  94%/5670  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 07m | Avg: 11m 13s | Max: 22m 13s | Hits:  68%/14200 
      🟩 GCC7               Pass: 100%/2   | Total: 11m 14s | Avg:  5m 37s | Max:  6m 16s | Hits:  91%/5608  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s | Hits:  95%/2814  
      🟩 GCC9               Pass: 100%/2   | Total:  9m 39s | Avg:  4m 49s | Max:  5m 22s | Hits:  95%/5620  
      🟩 GCC10              Pass: 100%/2   | Total:  9m 41s | Avg:  4m 50s | Max:  4m 59s | Hits:  96%/5676  
      🟩 GCC11              Pass: 100%/2   | Total: 10m 11s | Avg:  5m 05s | Max:  5m 08s | Hits:  94%/5672  
      🟩 GCC12              Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  5m 20s | Hits:  96%/5672  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 20m | Avg:  8m 04s | Max: 16m 33s | Hits:  95%/14461 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 19s | Max: 37m 01s | Hits:  94%/5142  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 13m | Avg: 36m 32s | Max: 39m 37s | Hits:  90%/5304  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 12m | Avg: 36m 16s | Max: 37m 30s | Hits:   3%/5658  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  2h 03m | Avg:  7m 43s | Max: 22m 13s | Hits:  85%/42528 
      🟩 GCC                Pass: 100%/21  | Total:  2h 16m | Avg:  6m 29s | Max: 16m 33s | Hits:  95%/45523 
      🟩 MSVC               Pass: 100%/4   | Total:  2h 21m | Avg: 35m 25s | Max: 39m 37s | Hits:  92%/10446 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 12m | Avg: 36m 16s | Max: 37m 30s | Hits:   3%/5658  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 22s | Avg:  8m 11s | Max: 12m 04s | Hits:  97%/2946  
      🟩 rtx2080            Pass: 100%/41  | Total:  7h 37m | Avg: 11m 10s | Max: 39m 37s | Hits:  85%/101209
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  6h 48m | Avg: 11m 02s | Max: 39m 37s | Hits:  86%/104115
      🟩 NVRTC              Pass: 100%/2   | Total: 31m 25s | Avg: 15m 42s | Max: 16m 33s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 31m 57s | Avg: 10m 39s | Max: 12m 04s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 31m 25s | Avg: 15m 42s | Max: 16m 33s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 16m 22s | Avg:  8m 11s | Max: 12m 04s | Hits:  97%/2946  
      🟩 90;90a;100         Pass: 100%/1   | Total:  5m 24s | Avg:  5m 24s | Max:  5m 24s | Hits:  95%/2946  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 13m | Avg: 12m 04s | Max: 37m 01s | Hits:  86%/55540 
      🟩 20                 Pass: 100%/21  | Total:  3h 38m | Avg: 10m 24s | Max: 39m 37s | Hits:  86%/48615 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 08m | Avg: 5m 51s | Max: 16m 44s | Hits: 93%/11392

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 57m | Avg:  6m 31s | Max: 16m 44s | Hits:  91%/9136  
      🟩 arm64              Pass: 100%/4   | Total: 11m 31s | Avg:  2m 52s | Max:  2m 56s | Hits:  99%/2256  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 14s | Avg: 11m 14s | Max: 11m 14s | Hits:  49%/262   
      🟩 12.5               Pass: 100%/2   | Total: 18m 39s | Avg:  9m 19s | Max:  9m 23s | Hits:  31%/712   
      🟩 12.8               Pass: 100%/19  | Total:  1h 39m | Avg:  5m 12s | Max: 16m 44s | Hits:  98%/10418 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 14s | Avg: 11m 14s | Max: 11m 14s | Hits:  49%/262   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 39s | Avg:  9m 19s | Max:  9m 23s | Hits:  31%/712   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 39m | Avg:  5m 12s | Max: 16m 44s | Hits:  98%/10418 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 08m | Avg:  5m 51s | Max: 16m 44s | Hits:  93%/11392 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s | Hits: 100%/566   
      🟩 Clang15            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s | Hits: 100%/564   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s | Hits: 100%/564   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s | Hits: 100%/564   
      🟩 Clang18            Pass: 100%/4   | Total: 25m 58s | Avg:  6m 29s | Max: 16m 44s | Hits: 100%/2256  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s | Hits:  99%/566   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s | Hits:  99%/564   
      🟩 GCC12              Pass: 100%/2   | Total: 15m 44s | Avg:  7m 52s | Max: 12m 19s | Hits:  99%/1128  
      🟩 GCC13              Pass: 100%/6   | Total: 26m 16s | Avg:  4m 22s | Max: 11m 26s | Hits:  99%/3384  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 14s | Avg: 11m 14s | Max: 11m 14s | Hits:  49%/262   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 11m 17s | Avg: 11m 17s | Max: 11m 17s | Hits:  49%/262   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 39s | Avg:  9m 19s | Max:  9m 23s | Hits:  31%/712   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 39m 16s | Avg:  4m 54s | Max: 16m 44s | Hits: 100%/4514  
      🟩 GCC                Pass: 100%/10  | Total: 48m 30s | Avg:  4m 51s | Max: 12m 19s | Hits:  99%/5642  
      🟩 MSVC               Pass: 100%/2   | Total: 22m 31s | Avg: 11m 15s | Max: 11m 17s | Hits:  49%/524   
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 39s | Avg:  9m 19s | Max:  9m 23s | Hits:  31%/712   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 14m 27s | Avg:  7m 13s | Max: 11m 26s | Hits:  99%/1128  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 54m | Avg:  5m 43s | Max: 16m 44s | Hits:  92%/10264 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 28m | Avg:  4m 39s | Max: 11m 17s | Hits:  92%/9700  
      🟩 Test               Pass: 100%/3   | Total: 40m 29s | Avg: 13m 29s | Max: 16m 44s | Hits:  99%/1692  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 17m 26s | Avg:  5m 48s | Max: 11m 26s | Hits:  99%/1692  
      🟩 90a                Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s | Hits:  99%/564   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 06s | Avg:  4m 31s | Max:  9m 16s | Hits:  87%/2048  
      🟩 20                 Pass: 100%/18  | Total:  1h 50m | Avg:  6m 09s | Max: 16m 44s | Hits:  94%/9344  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 31s | Avg: 7m 45s | Max: 13m 12s | Hits: 98%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max: 13m 12s | Hits:  98%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max: 13m 12s | Hits:  98%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max: 13m 12s | Hits:  98%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max: 13m 12s | Hits:  98%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max: 13m 12s | Hits:  98%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max: 13m 12s | Hits:  98%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 31s | Avg:  7m 45s | Max: 13m 12s | Hits:  98%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 19s | Avg:  2m 19s | Max:  2m 19s | Hits:  98%/154   
      🟩 Test               Pass: 100%/1   | Total: 13m 12s | Avg: 13m 12s | Max: 13m 12s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 51m 23s | Avg: 51m 23s | Max: 51m 23s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Contributor

@griwes griwes left a comment

Approving, but please either change the commented part, or leave some trace of the reason for it in the code.

Comment on lines +95 to +99
#if _CCCL_CUDA_COMPILER(CLANG)
# undef _CCCL_BUILTIN_ACOSF
# undef _CCCL_BUILTIN_ACOS
# undef _CCCL_BUILTIN_ACOSL
#endif // _CCCL_CUDA_COMPILER(CLANG)
Contributor

Why this way, instead of adding `&& !_CCCL_CUDA_COMPILER(CLANG)` to the condition around where these are defined?

Contributor Author

In this case it's relatively simple, but we have a ton of situations where the enablement of the builtin has multiple conditions, and mixing a chain of `||` with a chain of `&& !` got ugly fast.

So we settled on splitting it into enablement and "compiler issues". That should also make it easier to spot what is a compiler issue and what is actually part of the condition.
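
A minimal sketch of that split, for illustration only. Every `MY_*` name and `my_acos` below is hypothetical and is not one of the actual libcu++ macros; `MY_CUDA_COMPILER_CLANG` is hardcoded to 0 as a stand-in for real compiler detection. The first block only decides whether the builtin is available, and the second block only records compilers where it has to be turned off again.

```cpp
#include <cmath>

// Hypothetical stand-in for a compiler-detection macro (assumption: not clang-cuda).
#define MY_CUDA_COMPILER_CLANG 0

// Portable wrapper for compilers that do not provide __has_builtin.
#if defined(__has_builtin)
#  define MY_HAS_BUILTIN(x) __has_builtin(x)
#else
#  define MY_HAS_BUILTIN(x) 0
#endif

// Step 1: enablement. Only the conditions under which the builtin exists.
#if MY_HAS_BUILTIN(__builtin_acos)
#  define MY_BUILTIN_ACOS(...) __builtin_acos(__VA_ARGS__)
#endif

// Step 2: compiler issues. Kept separate so the enablement condition above
// does not turn into a mix of `||` and `&& !` chains.
#if MY_CUDA_COMPILER_CLANG
#  undef MY_BUILTIN_ACOS
#endif

// Uses the builtin when it survived both steps, otherwise the library function.
inline double my_acos(double x)
{
#if defined(MY_BUILTIN_ACOS)
  return MY_BUILTIN_ACOS(x);
#else
  return std::acos(x);
#endif
}
```

With this layout, a new workaround is one more `#undef` block under step 2, and the step 1 condition stays untouched.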

Contributor

If this is a compiler issue, we really should have a comment somewhere in there recording what it is.

@miscco miscco merged commit 4e5824f into NVIDIA:main Mar 4, 2025
172 of 175 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 4, 2025
@miscco miscco deleted the trigonometric_math branch March 4, 2025 09:02
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025
* Implement trigonometric functions

* Implement hyperbolic math functions

* Implement inverse trigonometric functions

* Implement inverse hyperbolic functions

* Implement gamma functions

* Implement `std::hypot`

* Use variable templates in cmath tests

* Avoid unused variable warnings

* long double fixes

* Try and fix clang-cuda

Labels

libcu++ For all items related to libcu++

Projects

Archived in project

3 participants