Skip to content

ci: cache target dir for CUDA codspeed bench (~7× faster build)#8256

Closed
joseph-isaacs wants to merge 1 commit into
developfrom
ci/cuda-bench-target-cache
Closed

ci: cache target dir for CUDA codspeed bench (~7× faster build)#8256
joseph-isaacs wants to merge 1 commit into
developfrom
ci/cuda-bench-target-cache

ci: cache target dir for CUDA codspeed bench

a1c1818
Select commit
Loading
Failed to load commit list.
CodSpeed HQ / CodSpeed Performance Analysis succeeded Jun 5, 2026 in 0s

Performance Gate Passed

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚠️ Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.

Open the report in CodSpeed to investigate

⚡ 6 improved benchmarks
✅ 1501 untouched benchmarks

Performance Changes

Mode Benchmark BASE HEAD Efficiency
Simulation chunked_bool_canonical_into[(1000, 10)] 46.6 µs 31.7 µs +46.98%
Simulation bitwise_not_vortex_buffer_mut[128] 275.3 ns 216.9 ns +26.89%
Simulation bitwise_not_vortex_buffer_mut[1024] 336.9 ns 278.6 ns +20.94%
Simulation chunked_varbinview_into_canonical[(1000, 10)] 213.2 µs 177.1 µs +20.41%
Simulation bitwise_not_vortex_buffer_mut[2048] 400.6 ns 342.2 ns +17.05%
Simulation chunked_varbinview_canonical_into[(100, 100)] 309.6 µs 274.7 µs +12.71%

Tip

Curious why this is faster? Comment @codspeedbot explain why this is faster on this PR, or directly use the CodSpeed MCP with your agent.


Comparing ci/cuda-bench-target-cache (a1c1818) with develop (d97d2bd)

Open in CodSpeed