perf: branchless primitive zip kernel #8270
Merged
CodSpeed HQ / CodSpeed Performance Analysis
succeeded
Jun 9, 2026 in 0s
Performance Gate Passed
⚠️ Unknown Walltime execution environment detected
Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.
For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.
⚡ 7 improved benchmarks
✅ 1516 untouched benchmarks
🆕 2 new benchmarks
Performance Changes
| | Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|---|
| ⚡ | Simulation | chunked_bool_canonical_into[(1000, 10)] | 46.8 µs | 31.7 µs | +47.6% |
| ⚡ | Simulation | varbinview_zip_block_mask | 3.7 ms | 2.9 ms | +27.5% |
| ⚡ | Simulation | chunked_varbinview_canonical_into[(1000, 10)] | 198.5 µs | 161.6 µs | +22.87% |
| ⚡ | Simulation | chunked_varbinview_into_canonical[(1000, 10)] | 213.3 µs | 176.9 µs | +20.59% |
| ⚡ | Simulation | chunked_varbinview_canonical_into[(100, 100)] | 309.8 µs | 273.6 µs | +13.21% |
| ⚡ | Simulation | varbinview_zip_fragmented_mask | 6.9 ms | 6.1 ms | +12.74% |
| ⚡ | Simulation | chunked_varbinview_into_canonical[(100, 100)] | 362.6 µs | 326.6 µs | +11.03% |
| 🆕 | Simulation | nonnull | N/A | 252.8 µs | N/A |
| 🆕 | Simulation | nullable | N/A | 276.5 µs | N/A |
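
The Efficiency figures in the table appear consistent with a relative speedup of BASE / HEAD - 1. That formula is inferred from the displayed numbers, not taken from CodSpeed's documentation, so treat the sketch below as an assumption. The helper name `relative_speedup_percent` is hypothetical, and the inputs are the rounded timings shown in the table, so results can differ slightly from the reported percentages (which are presumably computed from unrounded measurements).

```python
def relative_speedup_percent(base: float, head: float) -> float:
    """Percent improvement of HEAD over BASE, assuming speedup = BASE / HEAD - 1.

    Both timings must be in the same unit. This formula is an inference
    from the table, not a documented CodSpeed definition.
    """
    if head <= 0:
        raise ValueError("head time must be positive")
    return (base / head - 1.0) * 100.0


# Rounded (BASE, HEAD) timings in microseconds, copied from the table.
rows = {
    "chunked_bool_canonical_into[(1000, 10)]": (46.8, 31.7),
    "chunked_varbinview_into_canonical[(1000, 10)]": (213.3, 176.9),
}

for name, (base, head) in rows.items():
    print(f"{name}: {relative_speedup_percent(base, head):+.2f}%")
```

Under this assumption, both rows land within a few hundredths of a percentage point of the reported +47.6% and +20.59%. Rows whose timings are displayed with only two significant digits (for example 6.9 ms against 6.1 ms) deviate more, which is what rounding of the displayed values would produce.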
Comparing claude/primitive-branchless-zip (336271c) with develop (efd3e9b)