Skip to content

perf: branchless primitive zip kernel#8270

Merged
joseph-isaacs merged 4 commits into
developfrom
claude/primitive-branchless-zip
Jun 12, 2026
Merged

perf: branchless primitive zip kernel#8270
joseph-isaacs merged 4 commits into
developfrom
claude/primitive-branchless-zip

Add TODO to use unaligned_chunks in primitive zip kernel

336271c
Select commit
Loading
Failed to load commit list.
CodSpeed HQ / CodSpeed Performance Analysis succeeded Jun 9, 2026 in 0s

Performance Gate Passed

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

⚡ 7 improved benchmarks
✅ 1516 untouched benchmarks
🆕 2 new benchmarks

Performance Changes

Mode Benchmark BASE HEAD Efficiency
Simulation chunked_bool_canonical_into[(1000, 10)] 46.8 µs 31.7 µs +47.6%
Simulation varbinview_zip_block_mask 3.7 ms 2.9 ms +27.5%
Simulation chunked_varbinview_canonical_into[(1000, 10)] 198.5 µs 161.6 µs +22.87%
Simulation chunked_varbinview_into_canonical[(1000, 10)] 213.3 µs 176.9 µs +20.59%
Simulation chunked_varbinview_canonical_into[(100, 100)] 309.8 µs 273.6 µs +13.21%
Simulation varbinview_zip_fragmented_mask 6.9 ms 6.1 ms +12.74%
Simulation chunked_varbinview_into_canonical[(100, 100)] 362.6 µs 326.6 µs +11.03%
🆕 Simulation nonnull N/A 252.8 µs N/A
🆕 Simulation nullable N/A 276.5 µs N/A

Tip

Curious why this is faster? Comment @codspeedbot explain why this is faster on this PR, or directly use the CodSpeed MCP with your agent.


Comparing claude/primitive-branchless-zip (336271c) with develop (efd3e9b)

Open in CodSpeed