[arm64] Addressing mode for vectors

I noticed that we lose some perf in various `SpanHelpers` on arm64 due to missing addressing modes which brake pipelining, minimal repro:
```csharp
    Vector128<byte> Add(ref byte b1, ref byte b2, nuint offset) =>
        Vector128.LoadUnsafe(ref b1, offset) + 
        Vector128.LoadUnsafe(ref b2, offset);
```
Current codegen:
```asm
        add     x0, x1, x3
        ld1     {v16.16b}, [x0]
        add     x0, x2, x3
        ld1     {v17.16b}, [x0]
        add     v16.16b, v16.16b, v17.16b
        mov     v0.16b, v16.16b
```
Expected codegen:
```asm
        ldr     q16, [x1, x3]
        ldr     q17, [x2, x3]
        add     v16.16b, v16.16b, v17.16b
        mov     v0.16b, v16.16b
```
same for `[addr + imm]` e.g. `Vector128.LoadUnsafe(ref b2, 16)`

cc @tannergooding

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[arm64] Addressing mode for vectors #67435

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[arm64] Addressing mode for vectors #67435

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions