🚀 The feature, motivation and pitch
Support better tiling algorithms for the Apple Neural Engine (ANE) in ops such as linear, SDPA, matmul, and bmm. We noticed that explicitly splitting these ops up in PyTorch before export boosts performance, but ideally the CoreML compiler would do this tiling itself.
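For illustration, here is a minimal sketch of what "explicitly splitting up ops in PyTorch" could look like for a linear layer. It is not the exact code we use: `SplitLinear` and `num_splits` are hypothetical names, and splitting along the output dimension into equal chunks is an assumption. Which axis and chunk size actually help on ANE likely depends on the model and hardware.

```python
import torch
import torch.nn as nn


class SplitLinear(nn.Module):
    """Hypothetical wrapper: evaluates one nn.Linear as several smaller linears."""

    def __init__(self, linear: nn.Linear, num_splits: int):
        super().__init__()
        # nn.Linear stores weight as (out_features, in_features), so chunking
        # along dim 0 splits the output features. torch.chunk may return fewer
        # chunks than requested, so we rely on the actual number returned.
        weight_chunks = torch.chunk(linear.weight.detach(), num_splits, dim=0)
        if linear.bias is not None:
            bias_chunks = torch.chunk(linear.bias.detach(), num_splits, dim=0)
        else:
            bias_chunks = [None] * len(weight_chunks)

        self.parts = nn.ModuleList()
        for w, b in zip(weight_chunks, bias_chunks):
            part = nn.Linear(w.shape[1], w.shape[0], bias=b is not None)
            with torch.no_grad():
                part.weight.copy_(w)
                if b is not None:
                    part.bias.copy_(b)
            self.parts.append(part)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Each part produces a slice of the output features; concatenating
        # along the last dim reproduces the original layer's output.
        return torch.cat([part(x) for part in self.parts], dim=-1)


if __name__ == "__main__":
    torch.manual_seed(0)
    original = nn.Linear(64, 96)
    split = SplitLinear(original, num_splits=4)
    x = torch.randn(2, 5, 64)
    # The split version should match the original up to floating-point error.
    print(torch.allclose(split(x), original(x), atol=1e-5))
```

The split module is numerically equivalent to the original layer, so it can be swapped in before conversion. The request is for the CoreML compiler to pick a tiling like this automatically, so that models do not need to be rewritten by hand.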
Alternatives
No response
Additional context
No response
RFC (Optional)
No response