Skip to content

Split llama and modify architecture for performance

9646142
Select commit
Loading
Failed to load commit list.
Closed

[Draft] Qualcomm AI Engine Direct - Support kv_cached llama2 model #2966

Split llama and modify architecture for performance
9646142
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs