[FEAT] Improved PagedAttention FP8 (faster kvcache dequant v2) #347

Closed
tjtanaa wants to merge 5 commits into ROCm:llama_fp8_12062024 from EmbeddedLLM:paged-attn-updated

Commits

Commits on Dec 20, 2024

Commits on Dec 27, 2024

Commits on Dec 29, 2024

Commits on Dec 30, 2024