Skip to content

cudnn flash attention GQA support#555

Closed
kocchop wants to merge 2 commits into
AI-Hypercomputer:mainfrom
kocchop:cudnn_flash_dpa
Closed

cudnn flash attention GQA support#555
kocchop wants to merge 2 commits into
AI-Hypercomputer:mainfrom
kocchop:cudnn_flash_dpa

added GQA support for cudnn flash attention

51f6d85
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs