Skip to content

Add FP8*(B)F16 GEMM#3960

Merged
lvhan028 merged 14 commits intoInternLM:mainfrom
lzhangzz:fp8b
Sep 15, 2025
Merged

Add FP8*(B)F16 GEMM#3960
lvhan028 merged 14 commits intoInternLM:mainfrom
lzhangzz:fp8b

Conversation

@lzhangzz
Copy link
Copy Markdown
Collaborator

@lzhangzz lzhangzz commented Sep 11, 2025

  • Add FP8*(B)F16 for sm_70 ... sm_90
  • Optimize grouped GEMM performance for all mixed GEMMs
  • Re-organized code structure

@lzhangzz lzhangzz changed the title Add FP8x(B)F16 GEMM Add FP8*(B)F16 GEMM Sep 11, 2025
@lvhan028 lvhan028 added the enhancement New feature or request label Sep 12, 2025
@lvhan028
Copy link
Copy Markdown
Collaborator

In lmdeploy chat, when typing exit, it crashed:

terminate called after throwing an instance of 'std::system_error'
  what():  Resource deadlock avoided
Aborted (core dumped)

@lvhan028 lvhan028 merged commit 4e6419d into InternLM:main Sep 15, 2025
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants