MXFP4 support for turbomind GEMM library#3927

Merged
lvhan028 merged 21 commits into InternLM:main from lzhangzz:mxfp4a on Sep 4, 2025

Conversation

Collaborator

@lzhangzz lzhangzz commented Sep 2, 2025

  • Grouped MXFP4 * half/bfloat16 for sm_70 ... sm_90
  • Support official gpt-oss (MXFP4) models
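
For reference, MXFP4 (as used by gpt-oss) packs weights into 32-element blocks of E2M1 values (1 sign bit, 2 exponent bits, 1 mantissa bit, i.e. magnitudes 0, 0.5, 1, 1.5, 2, 3, 4, 6) that share a single E8M0 power-of-two scale. The following is a minimal host-side C++ sketch of the decode path only; it is not turbomind's kernel code, and the low-nibble-first packing order is an assumption:

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Decode one E2M1 nibble: 1 sign bit, 2 exponent bits, 1 mantissa bit.
// The 8 representable magnitudes are enumerated in a lookup table.
float decode_e2m1(uint8_t nibble) {
    static const float lut[8] = {0.0f, 0.5f, 1.0f, 1.5f, 2.0f, 3.0f, 4.0f, 6.0f};
    float m = lut[nibble & 0x7];
    return (nibble & 0x8) ? -m : m;
}

// Decode the shared E8M0 scale: a biased power-of-two exponent, 2^(e - 127).
// (The NaN encoding from the OCP MX spec is omitted here.)
float decode_e8m0(uint8_t e) {
    return std::ldexp(1.0f, int(e) - 127);
}

// Dequantize one 32-element MX block: 16 packed bytes plus 1 scale byte.
// Assumes the low nibble holds the even-indexed element.
void dequant_block(const uint8_t packed[16], uint8_t scale, float out[32]) {
    float s = decode_e8m0(scale);
    for (int i = 0; i < 16; ++i) {
        out[2 * i]     = s * decode_e2m1(packed[i] & 0xF);  // low nibble
        out[2 * i + 1] = s * decode_e2m1(packed[i] >> 4);   // high nibble
    }
}
```

In the actual GEMM kernels this conversion would happen per-fragment during the mainloop rather than as a separate pass, with the product accumulated in half/bfloat16 or fp32.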

Collaborator

lvhan028 commented Sep 2, 2025

FINALLY!!!!

@lvhan028 lvhan028 added the enhancement New feature or request label Sep 2, 2025
Collaborator

lvhan028 commented Sep 3, 2025

The build failed when compiling for compute_86:

ptxas /tmp/tmpxft_0003abdf_00000000-8_sm80_mxfp4.compute_86.ptx, line 2626; error   : Feature 'mul.bf16x2' requires .target sm_90 or higher
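
The error indicates that the packed-bfloat16 multiply instruction `mul.bf16x2` was emitted for a pre-Hopper target, where it is unavailable. How the PR resolved it is not shown here; a typical workaround on older architectures is to widen bfloat16 to fp32, multiply, and narrow back. A minimal host-side C++ sketch of that fallback (bfloat16 is the upper 16 bits of an IEEE-754 float):

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Widen bfloat16 (stored as raw bits) to fp32 by shifting into the high half.
float bf16_to_f32(uint16_t b) {
    uint32_t u = uint32_t(b) << 16;
    float f;
    std::memcpy(&f, &u, sizeof(f));
    return f;
}

// Narrow fp32 to bfloat16 by truncation (round-to-nearest-even omitted).
uint16_t f32_to_bf16(float f) {
    uint32_t u;
    std::memcpy(&u, &f, sizeof(u));
    return uint16_t(u >> 16);
}

// Emulated bfloat16 multiply via fp32 -- the usual fallback when the
// hardware instruction is not available on the target architecture.
uint16_t bf16_mul(uint16_t a, uint16_t b) {
    return f32_to_bf16(bf16_to_f32(a) * bf16_to_f32(b));
}
```

In device code the same idea is expressed with `__bfloat162float`/`__float2bfloat16` intrinsics guarded by `__CUDA_ARCH__`, so the fast path is only compiled where the instruction exists.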

@lvhan028 lvhan028 mentioned this pull request Sep 4, 2025
@lvhan028 lvhan028 merged commit 693c5b9 into InternLM:main Sep 4, 2025
7 of 9 checks passed
littlegy pushed a commit to littlegy/lmdeploy that referenced this pull request Sep 11, 2025
* unify gemm test

* mxfp468 conversion

* mxfp4 gemm

* add group gemm tests

* new sm70 tile scheduler

* mxfp4 model loading

* fix tp

* fix dispatch heuristic

* add half x mxfp4

* fix configs for sm70-89

* remove unused

* minor

* disable cache miss warning by default

* fix

* fix

* fuse bias

* fix split-k

* fix lint

* fix kernel names

* stochastic rounding experiment

* disable debug info
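
One commit above mentions a stochastic rounding experiment. As background (this is not the PR's implementation), stochastic rounding to a low-precision format adds a uniform random value to the bits that will be discarded before truncating, so a value rounds up with probability proportional to its distance past the lower representable value. A minimal C++ sketch for fp32-to-bfloat16:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Stochastically round fp32 to bfloat16: add a uniform random 16-bit value
// to the low (discarded) bits, then truncate. A carry into the kept bits
// occurs with probability equal to the discarded fraction.
uint16_t f32_to_bf16_stochastic(float f, uint32_t rnd16) {
    uint32_t u;
    std::memcpy(&u, &f, sizeof(u));
    u += (rnd16 & 0xFFFF);  // may carry into the upper 16 bits
    return uint16_t(u >> 16);
}
```

This keeps quantization error zero-mean in expectation, which is why it shows up in low-precision training and quantization experiments.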
