
convert float16 weight to bfloat16 for FP8 models#4276

Merged
lvhan028 merged 2 commits into InternLM:main from lvhan028:half-fp8-workround
Jan 15, 2026

Conversation

@lvhan028
Collaborator

fix #4261

In the model Qwen/Qwen3-4B-Instruct-2507-FP8, some parameters, such as "*.weight_scale_inv", are stored in half precision (float16), but the turbomind FP8 kernel is only compatible with bfloat16.
This PR implements a temporary workaround by converting half-precision weights to bfloat16.
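The workaround boils down to a plain dtype cast. A minimal sketch of what such a cast looks like in PyTorch (the tensor values here are illustrative, not taken from the model): bfloat16 keeps float32's 8-bit exponent but has only 7 mantissa bits, so the cast preserves the dynamic range of the scales while rounding their low-order bits.

```python
import torch

# Illustrative half-precision scale values (not from the actual checkpoint).
scale = torch.tensor([0.0078125, 1.5, 3.0e-5], dtype=torch.float16)

# The workaround: cast float16 -> bfloat16 so the turbomind FP8 kernel accepts it.
converted = scale.to(dtype=torch.bfloat16)

# Values survive the cast up to bfloat16's reduced mantissa precision.
print(converted.dtype)  # torch.bfloat16
```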

Contributor

Copilot AI left a comment


Pull request overview

This PR addresses issue #4261 by adding support for converting float16 weights to bfloat16 format in FP8 models, specifically for the Qwen/Qwen3-4B-Instruct-2507-FP8 model where parameters like *.weight_scale_inv are stored in half precision but require bfloat16 for compatibility with turbomind FP8 kernels.

Changes:

  • Added float16 to bfloat16 conversion in the process_fp8 function


return x.view(dtype=torch.uint8)
elif kind != 'weight_scale_inv' and x.dtype == torch.float:
return x.to(dtype=torch.bfloat16)
elif x.dtype == torch.float16:

Copilot AI Jan 15, 2026


The new float16 to bfloat16 conversion does not respect the 'weight_scale_inv' exclusion that exists for float32 tensors (line 65). This inconsistency means that weight_scale_inv parameters will be converted from float16 to bfloat16, but not from float32 to bfloat16. Consider whether this condition should also check kind != 'weight_scale_inv' to maintain consistency with the existing logic.

Suggested change
elif x.dtype == torch.float16:
elif kind != 'weight_scale_inv' and x.dtype == torch.float16:
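To make the branch ordering concrete, here is a hypothetical reconstruction of the dtype-normalization logic the excerpt comes from (the function name, parameter names, and surrounding structure are assumptions, not the actual `process_fp8` source). The comments mark the branch this PR adds and the inconsistency the review points out:

```python
import torch

def convert_dtype(x: torch.Tensor, kind: str) -> torch.Tensor:
    # Hypothetical sketch of the dtype handling discussed in this review.
    # Guard the FP8 dtype lookup so the sketch also runs on older torch builds.
    fp8 = getattr(torch, 'float8_e4m3fn', None)
    if fp8 is not None and x.dtype == fp8:
        # FP8 weights are reinterpreted as raw bytes for the kernel.
        return x.view(dtype=torch.uint8)
    elif kind != 'weight_scale_inv' and x.dtype == torch.float:
        # float32 tensors are downcast, except the inverse-scale tensors.
        return x.to(dtype=torch.bfloat16)
    elif x.dtype == torch.float16:
        # This PR's workaround: turbomind FP8 kernels accept bfloat16 only.
        # Unlike the float32 branch above, this one also converts
        # 'weight_scale_inv' tensors -- the inconsistency Copilot flagged.
        return x.to(dtype=torch.bfloat16)
    return x
```

Tracing the branches shows the asymmetry: a float32 `weight_scale_inv` falls through unchanged, while a float16 `weight_scale_inv` is converted to bfloat16.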

@lvhan028 lvhan028 merged commit 0e335e0 into InternLM:main Jan 15, 2026
5 checks passed

Development

Successfully merging this pull request may close these issues.

[Bug] Qwen/Qwen3-4B-Instruct-2507-FP8 outputs garbled text on lmdeploy 10.2

3 participants