
fix: correct weight loading prefix mapping for Qwen3-VL#18024

Merged
Kangyan-Zhou merged 3 commits into sgl-project:main from Lollipop:fix/qwen3-vl-weight-loading
Feb 2, 2026

Conversation

@Lollipop
Contributor

@Lollipop Lollipop commented Jan 31, 2026

Summary

Fix Qwen3-VL-8B model producing garbage output due to incorrect weight loading.

Fixes #17887

Problem

The weight loading code unconditionally copies embed_tokens.weight to lm_head.weight:

```python
if self.pp_group.is_last_rank and "model.embed_tokens.weight" in name:
    if "lm_head.weight" in params_dict:
        # copies embed_tokens to lm_head unconditionally
        ...
```

This is incorrect for models with tie_word_embeddings=False (like Qwen3-VL-8B), where lm_head has independent weights that should not be overwritten.

| Model | `tie_word_embeddings` | lm_head weights |
| --- | --- | --- |
| Qwen3-VL-2B | True | Shared with embed_tokens |
| Qwen3-VL-8B | False | Independent (should NOT be overwritten) |

Fix

Add a check to only copy when tie_word_embeddings=True:

```python
if (
    self.pp_group.is_last_rank
    and "model.embed_tokens.weight" in name
    and self.config.tie_word_embeddings  # <-- added check
):
    ...
```
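The effect of the guard can be sketched in isolation. This is a minimal, self-contained model of the copy path (hypothetical helper names, plain lists standing in for tensors; not the actual SGLang loader):

```python
# Minimal sketch of the guarded embed_tokens -> lm_head copy.
# Names (Config, load_weight) are illustrative, not SGLang's API.

class Config:
    def __init__(self, tie_word_embeddings: bool):
        self.tie_word_embeddings = tie_word_embeddings

def load_weight(config, params_dict, name, loaded_weight, is_last_rank=True):
    """Store a weight; mirror embed_tokens into lm_head only when tied."""
    params_dict[name] = loaded_weight
    if (
        is_last_rank
        and name == "model.embed_tokens.weight"
        and config.tie_word_embeddings  # the added guard
    ):
        params_dict["lm_head.weight"] = loaded_weight

# Tied checkpoint (e.g. a 2B-style config): lm_head mirrors embed_tokens.
tied = {"lm_head.weight": [0.0]}
load_weight(Config(True), tied, "model.embed_tokens.weight", [1.0, 2.0])
assert tied["lm_head.weight"] == [1.0, 2.0]

# Untied checkpoint (e.g. an 8B-style config): lm_head keeps its own weights.
untied = {"lm_head.weight": [9.0]}
load_weight(Config(False), untied, "model.embed_tokens.weight", [1.0, 2.0])
assert untied["lm_head.weight"] == [9.0]
```

Without the `tie_word_embeddings` check, the second case would silently replace the independent `lm_head` weights, which is exactly the failure mode described above.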


@JustinTong0323
Collaborator

This doesn't seem to be a valid fix; my test still shows garbage output.

@Lollipop Lollipop force-pushed the fix/qwen3-vl-weight-loading branch from 400c549 to ce0e92f Compare February 2, 2026 05:22
liuxiaoming added 2 commits February 2, 2026 13:26
…to lm_head

The weight loading code unconditionally copied embed_tokens.weight to
lm_head.weight, which is incorrect for models with tie_word_embeddings=False
(e.g. Qwen3-VL-8B). This caused garbage output from the 8B model.

Add a check for self.config.tie_word_embeddings to ensure embed_tokens
is only copied to lm_head when they are supposed to share weights.

Fixes sgl-project#17887
@Lollipop Lollipop force-pushed the fix/qwen3-vl-weight-loading branch from ce0e92f to 670aec5 Compare February 2, 2026 05:27
@JustinTong0323
Collaborator

/tag-and-rerun-ci

@github-actions github-actions bot added the run-ci label Feb 2, 2026
@Lollipop
Contributor Author

Lollipop commented Feb 2, 2026

@JustinTong0323 Thanks for testing! The previous fix was incorrect; I've updated the PR with the correct fix now.

Root Cause Clarification

The issue only affects models with tie_word_embeddings=False. Here's the difference:

| Model | `tie_word_embeddings` | Affected |
| --- | --- | --- |
| Qwen3-VL-2B | True | No |
| Qwen3-VL-4B | True | No |
| Qwen3-VL-8B | False | Yes |

For models with tie_word_embeddings=True, embed_tokens and lm_head share the same weights, so copying is correct.

For Qwen3-VL-8B (tie_word_embeddings=False), lm_head has its own independent weights. The unconditional copy overwrites these weights with embed_tokens, causing garbage output.
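To check which path a given checkpoint takes, the flag can be read straight from its `config.json`. A small sketch (the default-to-True fallback mirrors the usual Hugging Face convention and is an assumption here):

```python
# Hypothetical helper: decide whether a checkpoint ties embed_tokens
# and lm_head, given the raw contents of its config.json.
import json

def embeddings_tied(config_json: str) -> bool:
    cfg = json.loads(config_json)
    # HF-style configs commonly default tie_word_embeddings to True
    # when the key is absent (assumption, not verified per-model).
    return cfg.get("tie_word_embeddings", True)

# 8B-style config: independent lm_head.
assert embeddings_tied('{"tie_word_embeddings": false}') is False
# 2B-style config: shared weights.
assert embeddings_tied('{"tie_word_embeddings": true}') is True
```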

Updated Fix

The new fix adds a check for self.config.tie_word_embeddings:

```python
if (
    self.pp_group.is_last_rank
    and "model.embed_tokens.weight" in name
    and self.config.tie_word_embeddings  # <-- only copy when weights are shared
):
    ...
```

Could you please test again with Qwen3-VL-8B specifically? The 2B/4B models should work fine with or without this fix.

Collaborator

@JustinTong0323 JustinTong0323 left a comment


LGTM

@Kangyan-Zhou Kangyan-Zhou merged commit 522e13b into sgl-project:main Feb 2, 2026
40 of 78 checks passed
@Lollipop Lollipop deleted the fix/qwen3-vl-weight-loading branch February 3, 2026 07:12
charlesHsuGG pushed a commit to charlesHsuGG/sglang that referenced this pull request Feb 5, 2026
…18024)

Co-authored-by: liuxiaoming <liuxiaoming@modelbest.cn>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
sfiisf pushed a commit to sfiisf/sglang that referenced this pull request Feb 5, 2026
…18024)

Co-authored-by: liuxiaoming <liuxiaoming@modelbest.cn>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Johnsonms pushed a commit to Johnsonms/sglang that referenced this pull request Feb 14, 2026
…18024)

Co-authored-by: liuxiaoming <liuxiaoming@modelbest.cn>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>


Successfully merging this pull request may close these issues.

[Bug] the model result is wrong when using sglang to serve qwen3-vl-8b-instruct
