Skip to content

support qwen3-next eagle3#14607

Merged
Kangyan-Zhou merged 7 commits intosgl-project:mainfrom
sleepcoo:support_qwen3next_eagle3
Feb 1, 2026
Merged

support qwen3-next eagle3#14607
Kangyan-Zhou merged 7 commits intosgl-project:mainfrom
sleepcoo:support_qwen3next_eagle3

Conversation

@sleepcoo
Copy link
Collaborator

@sleepcoo sleepcoo commented Dec 8, 2025

Motivation

support qwen3 next eagle3,https://huggingface.co/lukeysong/qwen3-next-draft/tree/main

Benchmarking and Profiling

use Specfoge

python bench_eagle3.py --model /workdir/huggingface.co/Qwen/Qwen3-Next-80B-A3B-Instruct-FP8 --speculative-algorithm EAGLE3 --speculative-draft-model-path /workdir/epoch_1_step_120000/ --port 30000 --config-list 1,3,1,4 --dtype bfloat16 --tp 4

tp4 h20

Benchmark Latency (ms) Throughput (tok/s) Accept Length Accuracy Num Questions Valid Predictions Num Samples
mtbench 461.65 234.52 2.59 N/A 80 0 80
gsm8k 137.10 234.18 3.13 0.955 200 200 200
humaneval 309.11 304.63 3.45 0.000 164 164 200
math500 515.99 317.04 3.52 0.615 200 200 200

@gemini-code-assist
Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@sleepcoo sleepcoo requested a review from hnyls2002 December 10, 2025 03:26
@sleepcoo
Copy link
Collaborator Author

sleepcoo commented Jan 4, 2026

/tag-and-rerun-ci

@github-actions github-actions bot added the run-ci label Jan 4, 2026
@Kangyan-Zhou Kangyan-Zhou merged commit 3ca29df into sgl-project:main Feb 1, 2026
155 of 165 checks passed
charlesHsuGG pushed a commit to charlesHsuGG/sglang that referenced this pull request Feb 2, 2026
sfiisf pushed a commit to sfiisf/sglang that referenced this pull request Feb 5, 2026
Johnsonms pushed a commit to Johnsonms/sglang that referenced this pull request Feb 14, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants