
Support group router for moe models#4120

Merged
lvhan028 merged 2 commits into InternLM:main from RunningLeon:group-router
Nov 13, 2025

Conversation

@RunningLeon
Collaborator

Motivation

Support group router for moe models

Use cases (Optional)

pipeline

from lmdeploy import pipeline, GenerationConfig, PytorchEngineConfig

if __name__ == '__main__':
    backend_config = PytorchEngineConfig(hf_overrides=dict(router_n_groups=4))
    model_path = 'Qwen/Qwen3-30B-A3B'
    pipe = pipeline(model_path, backend_config=backend_config)

    resps = pipe(['Hi.'])
    for res in resps:
        print(res)

api server

lmdeploy serve api_server \
    Qwen/Qwen3-30B-A3B \
    --backend pytorch \
    --hf-overrides '{"router_n_groups": 4}'
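For intuition, group routing of the kind enabled by `router_n_groups` typically partitions the experts into groups and restricts top-k selection to the best-scoring groups. The sketch below is a hedged, illustrative NumPy version of that idea; the function name, parameters, and exact scoring rule are assumptions, and the actual lmdeploy implementation may differ:

```python
import numpy as np


def group_limited_topk(scores, n_groups=4, topk_groups=2, topk_experts=4):
    """Illustrative group-limited top-k routing for one token.

    scores: (num_experts,) router logits. Experts are split into
    `n_groups` contiguous groups; only the `topk_groups` groups with the
    highest per-group maximum score are eligible for top-k selection.
    """
    num_experts = scores.shape[0]
    assert num_experts % n_groups == 0
    group_size = num_experts // n_groups
    grouped = scores.reshape(n_groups, group_size)

    # Score each group by its best expert and keep the top groups.
    group_scores = grouped.max(axis=-1)
    keep_groups = np.argsort(group_scores)[-topk_groups:]

    # Mask out experts belonging to the discarded groups.
    masked = np.full_like(grouped, -np.inf)
    masked[keep_groups] = grouped[keep_groups]
    masked = masked.reshape(-1)

    # Standard top-k over the remaining experts.
    topk_ids = np.argsort(masked)[-topk_experts:]
    return np.sort(topk_ids)


if __name__ == '__main__':
    # With monotonically increasing scores, the last two groups win,
    # so the top-4 experts all come from groups 2 and 3.
    print(group_limited_topk(np.arange(16.0)))  # -> [12 13 14 15]
```

Restricting routing to a few groups bounds how many expert groups (and hence devices, in expert-parallel setups) each token can touch, which is the usual motivation for this routing scheme.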

Checklist

  1. Pre-commit or other linting tools are used to fix potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure correctness.
  3. If the modification depends on a newer version of downstream projects, this PR should be tested with all supported versions of those projects.
  4. The documentation has been modified accordingly, e.g. docstrings or example tutorials.

@lvhan028 lvhan028 added the enhancement New feature or request label Nov 12, 2025
@lvhan028 lvhan028 requested a review from grimoire November 13, 2025 03:40
@lvhan028 lvhan028 merged commit bf52c04 into InternLM:main Nov 13, 2025
5 checks passed
