[Refactor]: Remove tokenizer when building engine by RunningLeon · Pull Request #3978 · InternLM/lmdeploy

RunningLeon · 2025-09-16T06:47:46Z

Motivation

Remove tokenizer argument in mp engine for bad serialization issue

Modification

Please briefly describe what modification is made in this PR.

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

Pre-commit or other linting tools are used to fix the potential lint issues.
The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
The documentation has been modified accordingly, like docstring or example tutorials.

grimoire · 2025-09-16T07:06:44Z

https://github.com/RunningLeon/lmdeploy/blob/a7b8d2e294aaf6e133e48f5fe9c93d10e509cbc1/lmdeploy/pytorch/engine/mp_engine/zmq_engine.py#L49 Do we need to update in Multiprocessing engine?

RunningLeon · 2025-09-16T07:11:05Z

https://github.com/RunningLeon/lmdeploy/blob/a7b8d2e294aaf6e133e48f5fe9c93d10e509cbc1/lmdeploy/pytorch/engine/mp_engine/zmq_engine.py#L49 Do we need to update in Multiprocessing engine?

Tested ok with zmq engine. So no need to change it.

grimoire

LGTM

lmdeploy/pytorch/engine/mp_engine/ray_engine.py

refactoring again

lmdeploy/pytorch/engine/model_agent.py

test failed

grimoire · 2025-09-16T13:07:08Z

lmdeploy/pytorch/engine/model_agent.py

        self.model_config = model_config
        self.cache_config = cache_config
-        self.tokenizer = tokenizer
+        self.tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)


Is this true for every model?

grimoire

LGTM

to not serialize tokenizer

a7b8d2e

RunningLeon requested a review from grimoire September 16, 2025 06:47

grimoire previously approved these changes Sep 16, 2025

View reviewed changes

lvhan028 reviewed Sep 16, 2025

View reviewed changes

lmdeploy/pytorch/engine/mp_engine/ray_engine.py Outdated Show resolved Hide resolved

remove tokenizer when build mp engine

6fc03b4

RunningLeon changed the title ~~[Fix]: do not serialize tokenizer in ray mp engine~~ [Fix]: Remove tokenizer for mp engine Sep 16, 2025

RunningLeon requested a review from lvhan028 September 16, 2025 08:40

remove tokenizer argument when building engine

afa05c2

RunningLeon changed the title ~~[Fix]: Remove tokenizer for mp engine~~ [Refactor]: Remove tokenizer when building engine Sep 16, 2025

RunningLeon added the enhancement New feature or request label Sep 16, 2025

lvhan028 requested a review from grimoire September 16, 2025 10:54

lvhan028 previously approved these changes Sep 16, 2025

View reviewed changes

lvhan028 reviewed Sep 16, 2025

View reviewed changes

lmdeploy/pytorch/engine/model_agent.py Show resolved Hide resolved

lvhan028 self-requested a review September 16, 2025 12:06

use hf tokenizer

b56a899

grimoire reviewed Sep 16, 2025

View reviewed changes

use raw tokenizer to align with original

eead9b4

lvhan028 approved these changes Sep 17, 2025

View reviewed changes

grimoire approved these changes Sep 17, 2025

View reviewed changes

lvhan028 merged commit 8095307 into InternLM:main Sep 17, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Refactor]: Remove tokenizer when building engine#3978

[Refactor]: Remove tokenizer when building engine#3978
lvhan028 merged 5 commits intoInternLM:mainfrom
RunningLeon:no-serialize-tokenizer

RunningLeon commented Sep 16, 2025 •

edited

Loading

Uh oh!

grimoire commented Sep 16, 2025

Uh oh!

RunningLeon commented Sep 16, 2025

Uh oh!

grimoire left a comment

Uh oh!

Uh oh!

Uh oh!

grimoire Sep 16, 2025

Uh oh!

grimoire left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

RunningLeon commented Sep 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modification

BC-breaking (Optional)

Use cases (Optional)

Checklist

Uh oh!

grimoire commented Sep 16, 2025

Uh oh!

RunningLeon commented Sep 16, 2025

Uh oh!

grimoire left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

grimoire Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

grimoire left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

RunningLeon commented Sep 16, 2025 •

edited

Loading