Skip to content

fix: duplicated token usage in /chat/completions stream mode#3859

Merged
lvhan028 merged 4 commits intoInternLM:mainfrom
Huarong:fix-dupcated_tokens_usage
Aug 21, 2025
Merged

fix: duplicated token usage in /chat/completions stream mode#3859
lvhan028 merged 4 commits intoInternLM:mainfrom
Huarong:fix-dupcated_tokens_usage

Conversation

@Huarong
Copy link
Copy Markdown
Contributor

@Huarong Huarong commented Aug 19, 2025

Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily receiving feedbacks. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers.

Motivation

fix #3832

Modification

Check if the returned chunk is the final chunk.

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
  3. If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

@lvhan028 lvhan028 merged commit e00b3ba into InternLM:main Aug 21, 2025
5 checks passed
littlegy pushed a commit to littlegy/lmdeploy that referenced this pull request Sep 11, 2025
…M#3859)

* fix: duplicated token usage in /chat/completions stream mode

* remove mistake code

* get back to double quote

* yapf format
@Huarong Huarong deleted the fix-dupcated_tokens_usage branch November 22, 2025 15:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug] duplicated token usage in /chat/completions stream mode

3 participants