[diffusion] fix the bug of redundant memory usage on GPU-0 #18221

Merged
mickqian merged 1 commit into sgl-project:main from nono-Sang:fix_gpu_mem on Feb 4, 2026

@nono-Sang
Contributor

Motivation

Modifications

During multi-GPU inference, GPU 0 shows num_gpus - 1 additional instances of 520 MiB memory usage. The code causing this issue is:

gpu_mem_before_loading = current_platform.get_available_gpu_memory()

For example, with 4 GPUs used for inference:

  • Before fix:
image
  • After fix:
image
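
The redundant usage described above grows linearly with the number of GPUs. Below is a minimal sketch of that arithmetic, taking the 520 MiB per-instance figure from the description as given. The function and constant names are hypothetical and are not part of the SGLang codebase.

```python
# Hypothetical helper for illustration only; not part of SGLang.
# 520 MiB is the per-instance figure reported in the PR description.
MIB_PER_EXTRA_INSTANCE = 520


def redundant_gpu0_mib(num_gpus: int, per_instance_mib: int = MIB_PER_EXTRA_INSTANCE) -> int:
    """Estimate the redundant memory on GPU 0, in MiB, before the fix."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    # Every GPU other than GPU 0 contributes one extra instance on GPU 0.
    return (num_gpus - 1) * per_instance_mib


print(redundant_gpu0_mib(4))  # 3 * 520 = 1560
```

For the 4-GPU case above, this comes to roughly 1560 MiB of redundant usage on GPU 0. With a single GPU the estimate is 0.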

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@gemini-code-assist
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@github-actions github-actions bot added the diffusion SGLang Diffusion label Feb 4, 2026
@mickqian
Collaborator

mickqian commented Feb 4, 2026

/tag-and-rerun-ci

@github-actions github-actions bot added the run-ci label Feb 4, 2026
@mickqian mickqian merged commit 4c40304 into sgl-project:main Feb 4, 2026
149 of 157 checks passed
charlesHsuGG pushed a commit to charlesHsuGG/sglang that referenced this pull request Feb 5, 2026
Labels

diffusion SGLang Diffusion run-ci

2 participants