Update B200 Dsv4 configs by wzhao18 · Pull Request #1655 · SemiAnalysisAI/InferenceX

wzhao18 · 2026-06-03T16:46:25Z

Note

Low Risk
Benchmark-only image and vLLM serve-flag changes; no production auth or application logic.

Overview
Updates DeepSeek-V4 FP4 B200 vLLM fixed-seq-len benchmarks to a pinned vLLM nightly image and turns on expert-parallel load balancing (EPLB) whenever DP attention (DEP) is active.

For DP_ATTENTION=true, dsv4_fp4_b200_vllm.sh now passes --enable-eplb with an NCCL communicator config alongside the existing deep_gemm_mega_moe MoE backend. perf-changelog.yaml documents this under dsv4-fp4-b200-vllm.

^{Reviewed by Cursor Bugbot for commit e1b0750. Bugbot is set up for automated code reviews on this repo. Configure here.}

github-actions · 2026-06-03T16:46:37Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-06-03T17:11:48Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26899483031
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26899483031

github-actions · 2026-06-03T18:02:48Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26900694153
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26900694153

github-actions · 2026-06-03T18:20:22Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26900694153
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26900694153

Set environment variables for NIXL EPLB configuration.

github-actions · 2026-06-03T19:16:16Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26905612623
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26905612623

Add eplb-config option to EPLB_ARGS for NCCL.

Removed unnecessary environment variable exports for NCCL and UCX.

github-actions · 2026-06-03T21:27:32Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26905612623
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26905612623

github-actions · 2026-06-04T04:42:49Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26914041013
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26914041013

github-actions · 2026-06-04T05:21:02Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26914041013
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26914041013

wzhao18 · 2026-06-04T14:44:47Z

@kedarpotdar-nv @functionstackx @Oseltamivir Ready for review/merge. Thanks!

Oseltamivir · 2026-06-05T18:13:08Z

/reuse-sweep-run

Oseltamivir

lgtm

github-actions · 2026-06-05T18:14:54Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=27032104407
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=27032104407

Enable EPLB for DEP configs

8b3e665

wzhao18 requested a review from a team June 3, 2026 16:46

github-project-automation Bot added this to InferenceMAX Board Jun 3, 2026

Perf changelog

bb31d48

wzhao18 added the full-sweep-enabled label Jun 3, 2026

Update to nightly image

c361dad

wzhao18 requested review from jgangani and kedarpotdar-nv as code owners June 3, 2026 17:10

wzhao18 added 2 commits June 3, 2026 14:42

Add NCCL_NET_PLUGIN and UCX_TLS exports

94e89cc

Set environment variables for NIXL EPLB configuration.

Merge branch 'main' into wzhao/dsv4-b200-eplb

eba41e5

wzhao18 added 2 commits June 3, 2026 17:26

Enhance EPLB_ARGS with eplb-config

31a12b0

Add eplb-config option to EPLB_ARGS for NCCL.

Clean up environment variable exports in script

783dfe8

Removed unnecessary environment variable exports for NCCL and UCX.

wzhao18 changed the title ~~[WIP] Update B200 Dsv4 configs~~ Update B200 Dsv4 configs Jun 4, 2026

Oseltamivir approved these changes Jun 5, 2026

View reviewed changes

Merge branch 'main' into wzhao/dsv4-b200-eplb

e1b0750

Oseltamivir merged commit 53f61f8 into main Jun 5, 2026
18 of 19 checks passed

Oseltamivir deleted the wzhao/dsv4-b200-eplb branch June 5, 2026 18:14

github-project-automation Bot moved this to Done in InferenceMAX Board Jun 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update B200 Dsv4 configs#1655

Update B200 Dsv4 configs#1655
Oseltamivir merged 8 commits into
mainfrom
wzhao/dsv4-b200-eplb

wzhao18 commented Jun 3, 2026 •

edited by cursor Bot

Loading

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

wzhao18 commented Jun 4, 2026

Uh oh!

Oseltamivir commented Jun 5, 2026

Uh oh!

Oseltamivir left a comment

Uh oh!

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

wzhao18 commented Jun 3, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

github-actions Bot commented Jun 4, 2026

Uh oh!

wzhao18 commented Jun 4, 2026

Uh oh!

Oseltamivir commented Jun 5, 2026

Uh oh!

Oseltamivir left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

wzhao18 commented Jun 3, 2026 •

edited by cursor Bot

Loading