fix(zero): enable vmap on LinearFunctionForZeroStage3 by roycho96 · Pull Request #8023 · deepspeedai/DeepSpeed

roycho96 · 2026-05-22T11:03:30Z

Follow-up to #7916.

Adds generate_vmap_rule = True to LinearFunctionForZeroStage3 so torch.func.vmap works on the Function directly. The previous PR covered grad / jacrev via setup_context but not vmap.
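For readers unfamiliar with the mechanism, here is a minimal sketch of what the flag does on a custom autograd Function. `ToyLinearFunction` is a hypothetical stand-in written for illustration, not DeepSpeed's `LinearFunctionForZeroStage3`, and the tensor shapes are arbitrary. It assumes a PyTorch version that supports `setup_context` and `torch.func` (2.0 or later).

```python
import torch
from torch.func import grad, vmap


class ToyLinearFunction(torch.autograd.Function):
    """Hypothetical stand-in for illustration; not the DeepSpeed class."""

    # Ask PyTorch to derive the vmap rule by running forward/backward under vmap.
    # This requires forward, setup_context and backward to use only vmappable ops.
    generate_vmap_rule = True

    @staticmethod
    def forward(input, weight, bias):
        # Pure tensor op: no ctx argument, no closure state.
        return input.matmul(weight.t()) + bias

    @staticmethod
    def setup_context(ctx, inputs, output):
        input, weight, _bias = inputs
        ctx.save_for_backward(input, weight)

    @staticmethod
    def backward(ctx, grad_output):
        input, weight = ctx.saved_tensors
        out_features, in_features = weight.shape
        # Flatten any leading dimensions so the same code handles 1D and 2D inputs.
        grad_2d = grad_output.reshape(-1, out_features)
        input_2d = input.reshape(-1, in_features)
        grad_input = grad_output.matmul(weight)
        grad_weight = grad_2d.t().matmul(input_2d)
        grad_bias = grad_2d.sum(0)
        return grad_input, grad_weight, grad_bias


x = torch.randn(5, 3)  # batch of 5 samples, 3 features each
w = torch.randn(4, 3)
b = torch.randn(4)

# vmap over the batch dimension of x; w and b are captured unbatched.
batched_out = vmap(lambda xi: ToyLinearFunction.apply(xi, w, b))(x)

# Per-sample gradients with respect to each input row.
per_sample_grads = vmap(grad(lambda xi: ToyLinearFunction.apply(xi, w, b).sum()))(x)
```

Removing the `generate_vmap_rule = True` line from this toy class (without adding a manual `vmap` staticmethod) should make both `vmap` calls fail, which mirrors the behavior this PR addresses.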

Test:
pytest tests/unit/v1/zero/test_zero_functorch_linear.py::TestLinearFunctionVmap

The forward is a pure tensor op (addmm / matmul + bias) with no closure state, so PyTorch's auto-generated vmap rule produces correct batched semantics. Without this, vmap (and vmap(grad)) over the Function raises 'does not have vmap support', which is the case the setup_context fix in deepspeedai#7916 left unaddressed.

Signed-off-by: Sung Hyun Cho <hope5487@gmail.com>

roycho96 requested review from loadams, tjruwase and tohtana as code owners May 22, 2026 11:03


roycho96 wants to merge 1 commit into deepspeedai:master from roycho96:fix/zero-linear-vmap-rule
