fix: compile inner model before DDP wrapping to prevent Dynamo tracing DDP internals by anishesg · Pull Request #4017 · huggingface/accelerate

anishesg · 2026-04-23T10:56:00Z

What does this PR do?

When using torch.compile with multi-GPU (DDP) training via Accelerate, users hit a crash during the forward pass:

torch._dynamo.exc.Unsupported: Unsupported method call
  Explanation: Dynamo does not know how to trace method `set_runtime_stats_and_log` of class `Logger`

The root cause is in accelerator.py's prepare_model: the code was wrapping the model with DistributedDataParallel first, then applying torch.compile to the DDP wrapper. This caused Dynamo to trace into DDP's internal _pre_forward hook which calls self.logger.set_runtime_stats_and_log() — a method on a user-defined object that Dynamo cannot trace.

The fix follows the PyTorch-recommended pattern for DDP + torch.compile: compile the inner model before wrapping it with DDP. DDP then operates outside the compiled region, so its internal logging and communication hooks are never seen by Dynamo. This is applied to both the MULTI_GPU and MULTI_CPU DDP paths in prepare_model. The final compile guard is also updated to skip models that already have compiled submodules (via has_compiled_regions), preventing the DDP wrapper from being double-compiled.

Fixes #3991

…g DDP internals ## What does this PR do? Signed-off-by: anish k <ak8686@princeton.edu>

yuxinyuan · 2026-05-21T02:38:31Z

https://docs.pytorch.org/docs/2.12/notes/ddp.html The pytorch notes specifically mention DDP works with TorchDynamo. When used with TorchDynamo, apply the DDP model wrapper before compiling the model, such that torchdynamo can apply DDPOptimizer (graph-break optimizations) based on DDP bucket sizes.

Maybe it's a pytorch issue ?

fix: compile inner model before DDP wrapping to prevent Dynamo tracin…

49d6499

…g DDP internals ## What does this PR do? Signed-off-by: anish k <ak8686@princeton.edu>

anishesg mentioned this pull request Apr 25, 2026

Isssue when using torch.compile #3991

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: compile inner model before DDP wrapping to prevent Dynamo tracing DDP internals#4017

fix: compile inner model before DDP wrapping to prevent Dynamo tracing DDP internals#4017
anishesg wants to merge 1 commit into
huggingface:mainfrom
anishesg:fix/ph-issue-3991

anishesg commented Apr 23, 2026

Uh oh!

yuxinyuan commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

anishesg commented Apr 23, 2026

What does this PR do?

Uh oh!

yuxinyuan commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants