Add UNet for Latent Diffusion by patil-suraj · Pull Request #5 · huggingface/diffusers

patil-suraj · 2022-06-08T09:30:52Z

No description provided.

Fix code quality

update for code quality check

[testing] pipeline test!

use op override and adjust block size to 2048

allow sft files to go.

Audio decoder

`fix-copies`

[agents docs] restructure modular.md: standalone reusability + IO-respect patterns Distilled from the ErnieImage modular pipeline review (PR #13498): - New "Common modular conventions" section: skim qwenimage / flux2 / wan / helios first, mirroring the references-driven shape of models.md / pipelines.md. - Promoted "Standalone block reusability" to a Key pattern. Each block (text encoder, VAE encoder, prepare-latents, denoise, decoder) must run on its own; encoders take raw inputs only, per-prompt expansion happens in a dedicated input step inside the core denoise sequence. Replaces old gotchas #4 (pre-computed encoder outputs) and #5 (VAE encode in prepare-latents). - Promoted "Flat block assembly" to a Key pattern (was gotcha #7). - New gotcha "Respect the declared IO system": one rule covering three bypass directions — defensive `getattr` reads of declared components/state, undeclared `block_state` writes, and direct `state.set()` calls that skip `set_block_state` entirely. - Reworked InputParam/OutputParam section to link to INPUT_PARAM_TEMPLATES / OUTPUT_PARAM_TEMPLATES in modular_pipeline_utils.py (the registry is dynamic) and added a non-template example. - Added a distilled-checkpoint exception to the `guidance_scale`-as-input gotcha — distilled flux-style models legitimately accept it. - Dropped the "inputs duplicating derivable state" gotcha (uncommon). Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

[agents docs] restructure modular.md: standalone reusability + IO-respect patterns Distilled from the ErnieImage modular pipeline review (PR huggingface#13498): - New "Common modular conventions" section: skim qwenimage / flux2 / wan / helios first, mirroring the references-driven shape of models.md / pipelines.md. - Promoted "Standalone block reusability" to a Key pattern. Each block (text encoder, VAE encoder, prepare-latents, denoise, decoder) must run on its own; encoders take raw inputs only, per-prompt expansion happens in a dedicated input step inside the core denoise sequence. Replaces old gotchas huggingface#4 (pre-computed encoder outputs) and huggingface#5 (VAE encode in prepare-latents). - Promoted "Flat block assembly" to a Key pattern (was gotcha huggingface#7). - New gotcha "Respect the declared IO system": one rule covering three bypass directions — defensive `getattr` reads of declared components/state, undeclared `block_state` writes, and direct `state.set()` calls that skip `set_block_state` entirely. - Reworked InputParam/OutputParam section to link to INPUT_PARAM_TEMPLATES / OUTPUT_PARAM_TEMPLATES in modular_pipeline_utils.py (the registry is dynamic) and added a non-template example. - Added a distilled-checkpoint exception to the `guidance_scale`-as-input gotcha — distilled flux-style models legitimately accept it. - Dropped the "inputs duplicating derivable state" gotcha (uncommon). Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Finding huggingface#1 — attention_kwargs plumbing: Both transformers now decorate forward() with @apply_lora_scale('attention_kwargs') (matches Wan); pipelines forward attention_kwargs to the transformer + encode_kv_cache, and the unused parameter is dropped from the inner _forward_train / _forward_cache / _forward_inference signatures. Pipeline docstrings updated to the standard wording. Finding huggingface#2 — naming: Rename far_cfg -> layout_cfg in the bidi transformer (the bidi path is not FAR; the FAR transformer keeps far_cfg, which is accurate there). Finding huggingface#3 — scheduler state machine: Add _step_index, _begin_index, step_index property, begin_index property, set_begin_index(), _init_step_index(). step() lazily initializes and advances the counter so downstream callbacks / composable schedulers can observe rollout progress. Sigma resolution remains a pure function of (timestep, r_timestep) — calling step() twice with identical args still returns identical prev_sample (idempotent). Finding huggingface#4 — redundant @torch.no_grad(): Drop the redundant decorators on bidi pipeline's encode_video and FAR pipeline's encode_kv_cache (callers are already in __call__'s no-grad scope). Finding huggingface#5 — dead code: Remove the unreachable temb.ndim == 2 else branch from the bidi transformer's output-norm path (condition_embedder.forward always returns a 3D temb). Finding huggingface#6 — private rename: forward_far_patchify[_inference] -> _forward_far_patchify[_inference] (only called internally by _forward_train / _forward_cache / _forward_inference). Finding huggingface#7 — pipeline comment numbering: Bidi + FAR pipelines renumber steps so the # 4. slot is no longer skipped. Finding huggingface#8 — mask-mod comment numbering: _build_causal_mask numbered comments now run 1) 2) 3) ... (was 1) 3) 4) ...). Tests: - New test_step_index_advances + test_set_begin_index_anchors_step_index in the scheduler test file exercise the new state machine. - All existing pipeline / transformer / scheduler tests still pass (85 passed, 85 skipped on CPU). Bit-exact: 8-step rollout vs the previous formulation, max abs diff = 0.0 (the new sigma-lookup is byte-identical to t/num_train_timesteps on this schedule).

patil-suraj added 4 commits June 8, 2022 11:29

add unet for ldm

4ea4429

rename to UNetLDMModel

2f24ce1

remove unused imports

a9374a0

fix einsum

b903d3d

patil-suraj merged commit e7026ed into main Jun 8, 2022

williamberman pushed a commit to williamberman/diffusers that referenced this pull request Sep 18, 2023

Merge pull request huggingface#5 from dotieuthien/add-convert-tensorrt

9a607d9

Fix code quality

yiyixuxu pushed a commit that referenced this pull request Jan 21, 2024

Merge pull request #5 from scxue/feat/sa-solver

de4de28

update for code quality check

Beinsezii mentioned this pull request Jun 12, 2024

StableDiffusion3Pipeline from_pretrained Exception #8488

Closed

sayakpaul added a commit that referenced this pull request Aug 1, 2024

Merge pull request #5 from huggingface/yiyi-test-pipeline

bd3320f

[testing] pipeline test!

deforum-art mentioned this pull request Aug 4, 2024

[FLUX] add Img2Img pipeline #9070

Closed

5 tasks

yuyanpeng-google pushed a commit to yuyanpeng-google/diffusers that referenced this pull request Oct 30, 2025

Merge pull request huggingface#5 from yuyanpeng-google/yuyan-dev

99c7fec

use op override and adjust block size to 2048

sayakpaul pushed a commit that referenced this pull request Nov 25, 2025

Merge pull request #5 from huggingface/small-flux2-transformer-fix

89e42d9

allow sft files to go.

dg845 added a commit that referenced this pull request Jan 6, 2026

Merge pull request #5 from huggingface/audio-decoder

7bb4cf7

Audio decoder

sayakpaul pushed a commit that referenced this pull request Jan 13, 2026

Merge pull request #5 from huggingface/zRzRzRzRzRzRzR-cogview

2c9c740

`fix-copies`

github-actions Bot mentioned this pull request Apr 29, 2026

feat: Add Motif-Video model and pipelines #13551

Merged

6 tasks

github-actions Bot mentioned this pull request May 1, 2026

[feat] JoyAI-JoyImage-Edit support #13444

Merged

This was referenced May 21, 2026

[agents docs] review-rules: add Tripwires section #13775

Open

Add AnyFlow Any-Step Video Diffusion Pipelines (Bidirectional + FAR Causal) #13745

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add UNet for Latent Diffusion#5

Add UNet for Latent Diffusion#5
patil-suraj merged 4 commits into
mainfrom
add-ldm

patil-suraj commented Jun 8, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

patil-suraj commented Jun 8, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant