Contrib: FLUX.1-lite-8B-alpha (native FLUX.1 compatibility) by jimburtoft · Pull Request #147 · aws-neuron/neuronx-distributed-inference

jimburtoft · 2026-04-28T06:06:59Z

Summary

FLUX.1-lite-8B-alpha (Freepik) uses the same architecture as FLUX.1-dev except for depth: it has 8 double-stream MMDiT blocks instead of 19. All other components (CLIP + T5-XXL encoders, VAE, scheduler, RoPE) are the same.
NxDI's first-party FLUX.1 implementation reads num_layers from the model's config.json at runtime, so FLUX.1-lite works out of the box with no custom modeling code.
This contrib provides a standalone generation script, integration tests, and documentation demonstrating native compatibility.
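
To illustrate the config-driven depth described above, here is a minimal sketch that reads `num_layers` from a `config.json` and compares it with the FLUX.1-dev depth quoted in this PR. It does not call NxDI. The helper names and the one-key stand-in config are assumptions made for illustration, and a real config file carries many more fields.

```python
import json
import tempfile
from pathlib import Path

# Depths quoted in the PR description; "num_layers" is the key the PR names.
FLUX1_DEV_DOUBLE_BLOCKS = 19
FLUX1_LITE_DOUBLE_BLOCKS = 8


def read_double_block_count(config_path: Path) -> int:
    """Return the number of double-stream blocks declared in a config.json."""
    with config_path.open() as f:
        config = json.load(f)
    return int(config["num_layers"])


def removed_blocks(config_path: Path) -> int:
    """Return how many double-stream blocks a checkpoint drops relative to FLUX.1-dev."""
    return FLUX1_DEV_DOUBLE_BLOCKS - read_double_block_count(config_path)


# Hypothetical stand-in config holding only the field discussed here.
with tempfile.TemporaryDirectory() as tmp:
    lite_config = Path(tmp) / "config.json"
    lite_config.write_text(json.dumps({"num_layers": FLUX1_LITE_DOUBLE_BLOCKS}))
    lite_depth = read_double_block_count(lite_config)
    lite_removed = removed_blocks(lite_config)

print(f"double-stream blocks: {lite_depth} ({lite_removed} fewer than FLUX.1-dev)")
```

Because the depth comes from the checkpoint's own config rather than from a hard-coded constant, the same modeling code can serve both checkpoints.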

Validation Results (trn2.3xlarge, LNC=2, TP=4)

Metric	Value
Resolution	1024x1024
Inference steps	25
E2E generation time	5.91s avg
Pipeline steps/sec	4.23
Backbone forward/sec	4.49
Compilation time	~128s
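
The figures in the table can be cross-checked with a little arithmetic. The sketch below assumes that "pipeline steps/sec" is the step count divided by end-to-end time, and that each inference step runs exactly one backbone forward pass. Neither assumption is stated in the PR, so treat the derived non-backbone time as an estimate, not a measurement.

```python
# Figures copied from the validation table above.
INFERENCE_STEPS = 25
E2E_SECONDS = 5.91
PIPELINE_STEPS_PER_SEC = 4.23
BACKBONE_FORWARDS_PER_SEC = 4.49

# Assumption: pipeline steps/sec = steps / end-to-end time.
implied_steps_per_sec = INFERENCE_STEPS / E2E_SECONDS

# Assumption: one backbone forward pass per inference step.
backbone_seconds = INFERENCE_STEPS / BACKBONE_FORWARDS_PER_SEC

# Whatever remains would be text encoding, VAE decode, scheduler and host overhead.
non_backbone_seconds = E2E_SECONDS - backbone_seconds

print(f"implied pipeline steps/sec: {implied_steps_per_sec:.2f}")
print(f"time in backbone forwards:  {backbone_seconds:.2f}s")
print(f"time outside the backbone:  {non_backbone_seconds:.2f}s")
```

Under these assumptions the implied rate matches the reported 4.23 steps/sec, and the backbone accounts for roughly 5.57s of the 5.91s total.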

Checklist

Model Type

Diffusion/image generation model

Contribution Contents

README with model info, benchmarks, usage instructions
src/ directory with generation script
test/integration/test_model.py with 3 passing tests
Sample output image
vLLM integration (N/A -- diffusion model)

Testing

All code tested on Neuron hardware (trn2.3xlarge)
All numbers in README are measured, not estimated
Integration tests pass: smoke test, image generation, timing
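
As a rough illustration of what a timing check can look like, the sketch below averages wall-clock latency over several runs after a warmup call. It is not the code in `test/integration/test_model.py`: `average_latency` and `fake_generate` are hypothetical names, and the stand-in callable sleeps instead of running the pipeline.

```python
import time
from statistics import mean
from typing import Callable, List


def average_latency(generate: Callable[[], object], warmup: int = 1, runs: int = 3) -> float:
    """Return mean wall-clock seconds per call, excluding warmup calls."""
    for _ in range(warmup):
        generate()  # warmup calls are executed but not timed
    samples: List[float] = []
    for _ in range(runs):
        start = time.perf_counter()
        generate()
        samples.append(time.perf_counter() - start)
    return mean(samples)


calls: List[int] = []


def fake_generate() -> None:
    """Hypothetical stand-in for an image generation call."""
    calls.append(1)
    time.sleep(0.01)


latency = average_latency(fake_generate, warmup=1, runs=3)
print(f"average latency over 3 runs: {latency:.4f}s")
```

Excluding the warmup call keeps one-time costs, such as loading a compiled model, out of the reported average.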

SDK Compatibility

Neuron SDK 2.29 (DLAMI 20260410)
NxD Inference 0.9
PyTorch 2.9

FLUX.1-lite-8B-alpha (Freepik) is architecturally identical to FLUX.1-dev with 8 double-stream blocks instead of 19. NxDI's FLUX.1 implementation reads num_layers from config.json at runtime, so it works out of the box with FLUX.1-lite weights -- no custom modeling code needed.

Validated on trn2.3xlarge (LNC=2, TP=4):
- 5.91s per 1024x1024 image (25 steps)
- 4.49 backbone fwd/sec
- ~128s compilation time
- SDK 2.29, NxD Inference 0.9

jimburtoft added 3 commits April 28, 2026 02:06

Remove estimated comparison table from README (only measured numbers)

85261a0

Remove InternVL3 contrib (belongs to separate PR)

e4fc514

jimburtoft wants to merge 3 commits into aws-neuron:main from jimburtoft:contrib/flux1-lite-8b
