Add pixelization imaging profiling: A100 + RTX 2060 + CPU sweep#57
Merged
Conversation
…00 + RTX 2060 + CPU sweep Six configs side-by-side for the rectangular pixelization imaging likelihood (Sersic + Isothermal + ExternalShear lens with a RectangularAdaptDensity(28,28) source mesh + Constant regularization) covering consumer hardware (RTX 2060 Max-Q + i9-10885H), production A100, and both fp64 + mixed-precision variants. Generated by new tooling in z_projects/profiling/scripts/ (separate local-only commit; no PR target). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This was referenced May 10, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds long-term tracking artifacts for the rectangular pixelization imaging likelihood under
jax_profiling/results/jit/imaging/pixelization/— six configs side-by-side (CPU/GPU × fp64/mp on consumer hardware + A100 fp64/mp). Generated by new tooling inz_projects/profiling/scripts/(separate local-only commit, no PR target).Likelihood: Sersic + Isothermal + ExternalShear lens with a
RectangularAdaptDensity(28, 28)source mesh + Constant regularization. Mirrors the canonical reference atjax_profiling/jit/imaging/pixelization.py. Companion to the MGE sweep merged in #56 — same harness, different model, an extra three steps (Overlay grid, Regularization matrix H, Regularized reconstruction) on top of the MGE 8-step pipeline.Headline numbers
Key findings
use_mixed_precisionflag remains a CPU lever, not a GPU one — same conclusion as MGE.reconstruction_positive_only_from— it does not batch usefully. Batched pixelization evaluation needs a different reconstruction strategy.Caveats
PyAutoNSSvenv does not havejax_enable_x64=True. Doesn't affect timing data here, and the assertion usesrtol=1e-2for mp paths to absorb this. Worth confirming before quoting A100-served log Z values to high precision.comparison.jsonheadline section captures both numbers explicitly.comparison.png) uses log scale on the y axis to make the A100 / RTX 2060 / CPU classes coexist legibly, since they span ~3 orders of magnitude.Generated by
z_projects/profiling/scripts/pixelization_profile.py— single-config 11-step JIT profiler (per-step timings + full pipeline + vmap + memory analysis). Argparse-driven, honoursPYAUTO_ROOTfor worktree-aware canonical writes.z_projects/profiling/scripts/pixelization_aggregate.py—--ingest-pre-fix /tmp(no-op unless artifacts present);--consolidate-from <staging>to move HPC pulls into this canonical dir; default to emitcomparison.json+comparison.png.z_projects/profiling/scripts/_setup_pixelization.py— sharedbuild_dataset/build_model/build_analysisso the canonical reference'sEXPECTED_LOG_EVIDENCE_HST = 26232.068573757562constant carries through asserted on every run.z_projects/profiling/hpc/batch_gpu/submit_pixelization_profile_{fp64,mp}— A100 SLURM submits.The
z_projects/profiling/source side commits to its own (remote-less) main; only the result artifacts in this PR are version-tracked.Test plan
pixelization_aggregate.py)comparison.json+comparison.pngregenerated end-to-endjax_profiling/results/jit/imaging/outside the newpixelization/subdirpixelization_likelihood_summary_hst_v*.{json,png}) untouchedEXPECTED_LOG_EVIDENCE_HST(rtol=1e-4)🤖 Generated with Claude Code