Skip to content

Jmt collapse modules#8

Merged
johnmarktaylor91 merged 2 commits intomainfrom
jmt-collapse-modules
Jun 23, 2023
Merged

Jmt collapse modules#8
johnmarktaylor91 merged 2 commits intomainfrom
jmt-collapse-modules

Conversation

@johnmarktaylor91
Copy link
Copy Markdown
Owner

No description provided.

@johnmarktaylor91 johnmarktaylor91 merged commit 03ebe62 into main Jun 23, 2023
@johnmarktaylor91 johnmarktaylor91 deleted the jmt-collapse-modules branch June 23, 2023 03:58
johnmarktaylor91 added a commit that referenced this pull request Mar 4, 2026
… polish fixes

Wave 1 — Critical Correctness:
- Rewrite safe_copy() to eliminate numpy round-trip, fix bfloat16 overflow,
  handle sparse/meta/quantized dtypes (#103, #137, #128, #139)
- Guard print_override numpy conversion (#140), meta/sparse tensor memory (#24)
- Fix validation crash on None parent_values (#150), silent exception pass (#151)
- Add buffer duplicate guard in fast path (#116)
- Fix postprocess_fast shared reference and unguarded parent_layers (#8, #152)
- Add empty graph guard (#153), zombie label cleanup (#75)
- Handle defaultdict in _safe_copy_arg (#127)

Wave 2 — Exception Safety:
- Make module_forward_decorator exception-safe (#122)
- Guard _handle_module_entry for untracked tensors (#117)
- Clean up output tensors on fast-pass exception (#110)
- Wrap activation_postfunc in pause_logging (#96)
- Guard validation with save_function_args=False (#131)
- Fix CUDA perturbation device consistency (#36)

Wave 3 — Fast-Pass State Reset:
- Reset timing and stale lookup caches in save_new_activations (#87, #97, #98)
- Pass raw label (not final) to gradient hook in fast path (#86)
- Raise descriptive error for unexpected missing parents (#111)

Wave 4 — Argument Handling:
- Deep-copy nested tuples in creation_args (#44)
- Slice before cloning in display helper (#73)
- Use logged shape in display (#45)

Wave 5 — Interface Polish:
- Key layer_num_passes by no-pass label (#53)
- Use rsplit for colon-split safety (#54)
- Add slice indexing support (#78)
- Guard to_pandas before pass finished (#124)
- List candidates for ambiguous substring (#125)
- Add ModuleLog string indexing (#120)
- Support int/short-name in ParamAccessor.__contains__ (#84)

Wave 6 — Decoration/Model Prep:
- Only add mapper entries if setattr succeeded (#31)
- Add hasattr guard for identity asymmetry (#39)
- Guard buffer setattr (#40)
- Store dunder argnames stripped (#82)
- Use inspect.Parameter.kind for varargs (#123)

Wave 7 — Visualization:
- Fix vis_nesting_depth=0 crash (#94)
- Use rsplit for colon-split (#104)
- Fix collapsed node variable (#100)
- Guard None tensor_shape (#118)
- Guard LayerLog nesting depth (#138)
- Use exact equality instead of substring match (#129)

Wave 8 — Control Flow/Loop Detection:
- Remove dead sibling_layers iteration (#2)
- Prevent shared neighbor double-add (#148)

Wave 9 — Cleanup + GC:
- Lazy import IPython (#72)
- Filter callables from _trim_and_reorder (#134)
- Complete cleanup() with internal containers (GC-5, GC-12)
- Use weakref in backward hook closure (GC-8)
- Clear ModulePassLog forward_args/kwargs after build (GC-11)

585 tests passing across 10 test files, 47 new regression tests.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant