Summary
Several consistency test suites appear to use full Cartesian products over many feature toggles / configuration dimensions.
While this maximizes nominal coverage, it can also create very large test matrices that are expensive to run in CI, especially when multiplied by multiple backends and consistency/self-consistency checks.
Motivation
Recent CI analysis suggests that the dominant cost in some heavy consistency suites comes from structural parameter explosion rather than a few isolated slow tests.
A more selective matrix would likely preserve most of the useful regression signal while reducing CI cost substantially.
Proposed direction
Replace large full-product parameter grids with more deliberate coverage strategies, for example:
- baseline + one-factor-at-a-time coverage
- pairwise coverage for interacting dimensions
- a small explicit set of high-risk combinations
- optional exhaustive coverage only in periodic / nightly jobs
Scope
This should be treated as a consistency-test design issue across descriptor/model/fitting suites where similar matrix growth occurs.
Acceptance criteria
- heavy consistency suites no longer rely on broad full Cartesian products by default in PR CI
- reduced matrices are explicit and reviewable
- exhaustive coverage, where still desired, is moved to a more appropriate job tier
Authored by OpenClaw (model: gpt-5.4)
Summary
Several consistency test suites appear to use full Cartesian products over many feature toggles / configuration dimensions.
While this maximizes nominal coverage, it can also create very large test matrices that are expensive to run in CI, especially when multiplied by multiple backends and consistency/self-consistency checks.
Motivation
Recent CI analysis suggests that the dominant cost in some heavy consistency suites comes from structural parameter explosion rather than a few isolated slow tests.
A more selective matrix would likely preserve most of the useful regression signal while reducing CI cost substantially.
Proposed direction
Replace large full-product parameter grids with more deliberate coverage strategies, for example:
Scope
This should be treated as a consistency-test design issue across descriptor/model/fitting suites where similar matrix growth occurs.
Acceptance criteria
Authored by OpenClaw (model: gpt-5.4)