Problem
20–25% of scheduled runs targeting claude-opus-4.6 fail within ~4 seconds with 400 The requested model is not supported, while the identical workflow succeeds minutes later. The model catalogue returned by the Copilot API varies between requests (30 vs 39 models), causing intermittent entitlement mismatches.
Context
Upstream issue: github/gh-aw#35075
The awf-reflect step observes fetched N model(s) counts that differ run-to-run. The harness receives a 400 and does not retry (model not supported — not retrying).
Root Cause
Upstream Copilot model-catalogue/entitlement API is non-deterministic — the set of models returned varies across requests for the same identity. This is not a gh-aw-firewall bug, but the firewall and harness surface it.
Proposed Solution
- Route to Copilot platform team to stabilise model-catalogue API responses.
- As a firewall-side mitigation: add a limited retry (2–3 attempts with backoff) in the harness for
400 model not supported responses before surfacing as a terminal failure, since the condition is transient.
Generated by Firewall Issue Dispatcher · sonnet46 1.6M · ◷
Problem
20–25% of scheduled runs targeting
claude-opus-4.6fail within ~4 seconds with400 The requested model is not supported, while the identical workflow succeeds minutes later. The model catalogue returned by the Copilot API varies between requests (30 vs 39 models), causing intermittent entitlement mismatches.Context
Upstream issue: github/gh-aw#35075
The
awf-reflectstep observesfetched N model(s)counts that differ run-to-run. The harness receives a 400 and does not retry (model not supported — not retrying).Root Cause
Upstream Copilot model-catalogue/entitlement API is non-deterministic — the set of models returned varies across requests for the same identity. This is not a gh-aw-firewall bug, but the firewall and harness surface it.
Proposed Solution
400 model not supportedresponses before surfacing as a terminal failure, since the condition is transient.