Skip to content

fix(test): sync doc-maintainer test with max-turns 15 + prompt rewrite#5587

Merged
lpcox merged 1 commit into
mainfrom
fix/doc-maintainer-test-max-turns-15
Jun 26, 2026
Merged

fix(test): sync doc-maintainer test with max-turns 15 + prompt rewrite#5587
lpcox merged 1 commit into
mainfrom
fix/doc-maintainer-test-max-turns-15

Conversation

@lpcox

@lpcox lpcox commented Jun 26, 2026

Copy link
Copy Markdown
Collaborator

Problem

npm test is currently red on main: scripts/ci/doc-maintainer-workflow.test.ts has 2 failing assertions.

PR #5564 (fix(doc-maintainer): prevent maxRuns 403 from wasted shell turns) raised the Documentation Maintainer max-turns from 8 → 15 and rewrote the shell-restriction prompt line in .github/workflows/doc-maintainer.md (and recompiled the lock), but did not update the test guarding those values.

Failing assertions

Expected substring: "max-turns: 8"          (source)  -> now max-turns: 15
Expected substring: "GH_AW_MAX_TURNS: 8"     (lock)    -> now GH_AW_MAX_TURNS: 15
Expected substring: "**Do not run any `git` commands**"  -> replaced by new wording

Fix

Update the three stale assertions to match the current source workflow and lock:

  • max-turns: 8max-turns: 15
  • GH_AW_MAX_TURNS: 8GH_AW_MAX_TURNS: 15
  • **Do not run any \git` commands**Do not use the `shell` tool (and the `bash` tool is disabled). Do not attempt to run `git`, `npm test`, `ls`, or any other shell command`

No production code changes — test-only sync.

Verification

Test Suites: 196 passed, 196 total
Tests:       3220 passed, 3220 total

npm test is fully green.

PR #5564 raised the Documentation Maintainer `max-turns` from 8 to 15
(to stop the agent exhausting its maxRuns budget on denied shell turns
and surfacing a misleading 403) and rewrote the shell-restriction prompt
line, but did not update scripts/ci/doc-maintainer-workflow.test.ts.

This left `npm test` red with two stale assertions:
- `max-turns: 8` / `GH_AW_MAX_TURNS: 8` -> now 15
- `**Do not run any \`git\` commands**` -> replaced by the new
  `**Do not use the \`shell\` tool** ...` wording

Update the assertions to match the current source workflow and lock.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Copilot AI review requested due to automatic review settings June 26, 2026 20:02

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates the doc-maintainer workflow guard test to match the current workflow configuration after #5564 increased the agent turn budget and rewrote the shell-restriction prompt text, restoring npm test on main.

Changes:

  • Sync source-workflow assertion from max-turns: 8 to max-turns: 15.
  • Sync lock-workflow assertion from GH_AW_MAX_TURNS: 8 to GH_AW_MAX_TURNS: 15.
  • Update the prompt-string assertion to match the new “do not use the shell tool” wording.
Show a summary per file
File Description
scripts/ci/doc-maintainer-workflow.test.ts Updates stale assertions so the test matches the current doc-maintainer workflow source + lock.

Review details

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

  • Files reviewed: 1/1 changed files
  • Comments generated: 0
  • Review effort level: Low

@github-actions

Copy link
Copy Markdown
Contributor

✅ Coverage Check Passed

Overall Coverage

Metric Base PR Delta
Lines 98.24% 98.28% 📈 +0.04%
Statements 98.17% 98.21% 📈 +0.04%
Functions 99.53% 99.53% ➡️ +0.00%
Branches 94.00% 94.00% ➡️ +0.00%
📁 Per-file Coverage Changes (1 files)
File Lines (Before → After) Statements (Before → After)
src/workdir-setup.ts 92.7% → 94.5% (+1.82%) 92.7% → 94.5% (+1.82%)

Coverage comparison generated by scripts/ci/compare-coverage.ts

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

🔑 Smoke Copilot PAT PAT auth validated. All systems operational. ✅

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Smoke Copilot BYOK AOAI (api-key) reports failed. AOAI BYOK (api-key) mode investigation needed...

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Chroot tests passed! Smoke Chroot - All security and functionality tests succeeded.

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Smoke Copilot BYOK AOAI (Entra) completed. Copilot AOAI BYOK (Entra) mode operational. 🔓

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Contribution Check completed successfully!

Contribution guidelines review complete for PR #5587: no important missing items found; no comment needed.

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Smoke Copilot BYOK completed. Copilot BYOK mode operational. 🔓

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

📰 VERDICT: Smoke Copilot has concluded. All systems operational. This is a developing story. 🎤

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Smoke Gemini completed. All facets verified. 💎

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

🔌 Smoke Services — All services reachable! ✅

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Smoke Claude passed

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

📡 Smoke OTel Tracing completed. All tracing scenarios validated. ✅

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

Build Test Suite completed successfully!

@github-actions

github-actions Bot commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

✨ The prophecy is fulfilled... Smoke Codex has completed its mystical journey. The stars align. 🌟

@github-actions

Copy link
Copy Markdown
Contributor

🚀 Security Guard has started processing this pull request

@github-actions github-actions Bot mentioned this pull request Jun 26, 2026
@github-actions

Copy link
Copy Markdown
Contributor

Smoke Test: Claude Engine Validation

  • API check: ✅ PASS
  • gh CLI check: ✅ PASS
  • File status check: ✅ PASS

Overall result: PASS

Generated by Smoke Claude for issue #5587 · 61.5 AIC · ⊞ 3.3K ·

@github-actions

Copy link
Copy Markdown
Contributor

🔬 Smoke Test: Copilot PAT Auth — FAIL

Test Result
GitHub MCP connectivity ✅ Listed PRs from github/gh-aw-firewall
GitHub.com HTTP ❌ Pre-step output not resolved (template variable unsubstituted)
File write/read ❌ Pre-step output not resolved (template variable unsubstituted)

Overall: FAIL — Pre-step outputs (SMOKE_HTTP_CODE, SMOKE_FILE_PATH, SMOKE_FILE_CONTENT) were not substituted; workflow step likely failed silently.

Auth mode: PAT (COPILOT_GITHUB_TOKEN) | PR author: @lpcox

🔑 PAT report filed by Smoke Copilot PAT

@github-actions

Copy link
Copy Markdown
Contributor

Smoke Test: BYOK Direct Mode ✅

Test Results:

Status: PASS — direct BYOK mode (COPILOT_PROVIDER_API_KEY) operational

cc @lpcox

🔑 BYOK report filed by Smoke Copilot BYOK

@github-actions

Copy link
Copy Markdown
Contributor

Smoke Test: Gemini Engine Validation

  • GitHub MCP Testing: ❌ (Could not fetch PR titles)
  • GitHub.com Connectivity: ❌ (HTTP 000 / SSL Error)
  • File Writing Testing: ✅
  • Bash Tool Testing: ✅

Overall status: FAIL

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

  • localhost

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "localhost"

See Network Configuration for more information.

💎 Faceted by Smoke Gemini

@github-actions

Copy link
Copy Markdown
Contributor

@lpcox

✅ GitHub MCP Testing
✅ GitHub.com Connectivity
✅ File Write/Read Test
✅ BYOK Inference Test

Running in direct BYOK mode (AWF_AUTH_TYPE=github-oidc + AWF_AUTH_AZURE_* + COPILOT_PROVIDER_BASE_URL) via api-proxy → Azure OpenAI (Foundry, o4-mini-aw) authenticated via Microsoft Entra

Overall: PASS

🪪 BYOK (AOAI Entra) report filed by Smoke Copilot BYOK AOAI (Entra)

@github-actions

Copy link
Copy Markdown
Contributor

🔬 Smoke Test: API Proxy OpenTelemetry Tracing

Scenario Result Detail
1. Module Loading ✅ Pass otel.js loads, isEnabled: true, exports 14 functions (startRequestSpan, setTokenAttributes, setBudgetAttributes, endSpan, endSpanError, shutdown, ...)
2. Test Suite ✅ Pass 39 tests passed, 0 failed (otel.test.js)
3. Env Var Forwarding ⚠️ Expected api-proxy-service.ts doesn't directly forward OTEL vars — forwarding lives in api-proxy-env-config.ts instead; workflow step reports "expected during development"
4. Token Tracker Integration ✅ Pass onUsage callback present in token-tracker-http.js (OTEL hook point confirmed)
5. OTEL Diagnostics ✅ Pass /tmp/gh-aw/otel.jsonl found — 1 span exported during this run

Overall: ✅ All scenarios pass (Scenario 3 ⚠️ is expected per workflow definition).

📡 OTel tracing validated by Smoke OTel Tracing

@github-actions

Copy link
Copy Markdown
Contributor

Smoke test

  • perf(contribution-check): cut token/tool overhead per #5558
  • Split squid config tests by concern
  • GitHub PR query: ✅
  • Discussion comment: ✅
  • Playwright: ✅
  • File write/read: ✅
  • Build: ✅
  • Overall status: PASS

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

  • registry.npmjs.org

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "registry.npmjs.org"

See Network Configuration for more information.

🔮 The oracle has spoken through Smoke Codex

@github-actions

Copy link
Copy Markdown
Contributor

Chroot Version Comparison Results

Runtime Host Version Chroot Version Match?
Python Python 3.12.13 Python 3.12.3 ❌ NO
Node.js v24.17.0 v22.23.0 ❌ NO
Go go1.22.12 go1.22.12 ✅ YES

Overall: ❌ Not all tests passed — Python and Node.js versions differ between host and chroot environments.

Tested by Smoke Chroot

@github-actions

Copy link
Copy Markdown
Contributor

🔬 Smoke Test Results

Test Result
GitHub MCP connectivity ✅ PASS
GitHub.com HTTP connectivity ❌ FAIL (pre-step data not resolved)
File write/read ❌ FAIL (pre-step data not resolved)

PR: fix(test): sync doc-maintainer test with max-turns 15 + prompt rewrite
Author: @lpcox

Overall: FAIL — pre-step template variables were not substituted; tests 2 & 3 could not be verified.

📰 BREAKING: Report filed by Smoke Copilot

@github-actions

Copy link
Copy Markdown
Contributor

Smoke Test Results

Check Result
Redis PING ❌ timeout (no PONG)
PostgreSQL pg_isready ❌ no response
PostgreSQL SELECT 1 ❌ skipped (host unreachable)

Overall: FAILhost.docker.internal is not reachable from this environment. Service containers are not accessible.

🔌 Service connectivity validated by Smoke Services

@github-actions

Copy link
Copy Markdown
Contributor

🏗️ Build Test Suite Results

Ecosystem Project Build/Install Tests Status
Bun elysia 1/1 passed ✅ PASS
Bun hono 1/1 passed ✅ PASS
C++ fmt N/A ✅ PASS
C++ json N/A ✅ PASS
Deno oak N/A 1/1 passed ✅ PASS
Deno std N/A 1/1 passed ✅ PASS
.NET hello-world N/A ✅ PASS
.NET json-parse N/A ✅ PASS
Go color 1/1 passed ✅ PASS
Go env 1/1 passed ✅ PASS
Go uuid 1/1 passed ✅ PASS
Java gson 1/1 passed ✅ PASS
Java caffeine 1/1 passed ✅ PASS
Node.js clsx All passed ✅ PASS
Node.js execa All passed ✅ PASS
Node.js p-limit All passed ✅ PASS
Rust fd 1/1 passed ✅ PASS
Rust zoxide 1/1 passed ✅ PASS

Overall: 8/8 ecosystems passed — ✅ PASS

Generated by Build Test Suite for issue #5587 · 72.7 AIC · ⊞ 7.8K ·

@lpcox lpcox merged commit 948e007 into main Jun 26, 2026
87 of 89 checks passed
@lpcox lpcox deleted the fix/doc-maintainer-test-max-turns-15 branch June 26, 2026 21:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants