[codex] Centralize session status policy by ThomasK33 · Pull Request #67 · coder/agent-tty

ThomasK33 · 2026-04-29T09:59:20Z

Closes #58

What changed

Added a shared session status policy module that classifies every SessionStatus as active, terminal, commandable, live-host eligible, offline-replay eligible, collectable, and destroyed.
Moved lifecycle, inspect, garbage collection, wait, host, and command-state checks onto the shared predicates while preserving current CLI JSON and human output.
Captured the resolved status vocabulary in CONTEXT.md, including the destroying split: active and offline-replay eligible, but not terminal or collectable.

Impact

This is intended to be behavior-preserving. The public status values, protocol schemas, event log behavior, and artifact formats are unchanged.

Validation

Passed:

npx vitest run test/unit/protocol/sessionStatusPolicy.test.ts
npx vitest run test/unit/commands/type.test.ts
npx vitest run test/unit/commands/mark.test.ts
npx vitest run test/unit/commands/wait.test.ts
npx vitest run test/unit/host/hostMain.test.ts
npx vitest run --maxWorkers=1 test/integration/lifecycle.test.ts
npx vitest run test/unit/protocol/sessionStatusPolicy.test.ts test/unit/commands/inspect.test.ts test/unit/commands/gc.test.ts test/unit/commands/run.test.ts test/unit/commands/type.test.ts test/unit/commands/mark.test.ts test/unit/commands/wait.test.ts test/unit/host/hostMain.test.ts
npx vitest run --maxWorkers=1 test/integration/lifecycle.test.ts test/integration/gc.test.ts
npx vitest run --maxWorkers=1 test/integration/run.test.ts test/integration/wait-render.test.ts
npx vitest run test/unit/commands/golden-envelopes.test.ts
npm run typecheck
npm run format:check
git diff --check
npx vitest run --maxWorkers=1 test/integration/backend-selection.test.ts test/integration/run.test.ts

Note: mise run ci passed format-check, workflow-lint, lint, and typecheck, then the local full Vitest run hit two 5s backend-selection timeouts and one run integration status mismatch. The failing suites passed when rerun serially with --maxWorkers=1, so this looks like local full-suite contention rather than a status-policy regression. Remote PR checks should give the canonical signal.

Change-Id: Ieb854aba647511d3797d6c5319cc31d06eb20446 Signed-off-by: Thomas Kosiewski <tk@coder.com>

ThomasK33 · 2026-04-29T10:04:24Z

/coder-agents-review

coder-agents-review

The policy module is well-designed: satisfies Record<SessionStatus, SessionStatusPolicy> catches table completeness at compile time, module-load invariants enforce structural relationships, and the test suite verifies every predicate against every status including the destroying split (active + offline-replay, not terminal or collectable). Behavioral equivalence verified for all 11 migrated files.

2 P2, 5 P3, 2 P4, 5 Nit.

The two P2s are convergent: six reviewers independently flagged the rendering-path invariant gap, three flagged the active/terminal gap. The old code enforced both at compile time via exhaustive switches with never defaults. The policy table replaces those guarantees with runtime invariants, but these two are missing. The fix is two invariant() calls in the existing loop.

"The host-side messages don't even include the session ID or current status. 'Session is not running.' with no details. That is a doctor saying 'you're sick.'" (Leorio)

Process: the PR description mentions "one run integration status mismatch" during local testing and defers to CI without identifying the test, the assertion, or the status values. In a PR that changes status classification across 11 files, a status mismatch merits investigation, not deference.

The sessionState.ts status comparisons (6 direct comparisons for state-machine transition guards) were correctly excluded from migration. These define the state machine itself, not policy consumption. Consider documenting that boundary.

src/cli/commands/snapshot.ts:315

P3 [DEREM-5] snapshot.ts:315 and screenshot.ts:258 retain manifest.status === 'running' for the live-host-vs-offline-replay decision. The policy table classifies exiting as liveHostEligible: true, and inspect.ts follows via isLiveHostEligibleSessionStatus. These two files silently disagree.

The divergence predates this PR. Migrating would change exiting behavior, which violates the behavior-preserving claim. But the policy module is now the canonical authority, and these files are not documented as intentional exclusions. A developer adding a new live-host-eligible status would update the policy table, see inspect pick it up, and miss snapshot and screenshot. At minimum, add a comment explaining the intentional narrower check. (Kite P3, Pariston P3, Robin P3, Mafu-san P2)

🤖

test/unit/commands/mark.test.ts:52

P3 [DEREM-7] The PR rewrote the destroyed-session guard in mark.ts, type.ts, and wait.ts from === 'destroyed' to isDestroyedSessionStatus. None of these commands' test files exercise the destroyed status (createSessionRecord restricts status to 'running' | 'exiting' | 'exited'). run.test.ts already has this test (line 220). The gap was pre-existing, but the indirection through the policy table means a wiring mistake (e.g., swapping isDestroyedSessionStatus with isTerminalSessionStatus) would go undetected at the command level. (Bisky)

🤖

src/cli/commands/inspect.ts:40

Nit [DEREM-9] computeUptime retains session.status === 'running' in a file that otherwise imports from the policy module. Whether exiting and destroying sessions should count elapsed time as live uptime is a legitimate semantic question, but having an unmigrated inline check alongside migrated predicates is a maintenance trap. (Chopper, Robin, Pariston, Meruem)

🤖

🤖 This review was automatically generated with Coder Agents.

ThomasK33 · 2026-04-29T10:47:32Z

Addressed the remaining review-body items in d0b9297:

DEREM-5: documented the intentional narrower live-RPC gate in snapshot and screenshot so it stays behavior-preserving for exiting sessions.
DEREM-7: added destroyed-session command tests for mark, type, and wait.
DEREM-9: computeUptime now uses the status policy via isCommandableSessionStatus instead of an inline running check.
Process note: the earlier local full-suite status mismatch was investigated by rerunning the failing backend-selection/run suites serially; they passed, and the prior remote CI was green.

ThomasK33 · 2026-04-29T10:47:41Z

/coder-agents-review

coder-agents-review

All 14 Round 1 findings addressed in d0b9297. The fixes are thorough: the author added both missing invariants (DEREM-2, DEREM-3), extracted the shared guard (DEREM-8), restructured the test table from verbatim copy to classification-set mapping (DEREM-16), and documented every intentional exclusion (DEREM-5 snapshot/screenshot, DEREM-6 legacy error text, DEREM-10/11 destroyed convenience flag). Netero verified all fixes; the panel confirmed.

1 P3, 1 P4, 3 Nit new in this round.

The P3 is a false comment introduced in the fix: assertSessionCommandable claims its details: { status } reaches callers, but buildCliErrorResponse only serializes code and message, and RpcErrorSchema is .strict(). The details are dead data and the comment is factually incorrect about the wire format.

"The truth table held up under inspection. Solid opponent." (Hisoka)

Process note: commit d0b9297 subject "refactor: address status policy review" conveys mechanism, not intent. If this survives as a non-squashed entry, a reader six months from now learns nothing about what changed. Something like "refactor: add policy invariants and destroyed-session guard tests" names the changes.

test/unit/commands/wait.test.ts:350

P4 [DEREM-19] No test for wait --exit on an already-terminal session. The early-return path at wait.ts:350 (isTerminalSessionStatus(manifestStatus)) emits success with the exit code and returns without contacting the host. Every waitForExit: true test uses a running session. A wrong predicate would silently fall through to RPC against a dead host instead of returning immediately. Risk is low (predicate well-tested in sessionStatusPolicy.test.ts), but the path itself is uncovered.

Sketch: feed createSessionRecord('exited', 42) + waitForExit: true, assert emitSuccess called with { timedOut: false, exitCode: 42 } and sendRpc never called. (Bisky)

🤖

🤖 This review was automatically generated with Coder Agents.

ThomasK33 · 2026-04-29T11:15:55Z

Addressed the remaining review-body items in 9a5e318:

DEREM-19: added a unit test for wait --exit on an already-terminal exited session, asserting immediate success and no RPC call.
Process note: amended the review-fix commit subject to refactor: add policy invariants and command guard tests.

Local validation for this round:

npx vitest run test/unit/commands/wait.test.ts test/unit/commands/type.test.ts test/unit/commands/mark.test.ts test/unit/host/hostMain.test.ts test/unit/commands/inspect.test.ts
npm run format:check
npm run typecheck
npm run lint

ThomasK33 · 2026-04-29T11:16:04Z

/coder-agents-review

coder-agents-review

All 19 findings from Rounds 1-2 addressed. The fixes are thorough and demonstrate strong engineering judgment: the author generalized from corrections (DEREM-8 guard extraction), restructured rather than parroted (DEREM-16 test rewrite), and documented intentional exclusions rather than silently skipping them (DEREM-5 snapshot/screenshot comments).

1 P4 new. 9 of 11 reviewers report no findings. Netero clean.

The P4 is a narrow test gap in the DEREM-19 fix: the wait terminal early-return test covers exitCode: 42 but not the exitCode === null branch, where the conditional spread at wait.ts:353 omits the field entirely. A wrong spread would change the public JSON contract from {timedOut: false} to {timedOut: false, exitCode: null}. Low risk (the conditional spread is straightforward), but worth adding a null case.

"The truth table held up under inspection. Solid opponent." (Hisoka)

🤖 This review was automatically generated with Coder Agents.

Change-Id: I317c32b10c7bd62ac82784bfb9d3d6c791d373b9 Signed-off-by: Thomas Kosiewski <tk@coder.com>

ThomasK33 · 2026-04-29T11:40:06Z

Addressed DEREM-22 in 999d42c:

Added a wait --exit unit case for an already-terminal session with exitCode: null.
The test asserts the result is exactly { timedOut: false } and that no host RPC is attempted.

Local validation for this round:

npx vitest run test/unit/commands/wait.test.ts
npm run format:check
npm run typecheck
npm run lint

ThomasK33 · 2026-04-29T11:40:32Z

/coder-agents-review

coder-agents-review

All 20 posted findings across 3 rounds addressed. DEREM-22 (null exitCode test gap) verified fixed in 999d42c. 11 reviewers, 0 new findings. Netero clean.

This PR is ready to merge. The session status policy module centralizes scattered inline checks into a single truth table with compile-time exhaustiveness (satisfies Record<SessionStatus, SessionStatusPolicy>), import-time structural invariants (active XOR terminal, collectable implies terminal, commandable implies active, exactly one renderer source), and named predicates that make call sites self-documenting. Behavioral equivalence verified across all substitution sites by multiple reviewers across 4 rounds.

🤖 This review was automatically generated with Coder Agents.

refactor: centralize session status policy

6e9ec0a

Change-Id: Ieb854aba647511d3797d6c5319cc31d06eb20446 Signed-off-by: Thomas Kosiewski <tk@coder.com>

ThomasK33 marked this pull request as ready for review April 29, 2026 10:04

coder-agents-review Bot suggested changes Apr 29, 2026

View reviewed changes

coder-agents-review Bot reviewed Apr 29, 2026

View reviewed changes

Comment thread src/host/hostMain.ts Outdated

Comment thread src/cli/commands/inspect.ts

Comment thread src/host/hostMain.ts

Comment thread src/cli/sessionGuards.ts Outdated

ThomasK33 force-pushed the codex/session-status-policy branch from d0b9297 to 9a5e318 Compare April 29, 2026 11:14

coder-agents-review Bot approved these changes Apr 29, 2026

View reviewed changes

Comment thread test/unit/commands/wait.test.ts

refactor: add policy invariants and command guard tests

999d42c

Change-Id: I317c32b10c7bd62ac82784bfb9d3d6c791d373b9 Signed-off-by: Thomas Kosiewski <tk@coder.com>

ThomasK33 force-pushed the codex/session-status-policy branch from 9a5e318 to 999d42c Compare April 29, 2026 11:38

coder-agents-review Bot approved these changes Apr 29, 2026

View reviewed changes

ThomasK33 merged commit a9a6836 into main Apr 29, 2026
11 checks passed

ThomasK33 deleted the codex/session-status-policy branch April 29, 2026 12:01

Conversation

ThomasK33 commented Apr 29, 2026

What changed

Impact

Validation

Uh oh!

ThomasK33 commented Apr 29, 2026

Uh oh!

coder-agents-review Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ThomasK33 commented Apr 29, 2026

Uh oh!

ThomasK33 commented Apr 29, 2026

Uh oh!

coder-agents-review Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ThomasK33 commented Apr 29, 2026

Uh oh!

ThomasK33 commented Apr 29, 2026

Uh oh!

coder-agents-review Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ThomasK33 commented Apr 29, 2026

Uh oh!

ThomasK33 commented Apr 29, 2026

Uh oh!

coder-agents-review Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant