diff --git a/.cursor/mcp.json b/.cursor/mcp.json
deleted file mode 100644
index 19951a368..000000000
--- a/.cursor/mcp.json
+++ /dev/null
@@ -1,14 +0,0 @@
-{
-  "mcpServers": {
-    "relaycast": {
-      "command": "npx",
-      "args": [
-        "-y",
-        "@relaycast/mcp"
-      ],
-      "env": {
-        "RELAY_BASE_URL": "https://api.relaycast.dev"
-      }
-    }
-  }
-}
diff --git a/.cursor/settings.json b/.cursor/settings.json
deleted file mode 100644
index 1cc52554d..000000000
--- a/.cursor/settings.json
+++ /dev/null
@@ -1,7 +0,0 @@
-{
-  "permissions": {
-    "allow": [
-      "mcp__agent-relay__*"
-    ]
-  }
-}
diff --git a/.factory/settings.json b/.factory/settings.json
deleted file mode 100644
index 565f14af7..000000000
--- a/.factory/settings.json
+++ /dev/null
@@ -1,5 +0,0 @@
-{
-  "enabledPlugins": {
-    "core@factory-plugins": true
-  }
-}
\ No newline at end of file
diff --git a/ARCHITECTURE.md b/ARCHITECTURE.md
deleted file mode 100644
index f416a9f92..000000000
--- a/ARCHITECTURE.md
+++ /dev/null
@@ -1,738 +0,0 @@
-# Agent Relay: Architecture & Design Document
-
-## Executive Summary
-
-Agent Relay is a real-time messaging system that enables autonomous agent-to-agent communication. It allows AI coding assistants (Claude, Codex, Gemini, etc.) running in separate terminal sessions to discover each other and exchange messages without human intervention.
-
-The system works by:
-
-1. Wrapping agent CLI processes in PTY sessions managed by a Rust broker
-2. Providing MCP tools for agent communication (mcp__relaycast__message_dm_send, mcp__relaycast__agent_add, etc.)
-3. Routing messages through Relaycast (cloud WebSocket service)
-4. Injecting incoming messages directly into agent terminal input
-
-This document provides complete transparency into how the system works, its design decisions, limitations, and trade-offs.
-
----
-
-## Table of Contents
-
-1. [System Overview](#1-system-overview)
-2. [Architecture Layers](#2-architecture-layers)
-3. [Component Deep Dive](#3-component-deep-dive)
-4. [Protocol Specification](#4-protocol-specification)
-5. [Message Flow](#5-message-flow)
-6. [Data Storage](#7-data-storage)
-7. [Security Model](#8-security-model)
-8. [Design Decisions & Trade-offs](#9-design-decisions--trade-offs)
-9. [Known Limitations](#10-known-limitations)
-10. [Future Considerations](#11-future-considerations)
-
----
-
-## 1. System Overview
-
-### 1.1 Problem Statement
-
-Modern AI coding assistants operate in isolation. When you run multiple agents on different parts of a codebase, they cannot:
-
-- Share discoveries or context
-- Coordinate on interdependent tasks
-- Request help from specialized agents
-- Avoid duplicate work
-
-Agent Relay solves this by providing a communication layer that requires **zero modification** to the underlying AI systems.
-
-### 1.2 Core Principle: MCP Tool Protocol
-
-The fundamental insight is that AI agents can invoke MCP (Model Context Protocol) tools. By providing relay tools (`mcp__relaycast__message_dm_send`, `mcp__relaycast__agent_add`, `mcp__relaycast__agent_list`, etc.) via MCP, agents can communicate without modifying the underlying AI system.
-
-This approach:
-
-- Works with any CLI-based agent that supports MCP
-- Requires no agent-side code changes
-- Preserves the user's normal terminal experience
-- Allows agents to communicate using natural language
-
-### 1.3 High-Level Architecture
-
-```
-┌─────────────────────────────────────────────────────────────────────────┐
-│                         User's Terminal                                  │
-│  ┌─────────────────┐  ┌─────────────────┐  ┌─────────────────┐         │
-│  │  agent-relay    │  │  agent-relay    │  │  agent-relay    │         │
-│  │  spawn Alice    │  │  spawn Bob      │  │  spawn Carol    │         │
-│  │  claude         │  │  codex          │  │  gemini         │         │
-│  └────────┬────────┘  └────────┬────────┘  └────────┬────────┘         │
-│           │                    │                    │                   │
-│           │ PTY Sessions       │ PTY Sessions       │ PTY Sessions     │
-│           │                    │                    │                   │
-│           └────────────────────┼────────────────────┘                   │
-│                                │                                        │
-│                    ┌───────────▼───────────┐                           │
-│                    │   Broker (Rust)       │                           │
-│                    │   agent-relay-broker  │                           │
-│                    └───────────┬───────────┘                           │
-│                                │                                        │
-│                    ┌───────────▼───────────┐                           │
-│                    │   Relaycast Cloud     │                           │
-│                    │   (WebSocket)         │                           │
-│                    └───────────────────────┘                           │
-└─────────────────────────────────────────────────────────────────────────┘
-```
-
----
-
-## 2. Architecture Layers
-
-The system is organized into five distinct layers:
-
-### Layer 1: CLI Interface (`src/cli/`)
-
-Entry point for users. Parses commands, manages broker lifecycle, handles agent spawning and messaging.
-
-### Layer 2: Broker (`src/main.rs` + `src/lib.rs`)
-
-Rust binary that manages PTY sessions, parses agent output, routes messages via Relaycast WebSocket, and handles agent lifecycle.
-
-### Layer 3: SDK (`packages/sdk/`)
-
-TypeScript SDK for programmatic access. Drives the broker binary over stdio, provides spawn/release/event APIs.
-
-### Layer 4: Storage (`packages/storage/`)
-
-Message persistence using JSONL format. Supports queries by sender/recipient/time.
-
-### Layer 5: Dashboard (`packages/dashboard/`)
-
-Web UI for monitoring. Shows connected agents, message flow, real-time updates.
-
-```
-┌─────────────────────────────────────────────────────────────────┐
-│  Layer 1: CLI                                                   │
-│  ┌─────────────────────────────────────────────────────────────┐│
-│  │ Commands: up, down, status, spawn, bridge, doctor           ││
-│  └─────────────────────────────────────────────────────────────┘│
-├─────────────────────────────────────────────────────────────────┤
-│  Layer 2: Broker (Rust)                                         │
-│  ┌───────────────┐ ┌───────────────┐ ┌───────────────┐        │
-│  │ PTY Manager   │ │ MCP Tools     │ │ Relaycast WS  │        │
-│  │ (Agent mgmt)  │ │ (send_dm)     │ │ (Routing)     │        │
-│  └───────────────┘ └───────────────┘ └───────────────┘        │
-├─────────────────────────────────────────────────────────────────┤
-│  Layer 3: SDK                                                   │
-│  ┌───────────────┐ ┌───────────────┐ ┌───────────────┐        │
-│  │ Client        │ │ Workflows     │ │ Relay Adapter │        │
-│  │ (Stdio I/O)   │ │ (DAG runner)  │ │ (High-level)  │        │
-│  └───────────────┘ └───────────────┘ └───────────────┘        │
-├─────────────────────────────────────────────────────────────────┤
-│  Layer 4: Storage                                               │
-│  ┌───────────────┐ ┌───────────────┐                          │
-│  │ Adapter       │ │ JSONL         │                          │
-│  │ (Interface)   │ │ (Persistence) │                          │
-│  └───────────────┘ └───────────────┘                          │
-├─────────────────────────────────────────────────────────────────┤
-│  Layer 5: Dashboard                                             │
-│  ┌───────────────┐ ┌───────────────┐                          │
-│  │ Next.js       │ │ WebSocket     │                          │
-│  │ (REST API)    │ │ (Real-time)   │                          │
-│  └───────────────┘ └───────────────┘                          │
-└─────────────────────────────────────────────────────────────────┘
-```
-
----
-
-## 3. Component Deep Dive
-
-### 3.1 Broker (`src/main.rs`)
-
-The broker is a Rust binary (`agent-relay-broker`) that serves as the core runtime. It has several subcommands:
-
-- **`init`** — Starts as a broker hub, connecting to Relaycast and managing spawned agents via stdio protocol. Supports `--api-port <port>` to start an HTTP API for dashboard proxy (spawn/release/list endpoints).
-- **`pty`** — Wraps a single CLI in a PTY session with message injection
-- **`headless`** — Runs a provider (Claude, etc.) in headless/API mode
-- **`wrap`** — Internal command used by the SDK to wrap a CLI in a PTY with passthrough
-
-#### PTY Session Management
-
-The broker uses native PTY sessions (via `portable-pty`) instead of tmux:
-
-```
-┌─────────────────────────────────────────────────────────────────┐
-│                      Broker Process                               │
-│                                                                  │
-│  ┌──────────────────────────────────────────────────────────┐  │
-│  │                   PTY Session                             │  │
-│  │  ┌────────────────────────────────────────────────────┐  │  │
-│  │  │              Agent Process (claude, etc.)          │  │  │
-│  │  │                                                    │  │  │
-│  │  │  Output: "I'll send a message to Bob"             │  │  │
-│  │  │  MCP call: mcp__relaycast__message_dm_send(to: "Bob", text: "...")│  │  │
-│  │  │                                                    │  │  │
-│  │  └────────────────────────────────────────────────────┘  │  │
-│  └──────────────────────────────────────────────────────────┘  │
-│                              │                                  │
-│                              │ PTY output streaming              │
-│                              ▼                                  │
-│  ┌──────────────────────────────────────────────────────────┐  │
-│  │  MCP Tool Handler                                         │  │
-│  │  - Process MCP tool invocations from agents               │  │
-│  │  - Parse send_dm, agent_add, etc.                         │  │
-│  │  - Deduplicate (hash-based)                               │  │
-│  └──────────────────────────────────────────────────────────┘  │
-│                              │                                  │
-│                              ▼                                  │
-│  ┌──────────────────────────────────────────────────────────┐  │
-│  │  Relaycast WebSocket                                      │  │
-│  │  - Send message to Relaycast cloud                        │  │
-│  │  - Receive messages from other agents                     │  │
-│  │  - Handle workspace authentication                        │  │
-│  └──────────────────────────────────────────────────────────┘  │
-│                              │                                  │
-│                              ▼                                  │
-│  ┌──────────────────────────────────────────────────────────┐  │
-│  │  Message Injection                                        │  │
-│  │  - Wait for agent idle (configurable threshold)           │  │
-│  │  - Write to PTY stdin: "Relay message from X [id]: ..."   │  │
-│  │  - Press Enter                                            │  │
-│  └──────────────────────────────────────────────────────────┘  │
-│                                                                  │
-└─────────────────────────────────────────────────────────────────┘
-```
-
-#### Key Implementation Details
-
-**1. PTY-Based Agent Wrapping**
-The broker uses `portable-pty` for cross-platform PTY management, replacing the previous tmux-based approach. This eliminates the tmux dependency and provides more direct control over agent I/O.
-
-**2. ANSI Stripping**
-Output is stripped of ANSI escape codes before pattern matching to handle terminal formatting.
-
-**3. MCP Tool Protocol**
-Agents communicate by invoking MCP tools (e.g., `mcp__relaycast__message_dm_send`, `mcp__relaycast__agent_add`, `mcp__relaycast__agent_list`). The broker processes these tool calls and routes messages accordingly.
-
-**4. Message Deduplication**
-Uses a hash-based dedup cache to prevent re-sending the same message:
-
-```rust
-let dedup = DedupCache::new();
-// Messages are hashed and checked before routing
-```
-
-**5. Idle Detection for Injection**
-Configurable idle threshold (default 30s) before injecting messages. The broker monitors agent output and waits for silence before delivering incoming messages.
-
-**6. CLI-Specific Handling**
-Different CLIs need different injection strategies. The broker handles CLI-specific quirks for Claude, Codex, Gemini, Aider, and Goose.
-
-### 3.2 SDK (`packages/sdk/`)
-
-The TypeScript SDK provides programmatic access to the broker:
-
-```typescript
-import { AgentRelayClient } from '@agent-relay/sdk';
-
-// Start broker and connect
-const client = await AgentRelayClient.start({ env: process.env });
-
-// Spawn agents in PTY sessions
-await client.spawnPty({ name: 'Worker', cli: 'claude', channels: ['general'] });
-
-// Listen for events
-client.on('event', (event) => console.log(event));
-
-// Clean up
-await client.release('Worker');
-await client.shutdown();
-```
-
-The SDK communicates with the broker via stdio using a JSON-based request/response protocol.
-
-#### High-Level API (`AgentRelay`)
-
-```typescript
-import { AgentRelay } from '@agent-relay/sdk';
-
-const relay = new AgentRelay();
-
-// Idle detection
-relay.onAgentIdle = ({ name, idleSecs }) => {
-  console.log(`${name} idle for ${idleSecs}s`);
-};
-
-const agent = await relay.spawnPty({
-  name: 'Worker',
-  cli: 'claude',
-  channels: ['general'],
-  idleThresholdSecs: 30,
-});
-
-await agent.waitForIdle(120_000);
-await relay.shutdown();
-```
-
-### 3.3 Relaycast Cloud
-
-Messages are routed through Relaycast, a cloud WebSocket service:
-
-- Workspace-based isolation (each project gets a workspace)
-- Agent registration and presence
-- Channel-based messaging
-- Direct messages and threading
-- Persistent message history
-
-### 3.4 Workflow Engine (`packages/sdk/src/workflows/`)
-
-The SDK includes a DAG-based workflow runner for multi-step agent coordination:
-
-- Define workflows as YAML templates or programmatically via `WorkflowBuilder`
-- Steps can have dependencies, creating a directed acyclic graph
-- Built-in templates for common patterns: code review, bug fix, feature development
-- Step output chaining via `{{steps.X.output}}` template syntax
-
----
-
-## 4. Protocol Specification
-
-### 4.1 MCP Tool Protocol
-
-Agents communicate by invoking MCP tools provided by the Relaycast MCP server:
-
-| Tool                                          | Description                  |
-| --------------------------------------------- | ---------------------------- |
-| `mcp__relaycast__message_dm_send(to, text)`   | Send a DM to an agent        |
-| `mcp__relaycast__post_message(channel, text)` | Post a message to a channel  |
-| `mcp__relaycast__agent_add(name, cli, task)`  | Spawn a worker agent         |
-| `mcp__relaycast__agent_remove(name)`          | Release a worker agent       |
-| `mcp__relaycast__agent_list()`                | List connected agents        |
-| `mcp__relaycast__message_inbox_check()`       | Check incoming messages      |
-
-### 4.2 Broker Stdio Protocol
-
-The SDK communicates with the broker binary via JSON-line stdio:
-
-**Requests** (SDK → Broker):
-
-```json
-{ "id": "uuid", "method": "spawn_pty", "params": { "name": "Worker", "cli": "claude" } }
-```
-
-**Responses** (Broker → SDK):
-
-```json
-{ "id": "uuid", "result": { "ok": true } }
-```
-
-**Events** (Broker → SDK):
-
-```json
-{ "event": "agent_idle", "data": { "name": "Worker", "idle_secs": 30 } }
-```
-
-### 4.3 Spawn/Release Protocol
-
-```
-# Spawn
-KIND: spawn
-NAME: WorkerName
-CLI: claude
-
-Task description here.
-
-# Release
-KIND: release
-NAME: WorkerName
-```
-
-### 4.4 Message Delivery
-
-```
-Alice (Agent)          Broker              Relaycast           Bob (Agent)
-  │                      │                    │                    │
-  │── send_dm() ─────────▶│                    │                    │
-  │                      │── WebSocket msg ──▶│                    │
-  │                      │                    │── WebSocket msg ──▶│ (Bob's broker)
-  │                      │                    │                    │
-  │                      │                    │      inject into PTY
-  │                      │                    │     "Relay message  │
-  │                      │                    │      from Alice..." │
-```
-
----
-
-## 5. Message Flow
-
-### 5.1 Complete End-to-End Flow
-
-```
-┌─────────────────────────────────────────────────────────────────────────┐
-│ 1. AGENT INVOKES MCP TOOL                                               │
-│    Agent calls: mcp__relaycast__message_dm_send(to: "Bob", text: "Can you review auth.ts?")
-└─────────────────────────────────────────────────────────────────────────┘
-                                    │
-                                    ▼
-┌─────────────────────────────────────────────────────────────────────────┐
-│ 2. BROKER PROCESSES TOOL CALL                                           │
-│    Broker receives MCP tool invocation                                  │
-│    Deduplication check (hash-based)                                     │
-└─────────────────────────────────────────────────────────────────────────┘
-                                    │
-                                    ▼
-┌─────────────────────────────────────────────────────────────────────────┐
-│ 4. RELAYCAST ROUTING                                                    │
-│    Broker sends message via WebSocket to Relaycast cloud                │
-│    Relaycast routes to Bob's workspace/channel                          │
-└─────────────────────────────────────────────────────────────────────────┘
-                                    │
-                                    ▼
-┌─────────────────────────────────────────────────────────────────────────┐
-│ 5. BOB'S BROKER RECEIVES                                                │
-│    WebSocket delivers message to Bob's broker                           │
-│    Message queued for injection                                         │
-└─────────────────────────────────────────────────────────────────────────┘
-                                    │
-                                    ▼
-┌─────────────────────────────────────────────────────────────────────────┐
-│ 6. IDLE DETECTION + INJECTION                                           │
-│    Wait for idle threshold (no output from Bob's agent)                 │
-│    Write to PTY stdin: "Relay message from Alice [abc12345]:            │
-│                         Can you review auth.ts?"                        │
-│    Press Enter                                                          │
-└─────────────────────────────────────────────────────────────────────────┘
-                                    │
-                                    ▼
-┌─────────────────────────────────────────────────────────────────────────┐
-│ 7. BOB'S AGENT PROCESSES                                                │
-│    The message appears as user input in Bob's PTY                       │
-│    Bob's agent processes it as a new message                            │
-└─────────────────────────────────────────────────────────────────────────┘
-```
-
-### 5.2 Broadcast Flow
-
-When sending to `TO: *`:
-
-```
-Alice                    Relaycast                  Bob, Carol, Dave
-  │                        │                        │
-  │──── message ──────────▶│                        │
-  │  { to: "*", ... }      │                        │
-  │                        │                        │
-  │                        │──── deliver ──────────▶│ Bob
-  │                        │──── deliver ──────────▶│ Carol
-  │                        │──── deliver ──────────▶│ Dave
-  │                        │                        │
-  │                        │ (Alice excluded)       │
-```
-
----
-
-## 6. Data Storage
-
-### 6.1 Storage Architecture
-
-```
-┌─────────────────────────────────────────────────────────────────┐
-│                     StorageAdapter Interface                     │
-├─────────────────────────────────────────────────────────────────┤
-│  init(): Promise<void>                                          │
-│  saveMessage(message: StoredMessage): Promise<void>             │
-│  getMessages(query: MessageQuery): Promise<StoredMessage[]>     │
-│  getMessageById(id: string): Promise<StoredMessage | null>      │
-│  close(): Promise<void>                                         │
-└─────────────────────────────────────────────────────────────────┘
-                              │
-              ┌───────────────┼───────────────┐
-              │               │               │
-              ▼               ▼               ▼
-       ┌───────────┐   ┌───────────┐   ┌───────────┐
-       │  JSONL    │   │  Memory   │   │   DLQ     │
-       │  Adapter  │   │  Adapter  │   │  Adapter  │
-       └───────────┘   └───────────┘   └───────────┘
-```
-
-### 6.2 File Locations
-
-```
-.agent-relay/
-├── credentials/             # Auth tokens
-├── state.json               # Broker state (agents, channels)
-└── pending/                 # Pending deliveries
-```
-
----
-
-## 7. Security Model
-
-### 7.1 Trust Boundaries
-
-```
-┌─────────────────────────────────────────────────────────────────┐
-│                    TRUST BOUNDARY: Local Machine                │
-│                                                                 │
-│  ┌──────────────────────────────────────────────────────────┐  │
-│  │                 User's Terminal Session                   │  │
-│  │                                                           │  │
-│  │  Agents run with user's permissions                       │  │
-│  │  Broker authenticates via Relaycast API keys              │  │
-│  │  WebSocket connection is TLS-encrypted                    │  │
-│  │                                                           │  │
-│  └──────────────────────────────────────────────────────────┘  │
-│                                                                 │
-│  ┌──────────────────────────────────────────────────────────┐  │
-│  │                 Relaycast Cloud                            │  │
-│  │                                                           │  │
-│  │  Workspace isolation via API keys                         │  │
-│  │  Agent registration and authentication                    │  │
-│  │  Message persistence and routing                          │  │
-│  │                                                           │  │
-│  └──────────────────────────────────────────────────────────┘  │
-└─────────────────────────────────────────────────────────────────┘
-```
-
-### 7.2 Current Security Properties
-
-| Property               | Status | Notes                           |
-| ---------------------- | ------ | ------------------------------- |
-| Workspace isolation    | ✅     | Separate API keys per workspace |
-| TLS encryption         | ✅     | WebSocket over TLS to Relaycast |
-| Agent authentication   | ✅     | API key + agent registration    |
-| Local file permissions | ✅     | Outbox/inbox owned by user      |
-| Rate limiting          | ⚠️     | Server-side via Relaycast       |
-| Message validation     | ⚠️     | Basic field presence checks     |
-
----
-
-## 8. Design Decisions & Trade-offs
-
-### 8.1 Why a Rust Broker Instead of Node.js Daemon?
-
-**Decision**: Replace the Node.js daemon with a Rust binary.
-
-**Rationale**:
-
-- Single binary distribution — no Node.js runtime required
-- Lower memory footprint and faster startup
-- Native PTY support via `portable-pty`
-- Better concurrency model for managing multiple agents
-
-**Trade-offs**:
-
-- ❌ Requires cross-compilation for multiple platforms
-- ❌ Harder to prototype new features quickly
-- ✅ Zero runtime dependencies for users
-- ✅ Sub-millisecond message handling
-- ✅ Single binary install via curl
-
-### 8.2 Why PTY Instead of Tmux?
-
-**Decision**: Use native PTY sessions instead of tmux.
-
-**Rationale**:
-
-- Eliminates tmux as a dependency
-- More direct control over agent I/O
-- Works on platforms without tmux
-- Better process lifecycle management
-
-**Trade-offs**:
-
-- ❌ Users cannot detach/reattach to agent sessions directly
-- ✅ No dependency installation required
-- ✅ Cross-platform (including Windows)
-- ✅ More reliable output capture
-
-### 8.3 Why MCP Tools Instead of Output Parsing?
-
-**Decision**: Use MCP tools (`mcp__relaycast__message_dm_send()`, `mcp__relaycast__agent_add()`, etc.) instead of inline output parsing (`->relay:Target message`).
-
-**Rationale**:
-
-- Native integration with AI agent tool-calling capabilities
-- Structured parameters with type safety
-- No line-wrapping or ANSI code issues
-- Works reliably across all MCP-compatible CLIs
-
-**Trade-offs**:
-
-- ❌ Requires MCP-compatible CLI
-- ✅ No parsing ambiguity
-- ✅ Supports multi-line messages naturally
-- ✅ Structured parameters and return values
-- ✅ Single-step invocation (no file write + trigger)
-
-### 8.4 Why Relaycast Cloud Instead of Local Sockets?
-
-**Decision**: Route messages through Relaycast cloud WebSocket service.
-
-**Rationale**:
-
-- Cross-machine agent communication
-- Persistent message history
-- Workspace management and agent presence
-- Dashboard integration
-
-**Trade-offs**:
-
-- ❌ Requires internet connection
-- ❌ Introduces cloud dependency
-- ✅ Cross-machine and cross-project messaging
-- ✅ Persistent history and search
-- ✅ Team collaboration features
-
----
-
-## 9. Known Limitations
-
-### 9.1 Message Delivery Reliability
-
-| Issue                                 | Impact | Mitigation                          |
-| ------------------------------------- | ------ | ----------------------------------- |
-| Messages can be lost if agent is busy | Medium | Idle detection, retry logic         |
-| WebSocket disconnection               | Medium | Automatic reconnection with backoff |
-| Dedup cache memory growth             | Low    | Cache size limits                   |
-
-### 9.2 Platform Support
-
-| Platform | Status     | Notes                        |
-| -------- | ---------- | ---------------------------- |
-| Linux    | ✅ Full    | Primary development platform |
-| macOS    | ✅ Full    | Well tested                  |
-| Windows  | ⚠️ Partial | PTY support varies           |
-
-### 9.3 Scalability
-
-| Metric            | Current Limit | Notes                            |
-| ----------------- | ------------- | -------------------------------- |
-| Concurrent agents | ~50           | Limited by broker resources      |
-| Message rate      | High          | Limited by Relaycast rate limits |
-| Message size      | ~1 MiB        | Practical limit                  |
-
----
-
-## 10. Future Considerations
-
-### 10.1 Potential Enhancements
-
-**Reliability**:
-
-- Guaranteed delivery with acknowledgment
-- Persistent local queue for offline operation
-- Message ordering guarantees
-
-**Features**:
-
-- Typed message schemas
-- Priority queues
-- Advanced workflow patterns
-
-### 10.2 Architectural Evolution
-
-```
-Current:
-  Agent ──▶ MCP Tools ──▶ Broker ──▶ Relaycast WS ──▶ Agent
-
-The MCP tool protocol with Rust broker has proven effective for
-the target use case of multi-agent coordination across any CLI tool.
-```
-
----
-
-## Appendix A: File Map
-
-```
-agent-relay/
-├── src/
-│   ├── main.rs                  # Broker entry point (init, pty, headless, wrap)
-│   ├── lib.rs                   # Library exports (auth, dedup, protocol, etc.)
-│   ├── spawner.rs               # Agent spawning and process management
-│   ├── config.rs                # Configuration handling
-│   ├── protocol.rs              # Protocol types and envelope definitions
-│   ├── snippets.rs              # Agent instruction snippets and MCP config
-│   ├── cli/
-│   │   ├── bootstrap.ts         # CLI entry point, command registration
-│   │   ├── commands/
-│   │   │   ├── core.ts          # up, down, status, spawn, bridge
-│   │   │   ├── agent-management.ts  # Agent CRUD operations
-│   │   │   ├── messaging.ts     # send, read, inbox commands
-│   │   │   ├── cloud.ts         # Cloud link, status, agents
-│   │   │   ├── monitoring.ts    # Logs, health, metrics
-│   │   │   ├── auth.ts          # Login, logout, SSH key auth
-│   │   │   ├── setup.ts         # Install, setup commands
-│   │   │   └── doctor.ts        # Diagnostic command
-│   │   └── lib/                 # Shared CLI utilities
-│   └── index.ts                 # Package exports
-├── packages/
-│   ├── sdk/                     # TypeScript SDK (broker client, workflows)
-│   ├── acp-bridge/              # ACP protocol bridge for editors
-│   ├── config/                  # Configuration loading
-│   ├── hooks/                   # Hook system for events
-│   ├── storage/                 # Message persistence (JSONL)
-│   ├── utils/                   # Shared utilities
-│   ├── telemetry/               # Usage analytics
-│   ├── trajectory/              # Work trajectory tracking
-│   ├── user-directory/          # Agent directory management
-│   ├── memory/                  # Agent memory persistence
-│   └── policy/                  # Policy enforcement
-├── Cargo.toml                   # Rust dependencies
-├── package.json                 # Node.js dependencies
-├── CLAUDE.md                    # Agent instructions
-└── ARCHITECTURE.md              # This document
-```
-
----
-
-## Appendix B: Environment Variables
-
-| Variable                     | Default                     | Description                                |
-| ---------------------------- | --------------------------- | ------------------------------------------ |
-| `AGENT_RELAY_DASHBOARD_PORT` | 3888                        | Dashboard HTTP port                        |
-| `RELAY_AGENT_NAME`           | -                           | Agent name for broker registration         |
-| `RELAY_API_KEY`              | -                           | Relaycast workspace API key                |
-| `RELAY_BASE_URL`             | `https://api.relaycast.dev` | Relaycast API base URL                     |
-| `RELAY_CHANNELS`             | `general`                   | Comma-separated channel list               |
-| `AGENT_RELAY_DEBUG`          | false                       | Enable debug logging                       |
-| `RUST_LOG`                   | -                           | Rust log level (uses `tracing-subscriber`) |
-
----
-
-## Appendix C: Quick Reference
-
-### Starting the System
-
-```bash
-# Start broker + dashboard
-agent-relay up --dashboard
-
-# Spawn agents
-agent-relay spawn Alice claude "Your task here"
-agent-relay spawn Bob codex "Another task"
-```
-
-### Agent Communication (MCP Tools)
-
-```
-# Send a direct message
-mcp__relaycast__message_dm_send(to: "Bob", text: "Please review the auth module")
-
-# Post to a channel
-mcp__relaycast__post_message(channel: "general", text: "I've finished the database migration")
-```
-
-### Troubleshooting
-
-```bash
-# Check broker status
-agent-relay status
-
-# Run diagnostics
-agent-relay doctor
-
-# View logs
-RUST_LOG=debug agent-relay up
-```
-
----
-
-_Document updated for agent-relay v2.x (Rust broker architecture)_
-_Last updated: 2026_
diff --git a/BUDGET_AUDIT.md b/BUDGET_AUDIT.md
deleted file mode 100644
index 908790158..000000000
--- a/BUDGET_AUDIT.md
+++ /dev/null
@@ -1,168 +0,0 @@
-# Token Budget Tracking — Audit Report
-
-## 1. Token Collection: Exact File Locations
-
-### Collection Entry Point
-
-- **`packages/sdk/src/workflows/cli-session-collector.ts:51-58`** — `collectCliSession()` dispatches to CLI-specific collectors based on `AgentCli` type
-- **`packages/sdk/src/workflows/cli-session-collector.ts:38-49`** — `createCollector()` factory: supports `claude`, `codex`, `opencode`; returns `null` for all other CLIs
-
-### CLI-Specific Collectors
-
-| Collector   | File                             | Token Extraction                                                                                                                                                                |
-| ----------- | -------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| Claude Code | `collectors/claude.ts:87-186`    | Parses `~/.claude/projects/<project>/<sessionId>.jsonl`; sums `usage.input_tokens`, `usage.output_tokens`, `cache_read_input_tokens` from each `assistant` entry (line 123-128) |
-| Codex       | `collectors/codex.ts:149-169`    | Reads `~/.codex/state_5.sqlite` `threads` table; extracts `input_tokens`, `output_tokens`, `cache_read_tokens` columns (or falls back to `tokens_used`)                         |
-| OpenCode    | `collectors/opencode.ts:222-231` | Reads `~/.local/share/opencode/opencode.db` `message` table; sums `tokens.input`, `tokens.output`, `tokens.cache.read` from JSON `data` column                                  |
-
-### CliSessionReport Shape (`cli-session-collector.ts:6-24`)
-
-```typescript
-interface CliSessionReport {
-  cli: AgentCli;
-  tokens: { input: number; output: number; cacheRead: number } | null;
-  cost: number | null; // Only OpenCode populates this
-  durationMs: number | null;
-  model: string | null;
-  turns: number;
-  errors: { turn: number; text: string }[];
-  finalStatus: 'completed' | 'failed' | 'unknown';
-  // ...
-}
-```
-
-## 2. Token Data Flow
-
-```
-CLI session files (JSONL / SQLite)
-        │
-        ▼
-collectCliSession()                    (cli-session-collector.ts:51)
-        │
-        ▼
-captureAgentReport()                   (runner.ts:6623-6650)
-  ├─ this.agentReports.set(stepName)   (runner.ts:6642)  — in-memory Map
-  ├─ this.emit('step:agent-report')    (runner.ts:6643)  — event for listeners
-  └─ persistAgentReport()              (runner.ts:7135-7143) — writes <step>.report.json
-        │
-        ▼
-formatRunSummaryTable()                (run-summary-table.ts:41-110)
-  reads from agentReports Map          (runner.ts:6833)
-  displays: Step | Status | Model | Cost | Tokens | Duration | Errors
-```
-
-### Key Details
-
-- **`agentReports`** is declared as `private readonly agentReports = new Map<string, CliSessionReport>()` at **runner.ts:482**
-- Cleared at workflow start: **runner.ts:2860** (`this.agentReports.clear()`)
-- Populated post-execution per step: **runner.ts:6634-6642**
-- Displayed in final summary: **runner.ts:6833**
-- Token formatting in table: `run-summary-table.ts:8-12` sums `input + output + cacheRead`
-
-## 3. Where maxTokens Is Currently Referenced (NO Enforcement)
-
-| Location         | Line                                                           | Usage                                                                                                                                                 |
-| ---------------- | -------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------- |
-| `types.ts:206`   | `AgentConstraints.maxTokens?: number`                          | **Field definition only**                                                                                                                             |
-| `runner.ts:1663` | `agentDef.constraints?.maxTokens ?? proxyConfig.defaultBudget` | Used as `budget` in **credential proxy JWT** — passed to `mintProxyToken()` for the proxy's own rate limiting. **NOT enforced by the runner itself.** |
-| `runner.ts:3995` | `specialistDef.constraints?.maxTokens`                         | Passed as `defaultMaxTokens` to API-mode executor config. **NOT enforced during execution.**                                                          |
-
-**Finding: The runner reads `maxTokens` but never checks actual token consumption against the budget. There is zero enforcement at the workflow/runner level.**
-
-## 4. Timeout Enforcement Pattern to Follow
-
-The timeout enforcement in `waitForExitWithIdleNudging()` (**runner.ts:6338-6470**) provides the exact structural pattern for token budget enforcement:
-
-### Timeout Pattern Structure
-
-```
-1. CONFIGURATION:  timeoutMs from step.timeoutMs or swarm.timeoutMs
-2. LOOP:           while (true) { ... }
-3. TRACKING:       elapsed = Date.now() - startTime
-4. CHECK:          remaining = timeoutMs - elapsed; if (remaining <= 0) return 'timeout'
-5. WAIT:           exitResult = await agent.waitForExit(waitMs)
-6. GRACE:          On timeout, check verification before hard-failing (runner.ts:6169-6196)
-7. ESCALATION:     Nudge → escalate → force-release progression
-```
-
-### Proposed Token Budget Enforcement (Same Structure)
-
-```
-1. CONFIGURATION:  maxTokens from step agent's constraints.maxTokens
-2. LOOP:           Poll token consumption periodically during execution
-3. TRACKING:       currentTokens = read from agentReports or live polling
-4. CHECK:          if (currentTokens >= maxTokens) → trigger budget exceeded
-5. WAIT:           Continue waiting for exit with budget check interval
-6. GRACE:          On budget exceeded, allow current turn to complete
-7. ESCALATION:     Warn at 80% → soft-stop at 100% → force-release at 110%
-```
-
-### Enforcement Hook Points
-
-| Phase                | Location                                                     | Action                                                         |
-| -------------------- | ------------------------------------------------------------ | -------------------------------------------------------------- |
-| **Pre-spawn**        | `executeAgentStep()` (~runner.ts:6050)                       | Validate maxTokens is set; calculate remaining workflow budget |
-| **During execution** | Inside `waitForExitWithIdleNudging()` loop (~runner.ts:6424) | Periodically poll token usage; compare against budget          |
-| **Post-execution**   | `captureAgentReport()` (~runner.ts:6623)                     | Record final token count; deduct from workflow-level budget    |
-
-### Challenge: Live Token Polling
-
-The current collectors (`claude.ts`, `codex.ts`, `opencode.ts`) read session files **after** the agent exits. For mid-execution enforcement, one of these approaches is needed:
-
-1. **Tail the session JSONL** (Claude) or poll the SQLite DB (Codex/OpenCode) periodically during execution
-2. **Use the credential proxy's own budget tracking** — the proxy already receives the budget via JWT and could reject requests when exhausted
-3. **Parse PTY output** for token usage patterns (fragile, CLI-specific)
-
-**Recommendation**: Option 2 (credential proxy enforcement) for hard limits, with Option 1 (periodic polling) for soft warnings and reporting.
-
-## 5. Edge Cases
-
-### 5a. Concurrent Parallel Steps Sharing a Workflow Budget
-
-- Currently: each step's `maxTokens` is independent (per-agent constraint)
-- No workflow-level `maxTokens` field exists in `WorkflowDefinition` (types.ts:467-474)
-- **Gap**: If 5 parallel agents each have `maxTokens: 100_000`, the workflow could consume 500K tokens with no aggregate cap
-- **Fix needed**: Add `maxTokens` to `WorkflowDefinition` or `SwarmConfig`; maintain an `AtomicBudget` counter decremented by each step's actual consumption; use `Atomics` or a mutex for thread-safe concurrent deductions
-
-### 5b. Retry Attempts Consuming from the Same Budget
-
-- Retries are configured via `step.retries` (types.ts:552) and `AgentConstraints.retries` (types.ts:208)
-- Current retry logic re-spawns the agent with the same constraints
-- **Gap**: Each retry gets a fresh `maxTokens` budget (via new JWT), not the remaining budget from prior attempts
-- **Fix needed**: Track cumulative tokens across retries in `StepState`; deduct prior attempt's actual consumption from retry budget; fail the step if cumulative consumption exceeds `maxTokens × (retries + 1)` or a separate `maxTokensPerStep` field
-
-### 5c. Non-Interactive vs Interactive Agents
-
-- **Interactive agents** (PTY mode, `interactive: true`): Token collection works via session file parsing after exit. Mid-execution polling possible by tailing session files.
-- **Non-interactive agents** (`interactive: false`): Run as child processes with stdout capture. Token collection still works post-execution (collectors read the same session files). However, non-interactive agents using `preset: 'worker'` may not write session files if they're invoked with `--print` or similar flags.
-- **API-mode agents** (`cli: 'api'`): Use `executeApiStep()` (runner.ts:45) — token usage comes directly from API response `usage` field. Easiest to enforce in real-time.
-- **Gap**: No unified mid-execution token query interface across all three modes
-
-### 5d. Steps That Fail Before Collection Happens
-
-- `captureAgentReport()` is called in the step lifecycle regardless of success/failure (runner.ts:6623-6650)
-- But if a step crashes before the CLI writes any session data (e.g., spawn failure, immediate OOM), `collectCliSession()` returns `null` (cli-session-collector.ts:53)
-- **Gap**: Tokens consumed before crash are lost — the partial consumption is not tracked
-- **Fix needed**: For credential proxy mode, the proxy itself tracks per-session token consumption server-side. Query the proxy for actual consumption on step failure. For non-proxy mode, accept that crash-before-write results in underreporting.
-
-### 5e. Additional Edge Case: Token Counts Available Only After Exit
-
-- Claude collector reads `~/.claude/projects/.../<sessionId>.jsonl` which is written incrementally — can be tailed
-- Codex collector reads `~/.codex/state_5.sqlite` — SQLite is updated during execution, can be polled
-- OpenCode collector reads `~/.local/share/opencode/opencode.db` — same as Codex
-- **All three can theoretically be polled mid-execution**, but the current `CliSessionCollector` interface (`collect()`) is designed for post-execution one-shot reads, not streaming
-
-## Summary
-
-| Component                         | Status              | Location                                    |
-| --------------------------------- | ------------------- | ------------------------------------------- |
-| Token collection (post-execution) | IMPLEMENTED         | cli-session-collector.ts + collectors/\*.ts |
-| Token storage in memory           | IMPLEMENTED         | runner.ts:482 (agentReports Map)            |
-| Token persistence to disk         | IMPLEMENTED         | runner.ts:7135-7143 (\*.report.json)        |
-| Token display in summary          | IMPLEMENTED         | run-summary-table.ts                        |
-| maxTokens field in types          | DEFINED             | types.ts:206                                |
-| maxTokens passed to proxy JWT     | IMPLEMENTED         | runner.ts:1663-1688                         |
-| maxTokens enforcement in runner   | **NOT IMPLEMENTED** | —                                           |
-| Mid-execution token polling       | **NOT IMPLEMENTED** | —                                           |
-| Workflow-level aggregate budget   | **NOT IMPLEMENTED** | —                                           |
-| Cross-retry budget tracking       | **NOT IMPLEMENTED** | —                                           |
diff --git a/CUSTOM_VERIFY_DESIGN.md b/CUSTOM_VERIFY_DESIGN.md
deleted file mode 100644
index a7787aebf..000000000
--- a/CUSTOM_VERIFY_DESIGN.md
+++ /dev/null
@@ -1,169 +0,0 @@
-# Custom Verification Design
-
-## Overview
-
-The `custom` verification type allows workflow authors to run arbitrary shell commands
-(or regex patterns) as verification gates after an agent step completes. This replaces
-the need for separate deterministic steps to validate agent output.
-
-## Current Implementation Status
-
-Custom verification is **already implemented** in the codebase:
-
-- `packages/sdk/src/workflows/verification.ts` — `checkCustom()` function (lines 191-226)
-- `packages/sdk/src/workflows/types.ts` — `VerificationCheck` interface (lines 621-625)
-- `packages/sdk/src/workflows/schema.json` — `VerificationCheck` JSON schema
-
-## How It Works
-
-### Shell Command Mode
-
-The `value` field contains a shell command. After the agent step completes, the command
-is executed via `execSync`. The agent's output is available as `$STEP_OUTPUT` env var.
-
-```yaml
-verification:
-  type: 'custom'
-  value: 'cd nango-integrations && npx nango compile'
-```
-
-**Behavior:**
-
-- Exit code 0 = verification passed
-- Non-zero exit code = verification failed
-- stderr is captured as the verification error message
-- Configurable timeout via `CUSTOM_VERIFY_TIMEOUT_MS` env var (default: 30s)
-- Max output buffer: 1MB
-
-### Regex Mode
-
-Prefix the value with `regex:` to match a pattern against the step output:
-
-```yaml
-verification:
-  type: 'custom'
-  value: 'regex:Successfully compiled'
-```
-
-**Behavior:**
-
-- Pattern is compiled as a JavaScript `RegExp`
-- Tested against the step's combined output
-- Invalid regex returns a clear error message
-
-## Retry Integration
-
-When verification fails and `retries` is configured, the runner injects failure
-context into the retry prompt (runner.ts, lines 4195-4202):
-
-```
-[RETRY - Attempt 2/3]
-Previous attempt failed: Verification failed for "step-name": custom check failed - <stderr output>
-Previous output (last 2000 chars):
-<agent's prior output>
----
-<original task>
-```
-
-This gives the agent diagnostic context from the failed verification command,
-enabling it to fix the issue on retry.
-
-## Type Definition
-
-```typescript
-// packages/sdk/src/workflows/types.ts
-export interface VerificationCheck {
-  type: 'output_contains' | 'exit_code' | 'file_exists' | 'custom';
-  value: string;
-  description?: string;
-}
-```
-
-## Implementation Details
-
-### `checkCustom(value, output, cwd)` — verification.ts
-
-```typescript
-function checkCustom(value, output, cwd): { passed: boolean; stdout?: string; error?: string };
-```
-
-1. **Regex branch** (`value.startsWith('regex:')`)
-   - Strips prefix, compiles RegExp, tests against output
-   - Returns `{ passed: false, error }` on mismatch or invalid regex
-
-2. **Shell command branch** (default)
-   - Runs `execSync(value, { cwd, env: { ...process.env, STEP_OUTPUT: output } })`
-   - Timeout: `CUSTOM_VERIFY_TIMEOUT_MS` (default 30000)
-   - stdio: pipe (captures stdout + stderr)
-   - On success: `{ passed: true, stdout }`
-   - On failure: `{ passed: false, error: stderr || error.message }`
-
-### Side Effects on Failure
-
-When custom verification fails, `runVerification()` records:
-
-- A `verification_observed` tool side effect with `passed: false`
-- A `verification_failed` coordination signal in the step's evidence record
-- If `allowFailure` is false (default), throws `WorkflowCompletionError`
-
-### Side Effects on Success
-
-- A `verification_observed` tool side effect with `passed: true`
-- A `verification_passed` coordination signal
-- Returns `{ passed: true, completionReason: 'completed_verified' }`
-
-## Callback Variant (Future / Programmatic Use)
-
-For embedding the runner in another system where the host provides verification
-logic programmatically, a callback variant is reserved:
-
-```typescript
-// Proposed extension to VerificationCheck:
-interface VerificationCheck {
-  type: 'output_contains' | 'exit_code' | 'file_exists' | 'custom';
-  value: string;
-  description?: string;
-  /** Optional async callback for programmatic verification.
-   *  When provided with type: 'custom', the callback is invoked instead of
-   *  running the value as a shell command. */
-  callback?: (output: string) => Promise<boolean> | boolean;
-}
-```
-
-**Behavior:**
-
-- If `callback` is present and `type === 'custom'`, invoke the callback
-- The callback receives the step's combined output
-- Return `true` = passed, `false` = failed
-- The `value` field serves as a human-readable label in this mode
-- Falls back to shell command execution if no callback is provided
-
-**Note:** This callback field cannot be expressed in YAML — it's only available
-when using the runner programmatically via the SDK. The JSON schema does not
-include it; it lives only in the TypeScript type.
-
-## Backwards Compatibility
-
-- Existing workflows using `{ type: 'custom', value: '<command>' }` work unchanged
-- The `value` field is always required (enforced by schema)
-- Empty `value` with no callback will execute an empty command, which typically
-  succeeds (exit 0) — authors should always provide a meaningful command
-
-## Example Workflow
-
-```yaml
-workflows:
-  - name: build-and-verify
-    steps:
-      - name: implement-feature
-        agent: coder
-        task: 'Implement the new API endpoint'
-        verification:
-          type: 'custom'
-          value: 'cd nango-integrations && npx nango compile'
-          description: 'Ensure Nango integration compiles'
-        retries: 2
-```
-
-On failure, the coder agent receives the compile errors in its retry prompt
-and can fix the issues without a separate verification step.
diff --git a/DESIGN.md b/DESIGN.md
deleted file mode 100644
index c41a3ccad..000000000
--- a/DESIGN.md
+++ /dev/null
@@ -1,645 +0,0 @@
-# Credential Proxy — Design Document
-
-## Problem
-
-Nango runs AI agents in sandboxes that need LLM API access (OpenRouter, Anthropic, OpenAI). Today, raw API keys are passed as environment variables — agents can exfiltrate them. LiteLLM was rejected (heavy Python server). We need a lightweight, transparent proxy that:
-
-- Hides real API keys from sandbox agents
-- Validates short-lived JWTs instead of long-lived secrets
-- Forwards LLM requests unchanged (agents don't know they're proxied)
-- Meters token usage per workspace/session
-
----
-
-## Package Location
-
-```
-packages/credential-proxy/
-├── package.json
-├── tsconfig.json
-├── src/
-│   ├── index.ts              # Hono app factory + exports
-│   ├── router.ts             # Route definitions (/v1/chat/completions, /v1/messages)
-│   ├── jwt.ts                # JWT validation, claims extraction
-│   ├── credential-store.ts   # Interface to relay's encrypted credential storage
-│   ├── metering.ts           # Token usage extraction and recording
-│   ├── providers/
-│   │   ├── types.ts          # ProviderAdapter interface
-│   │   ├── openai.ts         # OpenAI adapter
-│   │   ├── anthropic.ts      # Anthropic adapter
-│   │   └── openrouter.ts     # OpenRouter adapter
-│   └── errors.ts             # Error types and HTTP error responses
-├── test/
-│   ├── jwt.test.ts
-│   ├── router.test.ts
-│   ├── metering.test.ts
-│   └── providers/
-│       ├── openai.test.ts
-│       ├── anthropic.test.ts
-│       └── openrouter.test.ts
-└── README.md
-```
-
-Follows the same structure as `packages/gateway/` — Hono replaces the raw HTTP handling, but the adapter dispatch pattern is identical.
-
----
-
-## JWT Claims Schema
-
-```typescript
-export interface ProxyJwtClaims {
-  /** Workspace ID (e.g., "wks_abc123") */
-  sub: string;
-
-  /** Fixed audience — must be "relay-llm-proxy" */
-  aud: 'relay-llm-proxy';
-
-  /** LLM provider this token authorizes */
-  provider: 'openai' | 'anthropic' | 'openrouter';
-
-  /** Reference to encrypted credential in relay's credential store */
-  credentialId: string;
-
-  /** Optional max tokens for this session (input + output combined) */
-  budget?: number;
-
-  /** Issued-at (unix seconds) */
-  iat: number;
-
-  /** Expiration (unix seconds) — default 15 min TTL */
-  exp: number;
-
-  /** Unique token ID for audit trail */
-  jti: string;
-
-  /** Issuer — "relay-credential-proxy" */
-  iss: string;
-}
-```
-
-**TTL policy:** 15 minutes default. Tokens are minted by the relay cloud API when a sandbox session starts. The sandbox receives only the JWT — never the underlying API key.
-
-**Signing:** HMAC-SHA256, following the pattern in `packages/sdk/src/provisioner/token.ts`. The signing secret is a per-workspace key stored in relay cloud, not in the proxy itself. The proxy receives the verification secret via environment variable or runtime config.
-
----
-
-## Request Flow
-
-```
-Agent (sandbox)
-  │
-  │  POST /v1/chat/completions  (or /v1/messages)
-  │  Authorization: Bearer <jwt>
-  │
-  ▼
-┌─────────────────────────────┐
-│  Credential Proxy (Hono)    │
-│                             │
-│  1. Extract JWT from header │
-│  2. Validate signature+exp  │
-│  3. Check budget (if set)   │
-│  4. Resolve real API key    │
-│     via credentialId        │
-│  5. Select provider adapter │
-│  6. Forward request with    │
-│     real API key            │
-│  7. Stream response back    │
-│  8. Extract token usage     │
-│  9. Record metering event   │
-└─────────────────────────────┘
-  │
-  ▼
-Provider API (OpenAI / Anthropic / OpenRouter)
-```
-
----
-
-## Provider Adapter Pattern
-
-Mirrors `packages/gateway/src/types.ts` — each surface adapter normalizes inbound/outbound messages. Here, each provider adapter normalizes auth headers and usage extraction.
-
-```typescript
-// src/providers/types.ts
-
-export interface ProviderAdapter {
-  /** Provider identifier */
-  readonly type: 'openai' | 'anthropic' | 'openrouter';
-
-  /** The upstream base URL for this provider */
-  readonly baseUrl: string;
-
-  /**
-   * Build the outgoing request headers.
-   * Replaces the proxy JWT with the real API key in the
-   * provider-specific auth header format.
-   */
-  buildHeaders(apiKey: string, incomingHeaders: Headers): Headers;
-
-  /**
-   * Map the incoming proxy path to the upstream provider path.
-   * e.g., /v1/chat/completions → /v1/chat/completions (OpenAI)
-   *        /v1/messages → /v1/messages (Anthropic)
-   */
-  upstreamPath(proxyPath: string): string;
-
-  /**
-   * Extract token usage from the provider's response body.
-   * Called after the full response is buffered (non-streaming)
-   * or after the stream ends (streaming).
-   */
-  extractUsage(responseBody: unknown): TokenUsage | null;
-
-  /**
-   * Extract token usage from a streaming chunk (SSE data).
-   * Returns null for non-final chunks. Returns usage from the
-   * final chunk that includes it (e.g., OpenAI's last chunk
-   * with usage field, Anthropic's message_stop event).
-   */
-  extractStreamingUsage(chunk: string): TokenUsage | null;
-}
-
-export interface TokenUsage {
-  inputTokens: number;
-  outputTokens: number;
-  totalTokens: number;
-  model?: string;
-}
-```
-
-### Adapter Implementations
-
-**OpenAI** (`src/providers/openai.ts`):
-
-- Base URL: `https://api.openai.com`
-- Auth header: `Authorization: Bearer <key>`
-- Path: `/v1/chat/completions` (passthrough)
-- Usage: `response.usage.prompt_tokens`, `response.usage.completion_tokens`
-- Streaming: final SSE chunk contains `usage` when `stream_options.include_usage` is set; proxy injects this option
-
-**Anthropic** (`src/providers/anthropic.ts`):
-
-- Base URL: `https://api.anthropic.com`
-- Auth header: `x-api-key: <key>` (NOT Bearer)
-- Also sets: `anthropic-version: 2023-06-01`
-- Path: `/v1/messages` (passthrough)
-- Usage: `response.usage.input_tokens`, `response.usage.output_tokens`
-- Streaming: `message_delta` event contains `usage` in the final event
-
-**OpenRouter** (`src/providers/openrouter.ts`):
-
-- Base URL: `https://openrouter.ai/api`
-- Auth header: `Authorization: Bearer <key>`
-- Path: `/v1/chat/completions` (passthrough — OpenAI-compatible)
-- Usage: `response.usage.prompt_tokens`, `response.usage.completion_tokens`
-- Streaming: same as OpenAI format
-
----
-
-## Router Design
-
-```typescript
-// src/router.ts
-
-import { Hono } from 'hono';
-import type { ProxyJwtClaims } from './jwt.js';
-
-const app = new Hono();
-
-// Health check
-app.get('/health', (c) => c.json({ status: 'ok' }));
-
-// OpenAI-compatible endpoint
-app.post('/v1/chat/completions', jwtMiddleware, proxyHandler);
-
-// Anthropic-compatible endpoint
-app.post('/v1/messages', jwtMiddleware, proxyHandler);
-```
-
-**Route → Provider mapping:**
-
-- `/v1/chat/completions` → uses `claims.provider` to select OpenAI or OpenRouter adapter
-- `/v1/messages` → Anthropic adapter (validated against `claims.provider === "anthropic"`)
-
-If the route doesn't match the JWT's `provider` claim, return 400.
-
-**jwtMiddleware** extracts and validates the JWT, attaches claims to context:
-
-```typescript
-async function jwtMiddleware(c: Context, next: Next) {
-  const token = c.req.header('Authorization')?.replace('Bearer ', '');
-  if (!token) return c.json({ error: 'Missing authorization' }, 401);
-
-  const claims = await validateJwt(token, signingSecret);
-  c.set('claims', claims);
-  await next();
-}
-```
-
-**proxyHandler** orchestrates the forward-and-stream:
-
-```typescript
-async function proxyHandler(c: Context) {
-  const claims = c.get('claims') as ProxyJwtClaims;
-  const adapter = resolveAdapter(claims.provider);
-  const apiKey = await credentialStore.resolve(claims.credentialId);
-
-  // Budget check
-  if (claims.budget) {
-    const used = await metering.getSessionUsage(claims.jti);
-    if (used >= claims.budget) {
-      return c.json({ error: 'Token budget exceeded' }, 429);
-    }
-  }
-
-  // Build upstream request
-  const headers = adapter.buildHeaders(apiKey, c.req.raw.headers);
-  const upstreamUrl = `${adapter.baseUrl}${adapter.upstreamPath(c.req.path)}`;
-  const body = await c.req.text();
-
-  const isStreaming = JSON.parse(body).stream === true;
-
-  const upstream = await fetch(upstreamUrl, {
-    method: 'POST',
-    headers,
-    body,
-  });
-
-  if (!upstream.ok) {
-    // Pass through provider errors unchanged
-    return new Response(upstream.body, {
-      status: upstream.status,
-      headers: upstream.headers,
-    });
-  }
-
-  if (isStreaming) {
-    return streamResponse(c, upstream, adapter, claims);
-  } else {
-    return bufferedResponse(c, upstream, adapter, claims);
-  }
-}
-```
-
-### Streaming Strategy
-
-For streaming responses, the proxy pipes SSE chunks through unchanged, but taps each chunk to detect usage:
-
-```typescript
-async function streamResponse(c, upstream, adapter, claims) {
-  const { readable, writable } = new TransformStream();
-  const writer = writable.getWriter();
-  const reader = upstream.body.getReader();
-  const decoder = new TextDecoder();
-
-  let finalUsage: TokenUsage | null = null;
-
-  // Pipe in background
-  (async () => {
-    while (true) {
-      const { done, value } = await reader.read();
-      if (done) break;
-      const text = decoder.decode(value, { stream: true });
-      const usage = adapter.extractStreamingUsage(text);
-      if (usage) finalUsage = usage;
-      await writer.write(value); // Pass through unchanged
-    }
-    writer.close();
-
-    // Record usage after stream ends
-    if (finalUsage) {
-      await metering.record(claims, finalUsage);
-    }
-  })();
-
-  return new Response(readable, {
-    headers: {
-      'content-type': 'text/event-stream',
-      'cache-control': 'no-cache',
-      connection: 'keep-alive',
-    },
-  });
-}
-```
-
----
-
-## JWT Validation
-
-```typescript
-// src/jwt.ts
-
-import { createHmac, timingSafeEqual } from 'node:crypto';
-import type { ProxyJwtClaims } from './providers/types.js';
-
-const ALLOWED_AUDIENCES = ['relay-llm-proxy'] as const;
-const CLOCK_SKEW_SECONDS = 30;
-
-export function validateJwt(token: string, secret: string): ProxyJwtClaims {
-  const parts = token.split('.');
-  if (parts.length !== 3) throw new JwtError('Malformed token');
-
-  const [headerB64, payloadB64, signatureB64] = parts;
-
-  // 1. Verify signature (HMAC-SHA256, timing-safe)
-  const unsigned = `${headerB64}.${payloadB64}`;
-  const expected = createHmac('sha256', secret).update(unsigned).digest('base64url');
-
-  if (!timingSafeEqual(Buffer.from(expected), Buffer.from(signatureB64))) {
-    throw new JwtError('Invalid signature');
-  }
-
-  // 2. Decode and parse
-  const header = JSON.parse(base64urlDecode(headerB64));
-  if (header.alg !== 'HS256') throw new JwtError('Unsupported algorithm');
-
-  const claims = JSON.parse(base64urlDecode(payloadB64)) as ProxyJwtClaims;
-
-  // 3. Validate standard claims
-  const now = Math.floor(Date.now() / 1000);
-  if (claims.exp < now - CLOCK_SKEW_SECONDS) {
-    throw new JwtError('Token expired');
-  }
-  if (claims.aud !== 'relay-llm-proxy') {
-    throw new JwtError('Invalid audience');
-  }
-  if (!['openai', 'anthropic', 'openrouter'].includes(claims.provider)) {
-    throw new JwtError('Invalid provider');
-  }
-
-  return claims;
-}
-```
-
-Follows the same HMAC-SHA256 + `timingSafeEqual` pattern used in `packages/sdk/src/provisioner/token.ts` and `packages/gateway/src/adapters/slack.ts`.
-
----
-
-## Credential Store Integration
-
-The proxy resolves real API keys via `credentialId` from the JWT claims. This integrates with relay cloud's encrypted credential storage (`packages/cloud/src/`).
-
-```typescript
-// src/credential-store.ts
-
-export interface CredentialStore {
-  /**
-   * Resolve a credentialId to the decrypted API key.
-   * The credentialId is an opaque reference stored in the JWT claims.
-   * The actual key is encrypted at rest in relay cloud (S3 + KMS).
-   */
-  resolve(credentialId: string): Promise<string>;
-}
-```
-
-### Implementation Options
-
-**Option A — API call to relay cloud** (recommended for production):
-
-```typescript
-export class CloudCredentialStore implements CredentialStore {
-  constructor(
-    private readonly apiUrl: string,
-    private readonly serviceToken: string
-  ) {}
-
-  async resolve(credentialId: string): Promise<string> {
-    const res = await fetch(`${this.apiUrl}/api/v1/credentials/${credentialId}`, {
-      headers: { authorization: `Bearer ${this.serviceToken}` },
-    });
-    if (!res.ok) throw new CredentialError(`Failed to resolve: ${res.status}`);
-    const { apiKey } = await res.json();
-    return apiKey;
-  }
-}
-```
-
-This follows the same pattern as `packages/cloud/src/auth.ts` — the proxy never holds decryption keys; cloud API decrypts via KMS and returns the plaintext key over a TLS-protected internal channel.
-
-**Option B — Local cache with TTL** (for performance):
-
-```typescript
-export class CachedCredentialStore implements CredentialStore {
-  private cache = new Map<string, { key: string; expiresAt: number }>();
-  private readonly ttlMs = 5 * 60 * 1000; // 5 min cache
-
-  constructor(private readonly inner: CredentialStore) {}
-
-  async resolve(credentialId: string): Promise<string> {
-    const cached = this.cache.get(credentialId);
-    if (cached && cached.expiresAt > Date.now()) return cached.key;
-
-    const key = await this.inner.resolve(credentialId);
-    this.cache.set(credentialId, { key, expiresAt: Date.now() + this.ttlMs });
-    return key;
-  }
-}
-```
-
-The cache must be bounded (LRU or size cap) and the TTL kept short since credentials can be rotated.
-
----
-
-## Metering Data Model
-
-```typescript
-// src/metering.ts
-
-export interface MeteringEvent {
-  /** Unique event ID */
-  id: string;
-
-  /** ISO 8601 timestamp */
-  timestamp: string;
-
-  /** From JWT claims */
-  workspaceId: string; // claims.sub
-  provider: string; // claims.provider
-  credentialId: string; // claims.credentialId
-  tokenId: string; // claims.jti (for budget tracking)
-
-  /** From provider response */
-  model: string; // e.g., "gpt-4o", "claude-sonnet-4-20250514"
-  inputTokens: number;
-  outputTokens: number;
-  totalTokens: number;
-
-  /** Request metadata */
-  streaming: boolean;
-  statusCode: number;
-  latencyMs: number;
-}
-```
-
-### Recording Strategy
-
-**Phase 1 — Append to local log** (simple, works everywhere):
-
-```typescript
-export class MeteringRecorder {
-  async record(claims: ProxyJwtClaims, usage: TokenUsage, meta: RequestMeta): Promise<void> {
-    const event: MeteringEvent = {
-      id: crypto.randomUUID(),
-      timestamp: new Date().toISOString(),
-      workspaceId: claims.sub,
-      provider: claims.provider,
-      credentialId: claims.credentialId,
-      tokenId: claims.jti,
-      model: usage.model ?? 'unknown',
-      inputTokens: usage.inputTokens,
-      outputTokens: usage.outputTokens,
-      totalTokens: usage.totalTokens,
-      streaming: meta.streaming,
-      statusCode: meta.statusCode,
-      latencyMs: meta.latencyMs,
-    };
-    // Emit to configured sink (stdout JSON line, or POST to metering API)
-    this.sink.emit(event);
-  }
-
-  async getSessionUsage(tokenId: string): Promise<number> {
-    // Sum totalTokens for this jti (for budget enforcement)
-    return this.sink.sumByTokenId(tokenId);
-  }
-}
-```
-
-**Phase 2 — Push to relay cloud metering API** (for billing):
-
-- Batch events and flush every N seconds or N events
-- POST to `/api/v1/metering/events`
-- Cloud aggregates per workspace for billing
-
-**Metering sinks** (pluggable):
-
-- `StdoutSink` — JSON lines to stdout (Lambda CloudWatch / local dev)
-- `ApiSink` — POST to relay cloud metering endpoint
-- `InMemorySink` — for tests and budget enforcement in single-process mode
-
----
-
-## Error Handling
-
-| Error Condition                    | HTTP Status  | Response Body                                 |
-| ---------------------------------- | ------------ | --------------------------------------------- |
-| Missing Authorization header       | 401          | `{ "error": "Missing authorization" }`        |
-| Malformed JWT                      | 401          | `{ "error": "Malformed token" }`              |
-| Invalid JWT signature              | 401          | `{ "error": "Invalid signature" }`            |
-| Expired JWT                        | 401          | `{ "error": "Token expired" }`                |
-| Wrong audience claim               | 401          | `{ "error": "Invalid audience" }`             |
-| Provider mismatch (route vs claim) | 400          | `{ "error": "Provider mismatch" }`            |
-| Credential not found               | 502          | `{ "error": "Credential resolution failed" }` |
-| Budget exceeded                    | 429          | `{ "error": "Token budget exceeded" }`        |
-| Provider returns error             | pass-through | Provider's original error response            |
-| Provider unreachable               | 502          | `{ "error": "Upstream unreachable" }`         |
-| Provider rate limit (429)          | 429          | Provider's original 429 response              |
-
-**Design principle:** Provider errors are passed through unchanged. The agent SDK already handles OpenAI/Anthropic error formats — the proxy should not transform them. Only proxy-level errors (JWT, budget, credential resolution) use the proxy's own error format.
-
-```typescript
-// src/errors.ts
-
-export class ProxyError extends Error {
-  constructor(
-    message: string,
-    public readonly status: number,
-    public readonly code: string
-  ) {
-    super(message);
-  }
-}
-
-export class JwtError extends ProxyError {
-  constructor(message: string) {
-    super(message, 401, 'jwt_error');
-  }
-}
-
-export class CredentialError extends ProxyError {
-  constructor(message: string) {
-    super(message, 502, 'credential_error');
-  }
-}
-
-export class BudgetExceededError extends ProxyError {
-  constructor() {
-    super('Token budget exceeded', 429, 'budget_exceeded');
-  }
-}
-```
-
-Hono error handler catches `ProxyError` and returns structured JSON:
-
-```typescript
-app.onError((err, c) => {
-  if (err instanceof ProxyError) {
-    return c.json({ error: err.message, code: err.code }, err.status);
-  }
-  console.error('Unexpected error:', err);
-  return c.json({ error: 'Internal server error' }, 500);
-});
-```
-
----
-
-## Deployment Targets
-
-Hono runs on all of these with zero code changes:
-
-| Target                 | Entry Point               | Notes                     |
-| ---------------------- | ------------------------- | ------------------------- |
-| **Node.js**            | `hono/node-server`        | Local dev, Docker, EC2    |
-| **AWS Lambda**         | `hono/aws-lambda`         | Nango's likely deployment |
-| **Cloudflare Workers** | `hono/cloudflare-workers` | Edge deployment           |
-
-The `src/index.ts` exports the Hono app; the deployment adapter wraps it:
-
-```typescript
-// src/index.ts
-export { createProxy } from './router.js';
-
-// For Node.js standalone:
-// import { serve } from '@hono/node-server';
-// import { createProxy } from './index.js';
-// serve({ fetch: createProxy({ ... }).fetch, port: 3001 });
-```
-
----
-
-## Security Considerations
-
-1. **No key exposure** — API keys never leave the proxy process. They are fetched from the credential store, used in the upstream request, and discarded. Never logged.
-
-2. **Short-lived tokens** — 15 min default TTL. Even if a JWT leaks, the blast radius is time-bounded and budget-capped.
-
-3. **Budget enforcement** — Optional per-session token budget prevents runaway costs from compromised or buggy agents.
-
-4. **Timing-safe comparison** — JWT signature validation uses `timingSafeEqual` to prevent timing attacks (same pattern as gateway's Slack signature verification).
-
-5. **No credential caching without TTL** — If caching is enabled, it's bounded and short-lived. Credential rotation takes effect within the cache TTL.
-
-6. **Provider error passthrough** — The proxy doesn't leak internal state in error messages. Provider errors are forwarded as-is; proxy errors use minimal, fixed messages.
-
-7. **Audit trail** — Every request is metered with workspace, provider, model, and token ID. Combined with JWT `jti`, this enables per-session forensics.
-
----
-
-## Integration with Existing Packages
-
-| Package            | Integration                                                                                                 |
-| ------------------ | ----------------------------------------------------------------------------------------------------------- |
-| `packages/sdk`     | JWT minting functions extended to mint proxy tokens; `TokenClaims` type extended with proxy-specific fields |
-| `packages/cloud`   | Credential store API serves decrypted keys to the proxy; new `/api/v1/credentials/:id` endpoint             |
-| `packages/gateway` | No direct integration; shared adapter pattern for consistency                                               |
-| `packages/config`  | Proxy configuration (signing secret, credential store URL) follows existing config patterns                 |
-
----
-
-## Open Questions
-
-1. **Multi-region credential store** — Should the proxy cache credentials regionally, or always call the central credential store? Latency vs. consistency tradeoff.
-
-2. **Token renewal** — Should the proxy support a `/v1/token/refresh` endpoint, or should the orchestrator (Nango) mint new tokens directly from relay cloud?
-
-3. **Model allowlisting** — Should the JWT claims include an allowed model list, or is provider-level access sufficient?
-
-4. **Request body inspection** — Should the proxy inspect/modify request bodies (e.g., inject `stream_options.include_usage` for OpenAI), or keep the body strictly opaque?
diff --git a/PROXY_INTEGRATION.md b/PROXY_INTEGRATION.md
deleted file mode 100644
index 5ac5345cc..000000000
--- a/PROXY_INTEGRATION.md
+++ /dev/null
@@ -1,394 +0,0 @@
-# Credential Proxy Integration Plan
-
-## Overview
-
-Integrate the credential proxy into the workflow runner so that agents receive proxy JWTs instead of raw API keys. When `credentials.proxy: true` is set on an agent, the runner mints a scoped JWT and injects proxy env vars — the agent never sees the real API key.
-
----
-
-## 1. New Config Fields
-
-### `agents[].credentials` (AgentDefinition)
-
-Add an optional `credentials` block to `AgentDefinition` in `packages/sdk/src/workflows/types.ts`:
-
-```typescript
-export interface AgentCredentials {
-  /** Opt-in to credential proxy mode. When true, the runner mints a proxy JWT
-   *  and injects RELAY_LLM_PROXY_URL + RELAY_LLM_PROXY_TOKEN instead of raw keys. */
-  proxy?: boolean;
-  /** Override the default budget (max tokens) for this agent's proxy session. */
-  budget?: number;
-  /** Override which providers this agent can access (defaults to all configured). */
-  providers?: ProviderType[];
-}
-
-export interface AgentDefinition {
-  // ... existing fields ...
-  credentials?: AgentCredentials;
-}
-```
-
-### `swarm.credentialProxy` (SwarmConfig)
-
-Add an optional `credentialProxy` block to `SwarmConfig`:
-
-```typescript
-export interface CredentialProxyConfig {
-  /** The proxy endpoint URL (e.g. "https://agentrelay.com/llm-proxy"). */
-  proxyUrl: string;
-  /** JWT signing secret. Supports env var reference: "$RELAY_PROXY_SECRET". */
-  jwtSecret: string;
-  /** Default max-token budget per agent session. */
-  defaultBudget?: number;
-  /** Provider-to-credential mapping. */
-  providers: Partial<Record<ProviderType, { credentialId: string }>>;
-}
-
-export interface SwarmConfig {
-  // ... existing fields ...
-  credentialProxy?: CredentialProxyConfig;
-}
-```
-
----
-
-## 2. Runner Modifications
-
-All changes in `packages/sdk/src/workflows/runner.ts`.
-
-### 2a. Import credential-proxy JWT minting
-
-```typescript
-import { mintProxyToken, type ProxyTokenClaims } from '@agent-relay/credential-proxy/jwt';
-```
-
-### 2b. New instance state
-
-```typescript
-/** Minted proxy tokens keyed by agent definition name. */
-private proxyTokens = new Map<string, string>();
-```
-
-### 2c. Mint tokens in `provisionAgents()` (~line 1547)
-
-After the existing provisioning loop, add proxy token minting:
-
-```typescript
-// ── Credential proxy provisioning ──────────────────────────────────
-const proxyConfig = config.swarm.credentialProxy;
-if (proxyConfig) {
-  for (const agent of config.agents) {
-    if (!agent.credentials?.proxy) continue;
-
-    const providers = agent.credentials.providers ?? (Object.keys(proxyConfig.providers) as ProviderType[]);
-
-    // Mint one JWT per provider per agent
-    // For simplicity, mint for the first configured provider.
-    // Multi-provider support: mint multiple tokens or a multi-provider token.
-    for (const provider of providers) {
-      const providerConfig = proxyConfig.providers[provider];
-      if (!providerConfig) continue;
-
-      const claims: ProxyTokenClaims = {
-        sub: `${this.workspaceId}:${agent.name}`,
-        aud: 'relay-llm-proxy',
-        provider,
-        credentialId: providerConfig.credentialId,
-        budget: agent.credentials.budget ?? proxyConfig.defaultBudget,
-      };
-
-      const secret = proxyConfig.jwtSecret.startsWith('$')
-        ? (process.env[proxyConfig.jwtSecret.slice(1)] ?? proxyConfig.jwtSecret)
-        : proxyConfig.jwtSecret;
-
-      const token = await mintProxyToken(claims, secret);
-      // Key: "agentName:provider" for multi-provider, or just agentName for single
-      this.proxyTokens.set(`${agent.name}:${provider}`, token);
-    }
-  }
-}
-```
-
-### 2d. Modify `getRelayEnv()` (~line 1535)
-
-No changes needed here — proxy env vars are injected at the spawn site (2e/2f) rather than globally, because only proxy-enabled agents should receive them.
-
-### 2e. Modify `execNonInteractive()` (~line 5572)
-
-After the existing `agentToken`/`mount` injection block, add proxy env injection:
-
-```typescript
-// ── Credential proxy env injection ─────────────────────────────────
-const proxyConfig = this.currentConfig?.swarm?.credentialProxy;
-if (proxyConfig && agentDef.credentials?.proxy) {
-  const cliOverrides = resolveCliBaseUrlOverrides(agentDef.cli, proxyConfig.proxyUrl);
-  Object.assign(env, cliOverrides);
-
-  // Inject proxy token(s) — find all tokens for this agent
-  for (const [key, token] of this.proxyTokens) {
-    if (key.startsWith(`${agentDef.name}:`)) {
-      const provider = key.split(':')[1];
-      env[`RELAY_LLM_PROXY_TOKEN_${provider.toUpperCase()}`] = token;
-    }
-  }
-  env.RELAY_LLM_PROXY_URL = proxyConfig.proxyUrl;
-
-  // Strip raw API keys so the agent can't bypass the proxy
-  delete env.OPENAI_API_KEY;
-  delete env.ANTHROPIC_API_KEY;
-  delete env.OPENROUTER_API_KEY;
-}
-```
-
-### 2f. Modify `spawnAndWait()` (~line 5831)
-
-In the `spawnOptions` construction, pass proxy env via the spawn options:
-
-```typescript
-const spawnEnvOverrides: Record<string, string> = {};
-const proxyConfig = this.currentConfig?.swarm?.credentialProxy;
-if (proxyConfig && agentDef.credentials?.proxy) {
-  const cliOverrides = resolveCliBaseUrlOverrides(agentDef.cli, proxyConfig.proxyUrl);
-  Object.assign(spawnEnvOverrides, cliOverrides);
-
-  for (const [key, token] of this.proxyTokens) {
-    if (key.startsWith(`${agentDef.name}:`)) {
-      const provider = key.split(':')[1];
-      spawnEnvOverrides[`RELAY_LLM_PROXY_TOKEN_${provider.toUpperCase()}`] = token;
-    }
-  }
-  spawnEnvOverrides.RELAY_LLM_PROXY_URL = proxyConfig.proxyUrl;
-}
-
-// Pass spawnEnvOverrides into spawnOptions.env (needs relay.spawnPty to accept env)
-```
-
-### 2g. Modify `filteredEnv()` (~line 150)
-
-Add a `stripApiKeys` parameter:
-
-```typescript
-function filteredEnv(
-  extra?: Record<string, string | undefined>,
-  options?: { stripApiKeys?: boolean }
-): Record<string, string | undefined> {
-  const env: Record<string, string | undefined> = {};
-  const stripKeys = new Set(
-    options?.stripApiKeys ? ['OPENAI_API_KEY', 'ANTHROPIC_API_KEY', 'OPENROUTER_API_KEY'] : []
-  );
-  for (const key of ENV_ALLOWLIST) {
-    if (stripKeys.has(key)) continue;
-    if (process.env[key] !== undefined) {
-      env[key] = process.env[key];
-    }
-  }
-  if (extra) {
-    Object.assign(env, extra);
-  }
-  return env;
-}
-```
-
-Note: Currently none of `OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, or `OPENROUTER_API_KEY` are in the `ENV_ALLOWLIST` (line 113-147), so they already do NOT propagate through `filteredEnv()`. They would only leak through `getRelayEnv()` which spreads `...process.env`. The `delete` statements in 2e handle this case.
-
----
-
-## 3. CLI Base URL Override Registry
-
-New file: `packages/sdk/src/workflows/cli-proxy-overrides.ts`
-
-Each coding agent CLI uses different env vars to override the LLM API base URL. The proxy works by redirecting these base URLs to the proxy endpoint.
-
-```typescript
-import type { AgentCli } from './types.js';
-
-/** Maps CLI name -> env var overrides needed to redirect LLM calls through the proxy. */
-const CLI_BASE_URL_OVERRIDES: Record<string, (proxyUrl: string) => Record<string, string>> = {
-  // Claude Code
-  claude: (url) => ({
-    ANTHROPIC_BASE_URL: url,
-    ANTHROPIC_API_KEY: 'proxy', // Claude Code requires a non-empty key
-  }),
-
-  // OpenAI Codex CLI
-  codex: (url) => ({
-    OPENAI_BASE_URL: url,
-    OPENAI_API_KEY: 'proxy',
-  }),
-
-  // OpenCode
-  opencode: (url) => ({
-    OPENAI_BASE_URL: url,
-    OPENAI_API_KEY: 'proxy',
-  }),
-
-  // Aider
-  aider: (url) => ({
-    OPENAI_API_BASE: url,
-    OPENAI_API_KEY: 'proxy',
-  }),
-
-  // Gemini CLI
-  gemini: (url) => ({
-    GOOGLE_API_BASE: url,
-  }),
-
-  // Goose (uses OpenAI-compatible endpoint)
-  goose: (url) => ({
-    OPENAI_BASE_URL: url,
-    OPENAI_API_KEY: 'proxy',
-  }),
-
-  // Droid (uses OpenAI-compatible endpoint)
-  droid: (url) => ({
-    OPENAI_BASE_URL: url,
-    OPENAI_API_KEY: 'proxy',
-  }),
-
-  // Cursor / Cursor Agent (uses OpenAI-compatible endpoint)
-  cursor: (url) => ({
-    OPENAI_BASE_URL: url,
-    OPENAI_API_KEY: 'proxy',
-  }),
-  'cursor-agent': (url) => ({
-    OPENAI_BASE_URL: url,
-    OPENAI_API_KEY: 'proxy',
-  }),
-};
-
-/** Generic fallback: set both major provider base URLs. */
-const GENERIC_FALLBACK = (url: string): Record<string, string> => ({
-  OPENAI_BASE_URL: url,
-  ANTHROPIC_BASE_URL: url,
-  OPENAI_API_KEY: 'proxy',
-  ANTHROPIC_API_KEY: 'proxy',
-});
-
-/**
- * Resolve the env var overrides needed to route a CLI's LLM calls through the proxy.
- *
- * @param cli - The agent CLI type (e.g. "claude", "codex", "aider")
- * @param proxyUrl - The credential proxy endpoint URL
- * @returns Record of env vars to inject into the agent's environment
- */
-export function resolveCliBaseUrlOverrides(cli: AgentCli | string, proxyUrl: string): Record<string, string> {
-  const resolver = CLI_BASE_URL_OVERRIDES[cli] ?? GENERIC_FALLBACK;
-  return resolver(proxyUrl);
-}
-```
-
----
-
-## 4. Workflow Config Example
-
-```yaml
-version: '1'
-name: multi-agent-with-proxy
-description: Agents use credential proxy instead of raw API keys
-
-swarm:
-  pattern: fan-out
-  credentialProxy:
-    proxyUrl: 'https://agentrelay.com/llm-proxy'
-    jwtSecret: '$RELAY_PROXY_SECRET' # resolved from env
-    defaultBudget: 100000
-    providers:
-      anthropic:
-        credentialId: 'nango-anthropic-prod'
-      openai:
-        credentialId: 'nango-openai-prod'
-      openrouter:
-        credentialId: 'nango-openrouter-prod'
-
-agents:
-  - name: generator
-    cli: claude
-    role: 'Code generator'
-    credentials:
-      proxy: true # opt-in to proxy mode
-    # Agent receives ANTHROPIC_BASE_URL pointing to proxy
-    # and a scoped JWT — never sees the real Anthropic key
-
-  - name: reviewer
-    cli: codex
-    role: 'Code reviewer'
-    credentials:
-      proxy: true
-      budget: 50000 # override default budget
-    # Agent receives OPENAI_BASE_URL pointing to proxy
-
-  - name: legacy-agent
-    cli: aider
-    role: 'Legacy helper'
-    # No credentials.proxy — gets normal env, no proxy
-```
-
----
-
-## 5. Data Flow
-
-```
-relay.yaml                    Runner                         Agent Process
-─────────                    ──────                         ─────────────
-credentialProxy config ──→ provisionAgents()
-                            │
-                            ├─ for each agent w/ proxy:true
-                            │   └─ mintProxyToken(claims, secret) ──→ JWT
-                            │
-                            ├─ spawnAndWait() / execNonInteractive()
-                            │   ├─ resolveCliBaseUrlOverrides(cli, proxyUrl)
-                            │   ├─ inject RELAY_LLM_PROXY_URL
-                            │   ├─ inject RELAY_LLM_PROXY_TOKEN_<PROVIDER>
-                            │   ├─ inject CLI-specific base URL overrides
-                            │   └─ strip raw API keys from env
-                            │
-                            └─ Agent spawns with proxy env ──→  CLI makes API call
-                                                                  │
-                                                                  ├─ Base URL → proxy
-                                                                  ├─ Proxy validates JWT
-                                                                  ├─ Proxy fetches real
-                                                                  │  credential from Nango
-                                                                  └─ Proxy forwards to
-                                                                     real provider API
-```
-
----
-
-## 6. Backwards Compatibility
-
-- **No `credentialProxy` in swarm config**: Zero behavior change. No proxy tokens minted.
-- **No `credentials.proxy` on agent**: Zero behavior change. Agent gets normal env.
-- **Mixed mode**: Some agents use proxy, others don't. Each agent's env is independent.
-- **`filteredEnv()` unchanged**: Raw API keys are already excluded from the allowlist. Only `getRelayEnv()` (which spreads `process.env`) could leak them, and the proxy injection code explicitly deletes them.
-
----
-
-## 7. Security Considerations
-
-- **JWT scope**: Each token is scoped to one agent + one provider + one credential. An agent cannot use another agent's token for a different provider.
-- **Budget enforcement**: The proxy validates budget claims and rejects requests that exceed the token's budget.
-- **Key stripping**: Raw API keys (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, `OPENROUTER_API_KEY`) are deleted from the agent's env when proxy mode is active, preventing bypass.
-- **Secret resolution**: `jwtSecret` supports `$ENV_VAR` syntax so the secret never appears in YAML files.
-- **Token TTL**: Tokens use the 15-minute default TTL from `DEFAULT_PROXY_TOKEN_TTL_SECONDS`. For long-running agents, the runner should refresh tokens (future enhancement).
-
----
-
-## 8. Files to Modify
-
-| File                                                | Change                                                                                                        |
-| --------------------------------------------------- | ------------------------------------------------------------------------------------------------------------- |
-| `packages/sdk/src/workflows/types.ts`               | Add `AgentCredentials`, `CredentialProxyConfig`, update `AgentDefinition` and `SwarmConfig`                   |
-| `packages/sdk/src/workflows/runner.ts`              | Import proxy JWT, add `proxyTokens` map, modify `provisionAgents()`, `execNonInteractive()`, `spawnAndWait()` |
-| `packages/sdk/src/workflows/cli-proxy-overrides.ts` | **New file** — CLI base URL override registry                                                                 |
-| `packages/sdk/src/workflows/schema.json`            | Add `credentialProxy` and `credentials` to validation schema                                                  |
-
----
-
-## 9. Implementation Order
-
-1. **Types first** — Add `AgentCredentials` and `CredentialProxyConfig` to `types.ts`
-2. **CLI overrides** — Create `cli-proxy-overrides.ts` with the resolver registry
-3. **Runner integration** — Wire up minting + env injection in `runner.ts`
-4. **Schema update** — Add new fields to `schema.json` for YAML validation
-5. **Tests** — Unit tests for `resolveCliBaseUrlOverrides()`, integration tests for env injection
diff --git a/README.md b/README.md
index c91d2105d..6e5c65741 100644
--- a/README.md
+++ b/README.md
@@ -1,33 +1,55 @@
-![Agent Relay](./readme-banner.png)
+<img src="./readme-banner.png" alt="Agent Relay" height="392">
 
+**Website:** [agentrelay.com](https://agentrelay.com) · **Docs:** [agentrelay.com/docs](https://agentrelay.com/docs)
 
+<a href="https://www.npmjs.com/package/@agent-relay/sdk"><img alt="npm" src="https://img.shields.io/npm/v/@agent-relay/sdk"></a>
+<a href="https://github.com/AgentWorkforce/relay/actions/workflows/test.yml"><img alt="Tests" src="https://img.shields.io/github/actions/workflow/status/AgentWorkforce/relay/test.yml?branch=main&label=tests"></a>
+<a href="./LICENSE"><img alt="License" src="https://img.shields.io/badge/license-Apache--2.0-blue.svg"></a>
 
-<div align="center">
+</div>
 
-  
+## Multi Agent Orchestration
 
-[![Featured on OSSCAR](https://osscar.dev/api/badge?slug=agentworkforce)](https://osscar.dev/org/agentworkforce)
+Enable your Claude Code, Codex, OpenCode agent spawn agent teams that can communicate and collaborate. Not subagents, but real agents who
+could spawn their own subagents. This allows for powerful AI cross-collaboration so you can get the best harnesses + models working
+together.
 
-Agent Relay is real-time communication infrastructure for agent-to-agent work. Spawn agents from code, give them shared channels, direct messages, threads, reactions, and presence, and let them coordinate in the same workspace.
+## Benefits Over Subagents
 
-It is not a framework or a harness. Your agents keep running however they already run. Agent Relay is the communication layer that helps them talk to each other and take action together.
+1. The agent orchestrating has full insight what the spawned agents are doing. It can read the logs and steer mid turn if needed
+2. Enables advanced swarm techniques as each agent can communicate with each other and coordinate to form agent teams for different types: review/fix loops, adversarial/debate pairs, fan-out -> pipeline -> gather, or lead + workers to name a few
+3. Diversity of thought and implementation. Codex implement, Claude review, Gemini do the final verification leads to better results as different models + harnesses excel in different things.
+4. Review happens as a conversation between the live reviewer and the live implementer, not as a report handed back to the parent after each one finishes.
+5. Audit trail exists outside the agent and outside the parent. With the [Agent Relay Observer](https://agentrelay.com/observer) you get full auditability into every single DM and group message sent by the agents.
 
-**Website:** [agentrelay.com](https://agentrelay.com) · **Docs:** [agentrelay.com/docs](https://agentrelay.com/docs)
+## Get Started
 
-  <a href="https://www.npmjs.com/package/@agent-relay/sdk"><img alt="npm" src="https://img.shields.io/npm/v/@agent-relay/sdk"></a>
-  <a href="https://github.com/AgentWorkforce/relay/actions/workflows/test.yml"><img alt="Tests" src="https://img.shields.io/github/actions/workflow/status/AgentWorkforce/relay/test.yml?branch=main&label=tests"></a>
-  <a href="./LICENSE"><img alt="License" src="https://img.shields.io/badge/license-Apache--2.0-blue.svg"></a>
-</div>
+1. Install the agent-relay cli
+
+```
+curl -fsSL https://raw.githubusercontent.com/AgentWorkforce/relay/main/install.sh | bash
+
+```
+
+2. Install the skill
+
+```
+npx skills add https://github.com/agentworkforce/skills --skill orchestrating-agent-relay
+```
+
+3. Tell your agent to use it
+
+```
+use the orchestrating-agent-relay skill to spawn a claude and codex agent and [YOUR_TASK]
+```
 
+For single, well-scoped, one-shot tasks, subagents still win. Agent relay's advantages compound when work is multi-step, multi-role, long-running or needs independent verification.
 
-## Why Agent Relay
+## SDK
 
-- **Built for real-time coordination**: channels, messages, inboxes, reactions, and presence for agents that need to collaborate.
-- **Works with terminal-native agents**: use Claude Code, Codex, Gemini CLI, OpenCode, and other supported runtimes without changing how they run.
-- **SDK-first**: spawn agents programmatically, route work, wait for readiness, and manage lifecycles from TypeScript or Python.
-- **Useful from both code and tools**: wire Relay into apps, scripts, plugins, and local workflows.
+Use the Agent Relay SDK to spawn and control agents programmatically.
 
-## Install
+### Install
 
 **TypeScript / Node.js**
 
@@ -45,7 +67,7 @@ pip install agent-relay-sdk
 
 See the [Python SDK](./packages/sdk-py) for Python usage and adapters.
 
-## Quick example
+### Quick example
 
 ```typescript
 import { AgentRelay, Models } from '@agent-relay/sdk';
@@ -82,53 +104,17 @@ await relay.shutdown();
 
 Want more than a toy example? Start with:
 
-- [Introduction](./docs/introduction.md)
-- [CLI on the Relay](./docs/cli-on-the-relay.md)
-- [Examples](./examples/README.md)
-- [TypeScript SDK README](./packages/sdk/README.md)
-- [Python SDK README](./packages/sdk-py/README.md)
+- [Introduction](https://agentrelay.com/docs/introduction)
+- [TypeScript SDK README](https://agentrelay.com/docs/typescript-sdk)
+- [Python SDK README](https://agentrelay.com/docs/python-sdk)
 
-## What you can build
+### What you can build
 
 - Multi-agent coding flows with shared channels and worker handoffs
 - Agent inboxes for status updates, blockers, and review loops
 - Tooling that lets existing agents communicate without rewriting their runtime
 - Local or remote coordination patterns where multiple agents need shared context
 
-## Claude Code plugin
-
-Use Agent Relay directly inside Claude Code, no SDK required. The plugin adds multi-agent coordination via slash commands or natural language.
-
-```text
-/plugin marketplace add Agentworkforce/skills
-/plugin install claude-relay-plugin
-```
-
-Once installed, you can coordinate teams of agents with built-in skills:
-
-```text
-> /relay-team Refactor the auth module, split the middleware, update tests, and update docs
-> /relay-fanout Run linting fixes across all packages in the monorepo
-> /relay-pipeline Analyze the API logs, generate a summary report, then draft an email
-```
-
-Or just describe what you want in plain language:
-
-```text
-> Use relay fan-out to lint all packages in parallel
-> Split the migration into three relay workers, one for the schema, one for the API, and one for the frontend
-```
-
-See [docs/plugin-claude-code.md](./docs/plugin-claude-code.md) and the [plugin README](https://github.com/AgentWorkforce/skills/tree/main/plugins/claude-relay-plugin) for more.
-
-## Agent Relay CLI
-
-Install the CLI with:
-
-```bash
-curl -fsSL https://raw.githubusercontent.com/AgentWorkforce/relay/main/install.sh | bash
-```
-
 Then use Agent Relay to bring agents into a shared workspace and route work between them.
 
 ## Supported agents and runtimes
@@ -142,7 +128,7 @@ Agent Relay is designed for terminal-native agents and SDK-driven workflows. Thi
 
 The broader SDK and workflow surface also includes additional integrations in the codebase. See the package docs for details.
 
-## Development
+### Development
 
 If you want to work on the repo itself:
 
@@ -154,7 +140,6 @@ npm test
 
 Useful references:
 
-- [ARCHITECTURE.md](./ARCHITECTURE.md)
 - [CHANGELOG.md](./CHANGELOG.md)
 - [GitHub Issues](https://github.com/AgentWorkforce/relay/issues)
 
diff --git a/TRACEBACK_DESIGN.md b/TRACEBACK_DESIGN.md
deleted file mode 100644
index 19294a79b..000000000
--- a/TRACEBACK_DESIGN.md
+++ /dev/null
@@ -1,422 +0,0 @@
-# Verification Traceback Pattern — Design Document
-
-## Problem
-
-When a verification check fails and the runner retries a step, the retry prompt currently includes:
-
-1. The raw error message
-2. The last 2000 characters of the previous agent's output
-3. For custom verification: the command and its output
-
-This is a blunt instrument. The failing agent receives a wall of text and must self-diagnose what went wrong. For complex verification failures (e.g., `npx nango compile` producing 50 lines of TypeScript errors), the agent often wastes its retry attempt misinterpreting the error or fixing the wrong file.
-
-**Marcin's insight**: "It's a DAG, so technically no loops." The review-loop template (`builtin-templates/review-loop.yaml`) achieves review via a DAG topology — separate steps for implement, review, consolidate, address. But diagnostic traceback is fundamentally different: it must happen _within_ the retry loop, not as a separate DAG step.
-
-**Solution**: Spawn an ephemeral diagnostic agent inside the runner's retry flow. This agent analyzes the failure and produces targeted guidance that gets injected into the retry prompt — replacing the raw 2000-char truncation with intelligent analysis.
-
----
-
-## 1. New `VerificationCheck` Field: `diagnosticAgent`
-
-### Type Change
-
-```typescript
-// packages/sdk/src/workflows/types.ts
-export interface VerificationCheck {
-  type: 'output_contains' | 'exit_code' | 'file_exists' | 'custom';
-  value: string;
-  description?: string;
-  timeoutMs?: number;
-  /** Name of an agent defined in the workflow's agents list.
-   *  When set, and verification fails with retries remaining,
-   *  this agent is spawned to analyze the failure before retry. */
-  diagnosticAgent?: string;
-}
-```
-
-The field is optional. When omitted, existing retry behavior is preserved exactly.
-
-### Schema Change
-
-In `schema.json`, add to the `VerificationCheck` definition:
-
-```json
-"diagnosticAgent": {
-  "type": "string",
-  "description": "Agent name to spawn for failure diagnosis before retry"
-}
-```
-
-### Validation
-
-During preflight/dry-run, if `diagnosticAgent` is set:
-
-- The named agent **must** exist in the workflow's `agents` list
-- Warning if the step has `retries: 0` or no `retries` (diagnostic agent would never run)
-
----
-
-## 2. Runner Integration
-
-### Where It Hooks In
-
-The traceback logic lives in `executeAgentStep()` in `runner.ts`, specifically in the retry prompt construction block (currently lines ~4203-4219).
-
-Current flow:
-
-```
-attempt loop start
-  → resolve task with step output variables
-  → if attempt > 0: prepend [RETRY] context (raw error + last 2000 chars)
-  → spawn agent
-  → collect output
-  → run verification
-  → if verification fails: throw WorkflowCompletionError
-  → catch block: lastError = error, continue loop
-attempt loop end
-```
-
-New flow:
-
-```
-attempt loop start
-  → resolve task with step output variables
-  → if attempt > 0: prepend [RETRY] context (see below)
-  → spawn agent
-  → collect output
-  → run verification
-  → if verification fails AND diagnosticAgent is set AND retries remain:
-      a. spawn diagnostic agent (ephemeral, non-interactive)
-      b. collect diagnostic output
-      c. store diagnostic output for next iteration's retry prompt
-  → throw WorkflowCompletionError (unchanged)
-  → catch block: lastError = error, continue loop
-attempt loop end
-```
-
-### Diagnostic Agent Prompt
-
-When verification fails and `diagnosticAgent` is configured, the runner spawns the diagnostic agent with this prompt:
-
-```
-The following verification failed after step "<step-name>".
-
-Verification command: <check.value>
-Verification output:
-<verification error output>
-
-Step task was:
-<original resolved task (without retry prefix)>
-
-Step output (last 2000 chars):
-<agent output, truncated>
-
-Analyze what went wrong. Your response will be injected into the retry prompt
-for the original agent. Be specific about:
-- Which file(s) have issues
-- What the exact error is (line numbers, error codes)
-- What the agent should do differently on the next attempt
-
-Do NOT fix the code yourself — just diagnose.
-```
-
-### Modified Retry Prompt
-
-When diagnostic output is available, the retry prompt changes from:
-
-```
-[RETRY — Attempt 2/3]
-Previous attempt failed: <error>
-[VERIFICATION FAILED] Your code did not pass the verification check.
-Command: npx nango compile
-Output:
-<raw compiler output>
-
-Fix the issues above before proceeding.
-Previous output (last 2000 chars):
-<raw output>
----
-<original task>
-```
-
-To:
-
-```
-[RETRY — Attempt 2/3]
-Verification failed. A diagnostic agent analyzed the failure:
-
---- Diagnostic Analysis ---
-<diagnostic agent output>
---- End Analysis ---
-
-Original verification error:
-Command: npx nango compile
-Output (last 500 chars):
-<truncated raw output>
-
----
-<original task>
-```
-
-The raw verification output is kept but truncated more aggressively (500 chars instead of 2000) since the diagnostic analysis is the primary guidance.
-
-### Implementation Location
-
-New private method on `WorkflowRunner`:
-
-```typescript
-private async runDiagnosticAgent(
-  step: WorkflowStep,
-  verificationError: string,
-  agentOutput: string,
-  originalTask: string,
-  agentMap: Map<string, AgentDefinition>,
-  timeoutMs?: number
-): Promise<string | null>
-```
-
-Returns the diagnostic output, or `null` if:
-
-- The diagnostic agent is not configured
-- The diagnostic agent timed out
-- The diagnostic agent failed to spawn
-
-New instance field to store diagnostic output between retry iterations:
-
-```typescript
-private lastDiagnosticOutput = new Map<string, string>();
-```
-
----
-
-## 3. Builder API
-
-### Step Configuration
-
-```typescript
-const workflow = new WorkflowBuilder('nango-sync')
-  .agent('generator', { cli: 'claude', role: 'Code generator' })
-  .agent('reviewer', { cli: 'claude', role: 'Diagnostic reviewer', interactive: false })
-  .step('generate', {
-    agent: 'generator',
-    task: 'Implement the Nango sync integration for ...',
-    verification: {
-      type: 'custom',
-      value: 'cd nango-integrations && npx nango compile',
-      diagnosticAgent: 'reviewer',
-    },
-    retries: 2,
-  })
-  .build();
-```
-
-### YAML Configuration
-
-```yaml
-agents:
-  - name: generator
-    cli: claude
-    role: Code generator
-
-  - name: reviewer
-    cli: claude
-    role: Diagnostic reviewer
-    interactive: false
-    constraints:
-      maxTokens: 4000
-      timeoutMs: 60000
-
-workflows:
-  - name: nango-sync
-    steps:
-      - name: generate
-        agent: generator
-        task: |
-          Implement the Nango sync integration for ...
-        verification:
-          type: custom
-          value: cd nango-integrations && npx nango compile
-          diagnosticAgent: reviewer
-        retries: 2
-```
-
----
-
-## 4. Diagnostic Agent Lifecycle
-
-### Ephemeral Spawning
-
-The diagnostic agent:
-
-- Is defined in the workflow's `agents` list (same as any other agent)
-- Uses the same agent definition (CLI, model, permissions, cwd)
-- Is spawned **ephemerally** by the runner — it does NOT appear as a step in the DAG
-- Does NOT get registered with relay messaging (no PTY, no channel)
-- Runs as `interactive: false` regardless of the agent definition's setting
-- Is spawned via the same `executor.executeAgentStep()` path used for non-interactive workers
-
-### Not a DAG Step
-
-The diagnostic agent invocation:
-
-- Has no `WorkflowStepRow` in the database
-- Has no entry in `stepStates`
-- Does not appear in dry-run reports
-- Does not participate in barriers or coordination
-- Is invisible to the DAG topology
-
-It is an implementation detail of the retry mechanism, similar to how the runner already injects retry context strings.
-
-### Evidence Recording
-
-The diagnostic invocation IS recorded in the step's completion evidence:
-
-```typescript
-this.recordStepToolSideEffect(step.name, {
-  type: 'diagnostic_agent',
-  detail: `Diagnostic agent "${diagnosticAgentName}" analyzed verification failure (attempt ${attempt})`,
-  raw: {
-    diagnosticAgent: diagnosticAgentName,
-    attempt,
-    outputLength: diagnosticOutput.length,
-  },
-});
-```
-
-This requires adding `'diagnostic_agent'` to the `CompletionEvidenceToolSideEffectType` union.
-
----
-
-## 5. Timeout Handling
-
-### Sub-Timeout
-
-The diagnostic agent runs with a dedicated sub-timeout:
-
-| Source                                         | Timeout                                |
-| ---------------------------------------------- | -------------------------------------- |
-| Diagnostic agent's own `constraints.timeoutMs` | Used if set                            |
-| Default                                        | 60,000 ms (60 seconds)                 |
-| Step's remaining time                          | Capped to avoid exceeding step timeout |
-
-```typescript
-const diagnosticTimeout = Math.min(
-  diagnosticAgentDef.constraints?.timeoutMs ?? 60_000,
-  remainingStepTimeMs ?? Infinity
-);
-```
-
-### Fallback on Timeout
-
-If the diagnostic agent times out or errors:
-
-1. Log a warning: `[step-name] Diagnostic agent timed out, falling back to raw retry`
-2. Fall back to the existing retry behavior (raw error + 2000 chars)
-3. The retry still happens — diagnostic failure does NOT consume a retry attempt
-
----
-
-## 6. Budget Interaction
-
-### Token Accounting
-
-When budget enforcement is enabled (`swarm.tokenBudget`):
-
-- Diagnostic agent token usage counts toward the **workflow's total budget**
-- Diagnostic token usage is attributed to the step being retried
-- If the workflow budget is exhausted, the diagnostic agent is NOT spawned (fall back to raw retry)
-
-### Budget Check Before Spawning
-
-```typescript
-if (this.budgetTracker && !this.budgetTracker.canSpend(estimatedDiagnosticTokens)) {
-  this.log(`[${step.name}] Skipping diagnostic agent — budget exhausted`);
-  return null; // fall back to raw retry
-}
-```
-
-The `estimatedDiagnosticTokens` is a conservative estimate (default: 2000 tokens) to avoid spawning a diagnostic agent that would immediately be killed by budget enforcement.
-
----
-
-## 7. How This Differs from Existing Retry
-
-| Aspect         | Current Retry                      | Traceback Retry                                 |
-| -------------- | ---------------------------------- | ----------------------------------------------- |
-| Error context  | Raw error string                   | Diagnostic agent analysis                       |
-| Output context | Last 2000 chars (blind truncation) | Agent-analyzed output (targeted)                |
-| Root cause     | Agent must self-diagnose           | Diagnostic agent identifies root cause          |
-| Fix guidance   | None                               | Specific files, errors, and suggested approach  |
-| Cost           | Free (string ops)                  | 1 additional agent invocation per retry         |
-| Latency        | None                               | 10-60s per diagnostic invocation                |
-| Fallback       | N/A                                | Falls back to current behavior on timeout/error |
-
-### When to Use Traceback vs Plain Retry
-
-- **Plain retry** (no `diagnosticAgent`): Simple verification (output_contains, file_exists), or when the error message is self-explanatory
-- **Traceback**: Complex verification (compilation, test suites, linting) where the raw output needs interpretation
-
----
-
-## 8. Sequence Diagram
-
-```
-Step Attempt 1:
-  Runner → spawn generator agent
-  Generator → produces code
-  Runner → run verification (npx nango compile)
-  Verification → FAILS (compile errors)
-
-  Runner → diagnosticAgent is set, retries remain
-  Runner → spawn reviewer agent (ephemeral)
-    Prompt: "Verification failed. Here's the error output and
-             the agent's work. Diagnose what went wrong."
-  Reviewer → "The generator created fetchUsers.ts but imported
-              from 'nango' instead of '@nangohq/node'. Line 12
-              has a type error: UserResponse is not exported
-              from the schema file. The agent should fix the
-              import path and use the correct type name."
-  Runner → store diagnostic output
-
-Step Attempt 2:
-  Runner → spawn generator agent
-    Prompt: "[RETRY — Attempt 2/3]
-             Verification failed. Diagnostic analysis:
-             --- The generator created fetchUsers.ts but imported
-             from 'nango' instead of '@nangohq/node'. Line 12 ...
-             ---
-             Original task: Implement the Nango sync integration..."
-  Generator → fixes the specific issues identified
-  Runner → run verification (npx nango compile)
-  Verification → PASSES
-  Step → completed_verified
-```
-
----
-
-## 9. Edge Cases
-
-1. **Diagnostic agent is the same as the step agent**: Allowed. The diagnostic agent is a separate invocation with a diagnosis-specific prompt.
-
-2. **Multiple verification checks on a step**: Not currently supported (VerificationCheck is singular). If added later, diagnostic agent runs once for the first failing check.
-
-3. **Owner-supervised steps**: Diagnostic agent runs AFTER the owner/specialist flow but BEFORE the retry. It supplements, not replaces, the owner decision flow.
-
-4. **Non-custom verification with diagnosticAgent**: Supported but less useful. For `file_exists`, the diagnostic prompt would include "file X does not exist" — still potentially valuable for the diagnostic agent to suggest why.
-
-5. **Diagnostic agent itself fails verification**: N/A — the diagnostic agent has no verification check. Its raw output is used as-is.
-
----
-
-## 10. Implementation Checklist
-
-1. **types.ts**: Add `diagnosticAgent?: string` to `VerificationCheck` interface
-2. **types.ts**: Add `'diagnostic_agent'` to `CompletionEvidenceToolSideEffectType` union
-3. **schema.json**: Add `diagnosticAgent` to verification check schema
-4. **runner.ts**: Add `runDiagnosticAgent()` private method
-5. **runner.ts**: Add `lastDiagnosticOutput` map field
-6. **runner.ts**: Modify retry prompt construction in `executeAgentStep()` to use diagnostic output when available
-7. **runner.ts**: Call `runDiagnosticAgent()` when verification fails with retries remaining
-8. **builder.ts**: Allow `diagnosticAgent` in step verification config (pass-through, no builder changes needed beyond type)
-9. **Validation**: Add preflight check that `diagnosticAgent` references a valid agent
-10. **Tests**: Unit tests for diagnostic prompt construction, timeout fallback, budget skip
diff --git a/crates/broker/src/snippets.rs b/crates/broker/src/snippets.rs
index 1de484f55..9baaf92a3 100644
--- a/crates/broker/src/snippets.rs
+++ b/crates/broker/src/snippets.rs
@@ -11,35 +11,8 @@ use tokio::process::Command;
 
 const RELAYCAST_MCP_PACKAGE: &str = "@relaycast/mcp";
 
-const TARGET_FILES: [&str; 3] = ["AGENTS.md", "CLAUDE.md", "GEMINI.md"];
-const MARKER_START: &str = "<!-- prpm:snippet:start @agent-relay/agent-relay-snippet@1.2.0 -->";
-const MARKER_END: &str = "<!-- prpm:snippet:end @agent-relay/agent-relay-snippet@1.2.0 -->";
-const MARKER_START_PREFIX: &str = "<!-- prpm:snippet:start @agent-relay/agent-relay-snippet@";
-const MARKER_END_PREFIX: &str = "<!-- prpm:snippet:end @agent-relay/agent-relay-snippet@";
-const SNIPPET_BODY: &str = include_str!("../../../relay-snippets/agent-relay-snippet.md");
 const MCP_FILE: &str = ".mcp.json";
 const RELAYCAST_SERVER: &str = "relaycast";
-const MCP_SECTION: &str = r#"## MCP-First Workflow (Preferred)
-
-MCP is configured for this workspace/CLI. Use MCP tools first:
-
-- `mcp__relaycast__message_dm_send(to, text)` — Send a DM
-- `mcp__relaycast__message_post(channel, text)` — Post to channel
-- `mcp__relaycast__agent_add(name, cli, task)` — Spawn agent
-- `mcp__relaycast__message_inbox_check()` — Check messages
-- `mcp__relaycast__agent_list()` — List agents
-- `mcp__relaycast__agent_remove(name)` — Release agent
-
-Use MCP/skills only; do not use filesystem protocols.
-
-"#;
-
-#[derive(Debug, Clone, Copy, Default, PartialEq, Eq)]
-pub struct SnippetInstallReport {
-    pub created: usize,
-    pub updated: usize,
-    pub skipped: usize,
-}
 
 #[derive(Debug, Clone, Copy, Default, PartialEq, Eq)]
 pub struct McpInstallReport {
@@ -48,48 +21,6 @@ pub struct McpInstallReport {
     pub skipped: usize,
 }
 
-pub fn find_project_root(start: &Path) -> PathBuf {
-    let start_dir = if start.is_dir() {
-        start.to_path_buf()
-    } else {
-        start
-            .parent()
-            .map(Path::to_path_buf)
-            .unwrap_or_else(|| start.to_path_buf())
-    };
-
-    let mut cursor = start_dir.clone();
-    loop {
-        if cursor.join(".git").exists() {
-            return cursor;
-        }
-
-        if !cursor.pop() {
-            break;
-        }
-    }
-
-    start_dir
-}
-
-pub fn should_install_in(root: &Path) -> bool {
-    if !root.is_dir() {
-        return false;
-    }
-
-    if let Some(home) = dirs::home_dir() {
-        if root == home {
-            return false;
-        }
-    }
-
-    true
-}
-
-pub fn ensure_protocol_snippets(root: &Path) -> io::Result<SnippetInstallReport> {
-    ensure_protocol_snippets_inner(root, dirs::home_dir())
-}
-
 pub fn ensure_relaycast_mcp_config(
     root: &Path,
     relay_api_key: Option<&str>,
@@ -165,60 +96,6 @@ pub fn ensure_relaycast_mcp_config(
     Ok(report)
 }
 
-fn ensure_protocol_snippets_inner(
-    root: &Path,
-    home: Option<PathBuf>,
-) -> io::Result<SnippetInstallReport> {
-    let mut report = SnippetInstallReport::default();
-
-    for file_name in TARGET_FILES {
-        let path = root.join(file_name);
-        let block = snippet_block(root, file_name, home.as_deref());
-
-        if !path.exists() {
-            fs::write(&path, &block)?;
-            report.created += 1;
-            continue;
-        }
-
-        let existing = fs::read_to_string(&path)?;
-        if let Some(next) = replace_existing_block(&existing, &block) {
-            if next == existing {
-                report.skipped += 1;
-            } else {
-                fs::write(&path, next)?;
-                report.updated += 1;
-            }
-        } else {
-            let next = append_block(existing, &block);
-            fs::write(&path, next)?;
-            report.updated += 1;
-        }
-    }
-
-    Ok(report)
-}
-
-fn snippet_block(root: &Path, target_file: &str, home: Option<&Path>) -> String {
-    let mcp_first = mcp_configured_for_target(root, target_file, home);
-    let mut body = SNIPPET_BODY.trim_end().to_string();
-
-    if mcp_first && !body.contains("## MCP-First Workflow (Preferred)") {
-        if let Some(idx) = body.find("## Send a Message") {
-            body.insert_str(idx, MCP_SECTION);
-        } else {
-            body.push('\n');
-            body.push('\n');
-            body.push_str(MCP_SECTION.trim_end());
-        }
-    }
-
-    format!(
-        "{MARKER_START}\n{body}\n{MARKER_END}\n",
-        body = body.trim_end()
-    )
-}
-
 /// Build the full MCP config JSON string for the relaycast server.
 /// Suitable for passing to `--mcp-config` CLI flags.
 pub fn relaycast_mcp_config_json(
@@ -1115,103 +992,6 @@ fn write_pretty_json(path: &Path, value: &Value) -> io::Result<()> {
     fs::write(path, body)
 }
 
-fn replace_existing_block(existing: &str, desired_block: &str) -> Option<String> {
-    let ranges = find_snippet_ranges(existing)?;
-    let mut next = String::with_capacity(existing.len() + desired_block.len());
-    let mut cursor = 0usize;
-
-    for (idx, (start, end)) in ranges.iter().enumerate() {
-        if idx == 0 {
-            next.push_str(&existing[..*start]);
-            next.push_str(desired_block);
-        } else {
-            next.push_str(&existing[cursor..*start]);
-        }
-        cursor = *end;
-    }
-
-    next.push_str(&existing[cursor..]);
-    Some(next)
-}
-
-fn find_snippet_ranges(existing: &str) -> Option<Vec<(usize, usize)>> {
-    let mut ranges = Vec::new();
-    let mut offset = 0usize;
-
-    while let Some(start_rel) = existing[offset..].find(MARKER_START_PREFIX) {
-        let start = offset + start_rel;
-        let end_start_rel = existing[start..].find(MARKER_END_PREFIX)?;
-        let end_start = start + end_start_rel;
-        let end = existing[end_start..]
-            .find('\n')
-            .map(|idx| end_start + idx + 1)
-            .unwrap_or(existing.len());
-        ranges.push((start, end));
-        offset = end;
-    }
-
-    if ranges.is_empty() {
-        None
-    } else {
-        Some(ranges)
-    }
-}
-
-fn append_block(mut existing: String, block: &str) -> String {
-    if !existing.ends_with('\n') {
-        existing.push('\n');
-    }
-    if !existing.trim().is_empty() {
-        existing.push('\n');
-    }
-    existing.push_str(block);
-    existing
-}
-
-fn mcp_configured_for_target(root: &Path, target_file: &str, home: Option<&Path>) -> bool {
-    candidate_mcp_paths(root, target_file, home)
-        .iter()
-        .filter_map(|path| fs::read_to_string(path).ok())
-        .any(|contents| {
-            let lower = contents.to_ascii_lowercase();
-            (lower.contains("agent-relay") || lower.contains("relaycast")) && lower.contains("mcp")
-        })
-}
-
-fn candidate_mcp_paths(root: &Path, target_file: &str, home: Option<&Path>) -> Vec<PathBuf> {
-    let mut paths = vec![root.join(".mcp.json")];
-
-    let home = match home.map(Path::to_path_buf).or_else(dirs::home_dir) {
-        Some(h) => h,
-        None => return paths,
-    };
-
-    if target_file == "CLAUDE.md" || target_file == "AGENTS.md" {
-        paths.push(home.join(".claude").join("settings.json"));
-        paths.push(
-            home.join("Library")
-                .join("Application Support")
-                .join("Claude")
-                .join("claude_desktop_config.json"),
-        );
-        paths.push(
-            home.join(".config")
-                .join("claude")
-                .join("claude_desktop_config.json"),
-        );
-    }
-
-    if target_file == "GEMINI.md" || target_file == "AGENTS.md" {
-        paths.push(home.join(".gemini").join("settings.json"));
-    }
-
-    if target_file == "AGENTS.md" {
-        paths.push(home.join(".codex").join("config.toml"));
-    }
-
-    paths
-}
-
 #[cfg(test)]
 mod tests {
     use std::fs;
@@ -1219,44 +999,7 @@ mod tests {
     use serde_json::{Map, Value};
     use tempfile::tempdir;
 
-    use super::{
-        ensure_protocol_snippets_inner, ensure_relaycast_mcp_config, find_project_root,
-        should_install_in, snippet_block, MARKER_START, RELAYCAST_MCP_PACKAGE,
-    };
-
-    #[test]
-    fn finds_git_ancestor_as_project_root() {
-        let temp = tempdir().expect("tempdir");
-        let root = temp.path();
-        fs::create_dir(root.join(".git")).expect("create .git");
-        fs::create_dir_all(root.join("a/b/c")).expect("create nested");
-
-        let resolved = find_project_root(&root.join("a/b/c"));
-        assert_eq!(resolved, root);
-    }
-
-    #[test]
-    fn returns_start_when_git_not_found() {
-        let temp = tempdir().expect("tempdir");
-        let start = temp.path().join("nested");
-        fs::create_dir_all(&start).expect("create nested");
-        let resolved = find_project_root(&start);
-        assert_eq!(resolved, start);
-    }
-
-    #[test]
-    fn should_install_requires_directory() {
-        let temp = tempdir().expect("tempdir");
-        let file_path = temp.path().join("file.txt");
-        fs::write(&file_path, "x").expect("write file");
-        assert!(!should_install_in(&file_path));
-    }
-
-    /// Helper: runs ensure_protocol_snippets with home isolated to the tempdir
-    /// so real user configs (e.g. ~/.claude/settings.json) don't leak in.
-    fn install_isolated(root: &std::path::Path) -> std::io::Result<super::SnippetInstallReport> {
-        ensure_protocol_snippets_inner(root, Some(root.to_path_buf()))
-    }
+    use super::ensure_relaycast_mcp_config;
 
     fn assert_is_reaycast_mcp_package(value: Option<&str>) {
         let package = value.expect("expected relaycast mcp package string");
@@ -1266,152 +1009,6 @@ mod tests {
         );
     }
 
-    #[test]
-    fn installs_to_all_targets_and_is_idempotent() {
-        let temp = tempdir().expect("tempdir");
-        let root = temp.path();
-
-        let first = install_isolated(root).expect("first install");
-        assert_eq!(first.created, 3);
-        assert_eq!(first.updated, 0);
-        assert_eq!(first.skipped, 0);
-
-        let second = install_isolated(root).expect("second install");
-        assert_eq!(second.created, 0);
-        assert_eq!(second.updated, 0);
-        assert_eq!(second.skipped, 3);
-    }
-
-    #[test]
-    fn installs_without_mcp_section_when_config_missing() {
-        let temp = tempdir().expect("tempdir");
-        let root = temp.path();
-
-        install_isolated(root).expect("install snippets");
-        let content = fs::read_to_string(root.join("AGENTS.md")).expect("read AGENTS.md");
-        assert!(!content.contains("## MCP-First Workflow (Preferred)"));
-    }
-
-    #[test]
-    fn keeps_single_mcp_section_when_project_mcp_config_exists() {
-        let temp = tempdir().expect("tempdir");
-        let root = temp.path();
-
-        fs::write(
-            root.join(".mcp.json"),
-            serde_json::json!({
-                "mcpServers": {
-                    "relaycast": {
-                        "command": "npx",
-                        "args": ["-y", RELAYCAST_MCP_PACKAGE]
-                    }
-                }
-            })
-            .to_string(),
-        )
-        .expect("write .mcp.json");
-
-        install_isolated(root).expect("install snippets");
-        let content = fs::read_to_string(root.join("AGENTS.md")).expect("read AGENTS.md");
-        let occurrences = content.matches("## MCP-First Workflow (Preferred)").count();
-        assert_eq!(occurrences, 1);
-    }
-
-    #[test]
-    fn refreshes_existing_block_when_mode_changes() {
-        let temp = tempdir().expect("tempdir");
-        let root = temp.path();
-
-        // Install without MCP config (isolated home = root, no MCP files in home)
-        let old = snippet_block(root, "AGENTS.md", Some(root));
-        fs::write(root.join("AGENTS.md"), old).expect("write old snippet");
-        fs::write(
-            root.join("CLAUDE.md"),
-            snippet_block(root, "CLAUDE.md", Some(root)),
-        )
-        .expect("write old snippet");
-        fs::write(
-            root.join("GEMINI.md"),
-            snippet_block(root, "GEMINI.md", Some(root)),
-        )
-        .expect("write old snippet");
-
-        // Now add MCP config
-        fs::write(
-            root.join(".mcp.json"),
-            serde_json::json!({
-                "mcpServers": {
-                    "relaycast": {
-                        "command": "npx",
-                        "args": ["-y", RELAYCAST_MCP_PACKAGE]
-                    }
-                }
-            })
-            .to_string(),
-        )
-        .expect("write .mcp.json");
-
-        let report = install_isolated(root).expect("refresh snippets");
-        assert_eq!(report.created, 0);
-        assert_eq!(report.updated, 3);
-        assert_eq!(report.skipped, 0);
-
-        let content = fs::read_to_string(root.join("AGENTS.md")).expect("read updated AGENTS.md");
-        assert!(content.contains("## MCP-First Workflow (Preferred)"));
-    }
-
-    #[test]
-    fn appends_to_existing_file_without_marker() {
-        let temp = tempdir().expect("tempdir");
-        let root = temp.path();
-        fs::write(root.join("AGENTS.md"), "# Existing\n").expect("write existing");
-
-        let report = install_isolated(root).expect("install snippets");
-        assert_eq!(report.created, 2);
-        assert_eq!(report.updated, 1);
-        assert_eq!(report.skipped, 0);
-
-        let content = fs::read_to_string(root.join("AGENTS.md")).expect("read agents after update");
-        assert!(content.contains("# Existing"));
-        assert!(content.contains(MARKER_START));
-    }
-
-    #[test]
-    fn upgrades_legacy_snippet_block_and_removes_file_protocol_text() {
-        let temp = tempdir().expect("tempdir");
-        let root = temp.path();
-        let legacy = r#"<!-- prpm:snippet:start @agent-relay/agent-relay-snippet@1.1.6 -->
-# Agent Relay Protocol
-
-Use AGENT_RELAY_OUTBOX and ->relay-file:spawn.
-<!-- prpm:snippet:end @agent-relay/agent-relay-snippet@1.1.6 -->
-"#;
-        fs::write(root.join("AGENTS.md"), legacy).expect("write legacy snippet");
-        fs::write(root.join("CLAUDE.md"), legacy).expect("write legacy snippet");
-        fs::write(root.join("GEMINI.md"), legacy).expect("write legacy snippet");
-        fs::write(
-            root.join(".mcp.json"),
-            serde_json::json!({
-                "mcpServers": {
-                    "relaycast": {
-                        "command": "npx",
-                        "args": [RELAYCAST_MCP_PACKAGE]
-                    }
-                }
-            })
-            .to_string(),
-        )
-        .expect("write .mcp.json");
-
-        let report = install_isolated(root).expect("upgrade snippets");
-        assert_eq!(report.updated, 3);
-
-        let content = fs::read_to_string(root.join("AGENTS.md")).expect("read AGENTS.md");
-        assert!(content.contains(MARKER_START));
-        assert!(!content.contains("Use AGENT_RELAY_OUTBOX and ->relay-file:spawn."));
-        assert!(content.contains("Use MCP/skills only; do not use filesystem protocols."));
-    }
-
     #[test]
     fn creates_reaycast_mcp_config_when_missing() {
         let temp = tempdir().expect("tempdir");
diff --git a/docker-compose.browser.yml b/docker-compose.browser.yml
deleted file mode 100644
index 4c81e293b..000000000
--- a/docker-compose.browser.yml
+++ /dev/null
@@ -1,78 +0,0 @@
-# Agent Relay - Browser Testing Workspace
-#
-# Extends docker-compose.dev.yml with browser testing capabilities.
-#
-# Usage:
-#   docker compose -f docker-compose.dev.yml -f docker-compose.browser.yml up
-#
-# Access:
-#   - Dashboard: http://localhost:3888
-#   - VNC (web): http://localhost:6080/vnc.html
-#   - VNC (native): vnc://localhost:5900
-
-version: '3.8'
-
-services:
-  # Browser-enabled workspace with full testing capabilities
-  workspace-browser:
-    build:
-      context: .
-      dockerfile: deploy/workspace/Dockerfile.browser
-    ports:
-      - "3888:3888"    # Dashboard/API
-      - "3889:3889"    # WebSocket
-      - "5900:5900"    # VNC direct
-      - "6080:6080"    # noVNC web interface
-    environment:
-      WORKSPACE_ID: browser-workspace
-      SUPERVISOR_ENABLED: "true"
-      MAX_AGENTS: "10"
-      # Browser display settings
-      DISPLAY: ":99"
-      SCREEN_WIDTH: "1920"
-      SCREEN_HEIGHT: "1080"
-      SCREEN_DEPTH: "24"
-      # VNC settings
-      VNC_ENABLED: "true"
-      VNC_PORT: "5900"
-      NOVNC_ENABLED: "true"
-      NOVNC_PORT: "6080"
-    volumes:
-      # Persistent data
-      - workspace_browser_data:/data
-      # Mount repos
-      - ./:/workspace/relay:ro
-      # Docker socket for spawning containers
-      - /var/run/docker.sock:/var/run/docker.sock
-    # Required for some browser operations
-    shm_size: '2gb'
-    # Security options for browser sandboxing
-    security_opt:
-      - seccomp:unconfined
-    depends_on:
-      - cloud
-
-  # Alternative: Rootless Docker-in-Docker workspace
-  # Uses sysbox runtime for secure nested containers
-  workspace-dind:
-    build:
-      context: .
-      dockerfile: deploy/workspace/Dockerfile.browser
-    runtime: sysbox-runc  # Requires sysbox installed on host
-    ports:
-      - "3898:3888"
-      - "6090:6080"
-    environment:
-      WORKSPACE_ID: dind-workspace
-      SUPERVISOR_ENABLED: "true"
-      MAX_AGENTS: "10"
-      # DinD mode - Docker daemon runs inside container
-      DOCKER_HOST: "unix:///var/run/docker.sock"
-    volumes:
-      - workspace_dind_data:/data
-    profiles:
-      - dind  # Only start with: --profile dind
-
-volumes:
-  workspace_browser_data:
-  workspace_dind_data:
diff --git a/docker-compose.test.yml b/docker-compose.test.yml
deleted file mode 100644
index a4c990c53..000000000
--- a/docker-compose.test.yml
+++ /dev/null
@@ -1,202 +0,0 @@
-# Agent Relay Cloud - Full QA Test Environment
-# Run with: docker compose -f docker-compose.test.yml up --build
-#
-# This environment simulates the full cloud stack with:
-# - PostgreSQL database
-# - Redis for sessions/pub-sub
-# - Cloud API server
-# - Simulated daemon(s) that report metrics
-# - Test runner for integration tests
-#
-# Usage:
-#   # Start the full stack
-#   docker compose -f docker-compose.test.yml up -d
-#
-#   # Run integration tests
-#   docker compose -f docker-compose.test.yml run test-runner
-#
-#   # View logs
-#   docker compose -f docker-compose.test.yml logs -f
-#
-#   # Tear down
-#   docker compose -f docker-compose.test.yml down -v
-
-version: '3.8'
-
-services:
-  # PostgreSQL database
-  postgres:
-    image: postgres:16-alpine
-    environment:
-      POSTGRES_USER: agent_relay
-      POSTGRES_PASSWORD: test_password
-      POSTGRES_DB: agent_relay_test
-    ports:
-      - "5433:5432"
-    volumes:
-      - postgres_test_data:/var/lib/postgresql/data
-    healthcheck:
-      test: ["CMD-SHELL", "pg_isready -U agent_relay"]
-      interval: 2s
-      timeout: 5s
-      retries: 10
-
-  # Redis for sessions and pub/sub
-  redis:
-    image: redis:7-alpine
-    ports:
-      - "6380:6379"
-    healthcheck:
-      test: ["CMD", "redis-cli", "ping"]
-      interval: 2s
-      timeout: 5s
-      retries: 10
-
-  # Cloud API server
-  cloud:
-    build:
-      context: .
-      dockerfile: Dockerfile
-    ports:
-      - "3100:3000"
-    environment:
-      NODE_ENV: test
-      PORT: 3000
-      PUBLIC_URL: http://localhost:3100
-
-      # Database
-      DATABASE_URL: postgres://agent_relay:test_password@postgres:5432/agent_relay_test
-      REDIS_URL: redis://redis:6379
-
-      # Session
-      SESSION_SECRET: test-session-secret
-
-      # Vault master key (test only) - "test-vault-key-32-bytes-testing!" = 32 bytes
-      VAULT_MASTER_KEY: dGVzdC12YXVsdC1rZXktMzItYnl0ZXMtdGVzdGluZyE=
-
-      # Disable external services in test mode
-      STRIPE_SECRET_KEY: sk_test_placeholder
-      STRIPE_PUBLISHABLE_KEY: pk_test_placeholder
-      STRIPE_WEBHOOK_SECRET: whsec_test
-
-      # Compute provider (docker for local)
-      COMPUTE_PROVIDER: docker
-
-      # Enable memory monitoring
-      RELAY_MEMORY_MONITORING: "true"
-      RELAY_CLOUD_ENABLED: "true"
-    depends_on:
-      postgres:
-        condition: service_healthy
-      redis:
-        condition: service_healthy
-    volumes:
-      - /var/run/docker.sock:/var/run/docker.sock
-    healthcheck:
-      test: ["CMD", "curl", "-f", "http://localhost:3000/health"]
-      interval: 5s
-      timeout: 5s
-      retries: 10
-
-  # Simulated daemon 1 - Reports metrics to cloud
-  daemon-simulator-1:
-    build:
-      context: .
-      dockerfile: test/cloud/Dockerfile.daemon-simulator
-    environment:
-      DAEMON_NAME: test-daemon-1
-      CLOUD_API_URL: http://cloud:3000
-      SIMULATOR_MODE: "true"
-      AGENT_COUNT: "3"
-      REPORT_INTERVAL_MS: "5000"
-      # Simulate some memory issues
-      SIMULATE_MEMORY_GROWTH: "true"
-      SIMULATE_CRASH: "false"
-    depends_on:
-      cloud:
-        condition: service_healthy
-    restart: on-failure
-
-  # Simulated daemon 2 - Normal operation
-  daemon-simulator-2:
-    build:
-      context: .
-      dockerfile: test/cloud/Dockerfile.daemon-simulator
-    environment:
-      DAEMON_NAME: test-daemon-2
-      CLOUD_API_URL: http://cloud:3000
-      SIMULATOR_MODE: "true"
-      AGENT_COUNT: "2"
-      REPORT_INTERVAL_MS: "5000"
-      SIMULATE_MEMORY_GROWTH: "false"
-      SIMULATE_CRASH: "false"
-    depends_on:
-      cloud:
-        condition: service_healthy
-    restart: on-failure
-
-  # Simulated daemon 3 - Crash simulation
-  daemon-simulator-crash:
-    build:
-      context: .
-      dockerfile: test/cloud/Dockerfile.daemon-simulator
-    environment:
-      DAEMON_NAME: test-daemon-crash
-      CLOUD_API_URL: http://cloud:3000
-      SIMULATOR_MODE: "true"
-      AGENT_COUNT: "1"
-      REPORT_INTERVAL_MS: "3000"
-      SIMULATE_MEMORY_GROWTH: "false"
-      SIMULATE_CRASH: "true"
-      CRASH_AFTER_SECONDS: "30"
-    depends_on:
-      cloud:
-        condition: service_healthy
-    profiles:
-      - crash-test
-
-  # Integration test runner
-  test-runner:
-    build:
-      context: .
-      dockerfile: test/cloud/Dockerfile.test-runner
-    environment:
-      CLOUD_API_URL: http://cloud:3000
-      DATABASE_URL: postgres://agent_relay:test_password@postgres:5432/agent_relay_test
-      REDIS_URL: redis://redis:6379
-      TEST_TIMEOUT: "60000"
-    depends_on:
-      cloud:
-        condition: service_healthy
-      daemon-simulator-1:
-        condition: service_started
-      daemon-simulator-2:
-        condition: service_started
-    volumes:
-      - ./test:/app/test:ro
-      - ./src:/app/src:ro
-      - test_results:/app/test-results
-    profiles:
-      - test
-
-  # WebSocket test client
-  ws-test-client:
-    build:
-      context: .
-      dockerfile: test/cloud/Dockerfile.ws-client
-    environment:
-      CLOUD_WS_URL: ws://cloud:3000/ws
-      TEST_DURATION_SECONDS: "60"
-    depends_on:
-      cloud:
-        condition: service_healthy
-    profiles:
-      - ws-test
-
-volumes:
-  postgres_test_data:
-  test_results:
-
-networks:
-  default:
-    name: agent-relay-test
diff --git a/docs/authentication.md b/docs/authentication.md
deleted file mode 100644
index 7925c91b4..000000000
--- a/docs/authentication.md
+++ /dev/null
@@ -1,3 +0,0 @@
-# Authentication
-
-See [the CLI reference](reference-cli.md) for current authentication commands and provider login flows.
diff --git a/docs/cli-cloud-commands.md b/docs/cli-cloud-commands.md
deleted file mode 100644
index 2b0b9df67..000000000
--- a/docs/cli-cloud-commands.md
+++ /dev/null
@@ -1,3 +0,0 @@
-# Cloud Commands
-
-See [the CLI reference](reference-cli.md) for current `agent-relay cloud` commands and flags.
diff --git a/docs/cli-messaging.md b/docs/cli-messaging.md
deleted file mode 100644
index 091619802..000000000
--- a/docs/cli-messaging.md
+++ /dev/null
@@ -1,77 +0,0 @@
-Once the broker is up, the CLI can act as a lightweight operator console for human-to-agent messages and recent conversation history.
-
-## Send a message
-
-```bash
-agent-relay send reviewer "Please summarize the riskiest changes first."
-```
-
-The target argument accepts:
-
-- an agent name such as `reviewer`
-- a channel such as `#general`
-- `*` for broadcast
-
-Optional flags:
-
-- `--from <name>` sets the sender identity. Defaults to `$AGENT_RELAY_ORCHESTRATOR_NAME` or `orchestrator`.
-  Workers' replies are addressed to this name, so use a stable value you can read with `agent-relay replies <worker>`.
-- `--thread <id>` keeps follow-ups grouped under an existing thread.
-
-## Read recent history
-
-```bash
-agent-relay history --to '#general' --since 30m
-```
-
-Useful filters:
-
-- `--from <agent>` keeps only one sender.
-- `--to <agent-or-channel>` narrows to a target. When `<agent>` is not a channel, the command prints messages in chronological order with no preview truncation; pair with `--from <sender>` to filter by sender.
-  For example, `agent-relay history --to Worker2 --from Worker2` is equivalent to `agent-relay replies Worker2` for the non-`--unread` case.
-- `--since <time>` accepts values like `30m`, `1h`, or an ISO date.
-- `--json` emits structured output for scripts. Each DM record carries a `direction` field (`inbound` or `outbound`) relative to the reader identity.
-
-## Read replies from a worker
-
-```bash
-agent-relay replies Worker2
-```
-
-Shows messages received from `<agent>` in chronological order (oldest first, newest at the bottom), full text, with sender attribution. Inbound-only: it never echoes the orchestrator's own outbound DMs.
-
-Useful filters:
-
-- `-n, --limit <count>` caps the number of messages (default `50`).
-- `--since <time>` accepts values like `30s`, `5m`, `1h`, or an ISO date.
-- `--unread` shows only unread messages and does not mark them read.
-- `--mark-read` marks the printed messages as read after printing.
-- `--as <name>` reads as a specific orchestrator identity (default `$AGENT_RELAY_ORCHESTRATOR_NAME` or `orchestrator`).
-- `--full` disables truncation; text is already printed in full, so this is currently a forward-compatible no-op.
-- `--json` emits structured output; each record carries a `direction` field (`inbound` or `outbound`) relative to the reader identity.
-
-Exit code is `0` whether messages were printed or none were found; only connection or auth failures return non-zero.
-
-## Check the inbox
-
-```bash
-agent-relay inbox
-```
-
-`inbox` summarizes unread channels, mentions, and DMs. For DMs, the text renderer prints up to three most recent unread messages per conversation with full text and a `<sender> -> <reader>` header.
-If a conversation has more than three unread messages, a footer line points at `agent-relay replies <agent> --unread` for the full list. Add `--json` if another tool will parse the result; the JSON shape is unchanged for existing callers and additionally carries a `direction` field on each unread DM `last_message`.
-
-## Practical pattern
-
-```bash
-agent-relay send planner "Create the plan, then hand implementation to coder."
-agent-relay replies planner --since 15m
-agent-relay inbox
-```
-
-## See also
-
-- [Agent management](cli-agent-management.md) - Spawn agents before trying to message them.
-- [Sending messages](sending-messages.md) - SDK patterns for the same message flow.
-- [Channels](channels.md) - Design shared communication spaces.
-- [DMs](dms.md) - One-to-one coordination patterns.
diff --git a/docs/cli-on-the-relay.md b/docs/cli-on-the-relay.md
deleted file mode 100644
index 8650167f4..000000000
--- a/docs/cli-on-the-relay.md
+++ /dev/null
@@ -1,79 +0,0 @@
-# On the relay
-
-Launch an agent into the sandboxed relay environment, preview permissions, and shut the services down.
-
-`agent-relay on` is the CLI entry point for running an agent inside the relay sandbox, with mounted services and permission-aware workspace access.
-
-## Launch an agent
-
-```bash
-agent-relay on codex --agent reviewer -- --model gpt-5.4
-```
-
-Common options:
-
-- `--agent <name>` sets the relay identity name.
-- `--workspace <id>` joins an existing relay workspace.
-- `--port-auth <port>` overrides the Relayauth port.
-- `--port-file <port>` overrides the Relayfile port.
-- any extra args after `--` are passed through to the underlying CLI.
-
-## Preview or diagnose the environment
-
-```bash
-agent-relay on --scan
-agent-relay on --doctor
-```
-
-- `--scan` previews what the agent will be able to see before launch.
-- `--doctor` checks prerequisites and exits without starting a session.
-
-## Stop relay services
-
-```bash
-agent-relay off
-```
-
-Use `off` when you are done with the mounted relay environment and want a clean shutdown.
-
-## File visibility with dotfiles
-
-Place `.agentignore` and `.agentreadonly` files in the project root to control what the agent sees. Both use gitignore-style glob syntax (one pattern per line, `#` for comments). `.agentignore` hides files entirely; `.agentreadonly` makes them visible but not writable.
-
-```text
-# .agentignore
-.env*
-secrets/**
-```
-
-```text
-# .agentreadonly
-docs/**
-README.md
-```
-
-For per-agent rules, prefix with the agent name: `.reviewer.agentignore`, `.writer.agentreadonly`.
-
-Dotfiles are loaded automatically and applied before YAML-level permissions. Use `--scan` to preview what the agent will see. See [Permissions](permissions.md) for full details.
-
-> **Note:** `.agentignore` does **not** inherit from `.gitignore`. The relay automatically skips `.git`, `node_modules`, and `.relay` regardless of your dotfiles.
-
-## Isolation model
-
-`agent-relay on` copies your project into a mount directory and sets the agent's working directory to it. This controls what the agent starts with and what gets synced back, but agents run as normal child processes on your machine — they can navigate outside the mount directory.
-
-For true sandboxed execution, use [cloud mode](cli-cloud-commands.md). Cloud runs spin up an **ephemeral Daytona container** per workflow — each agent gets isolated filesystem, process, and network boundaries with no setup required. Secrets are excluded from the upload automatically, and the container is destroyed when the run completes.
-
-|            | Local (`on`)           | Cloud                      |
-| ---------- | ---------------------- | -------------------------- |
-| Filesystem | Copy-based (escapable) | Container-isolated         |
-| Process    | Bare child process     | Container process          |
-| Network    | Unrestricted           | Container network policies |
-| Setup      | None                   | None                       |
-
-## See also
-
-- [CLI reference](reference-cli.md) — Full map of the CLI command surface.
-- [Cloud commands](cli-cloud-commands.md) — Run workflows remotely instead of entering the sandbox yourself.
-- [Authentication](authentication.md) — Understand the auth service used by relay-aware environments.
-- [File sharing](file-sharing.md) — Shared filesystem concepts used by the relay environment.
diff --git a/docs/cli-overview.md b/docs/cli-overview.md
deleted file mode 100644
index 07ad367b1..000000000
--- a/docs/cli-overview.md
+++ /dev/null
@@ -1,32 +0,0 @@
-The `agent-relay` CLI is the operational layer for local broker management, agent spawning, messaging, workflow execution, and cloud runs.
-
-## Install
-
-```bash
-npm install -g agent-relay
-agent-relay --help
-agent-relay --version
-```
-
-## Main command groups
-
-- Broker lifecycle: `up`, `status`, `down`, `update`, `uninstall`
-- Agent management: `spawn`, `who`, `release`, `set-model`, `agents:logs`, `view`
-- Messaging: `send`, `history`, `replies`, `inbox`
-- Workflows: `run`, `workflows list`
-- Cloud: `cloud login`, `cloud connect`, `cloud run`, `cloud status`, `cloud logs`, `cloud sync`
-- Sandbox entry: `on`, `off`
-
-## Typical local session
-
-```bash
-agent-relay up
-agent-relay spawn reviewer claude "Review the latest auth changes"
-agent-relay send reviewer "Start with the middleware and summarize risks."
-agent-relay who
-```
-
-## Other useful commands
-
-- `agent-relay telemetry status` shows whether anonymous telemetry is enabled.
-- `agent-relay telemetry disable` turns telemetry off for the local machine.
diff --git a/docs/dms.md b/docs/dms.md
deleted file mode 100644
index ef648db26..000000000
--- a/docs/dms.md
+++ /dev/null
@@ -1,54 +0,0 @@
-DMs are the cleanest way to assign work, request a review, or ask for a status update without broadcasting everything to the whole team.
-
-## Orchestrate mode
-
-```typescript
-await planner.sendMessage({
-  to: 'Reviewer',
-  text: 'Please review src/auth.ts and reply with the highest-risk issue first.',
-});
-```
-
-```python
-await planner.send_message(
-    to="Reviewer",
-    text="Please review src/auth.ts and reply with the highest-risk issue first.",
-)
-```
-
-## Communicate mode
-
-```python
-from agent_relay.communicate import Relay
-
-relay = Relay("MyAgent")
-await relay.send("Reviewer", "Please check the migration plan.")
-```
-
-## Reading replies from the CLI
-
-When an orchestrator spawns a worker and DMs it a task, read the worker's reply with:
-
-```bash
-agent-relay replies Worker2
-```
-
-This prints inbound-only messages with full text and sender attribution.
-See [Messaging](cli-messaging.md#read-replies-from-a-worker) for filters (`--since`, `--unread`, `--mark-read`, `--json`).
-
-## Good DM use cases
-
-- Handing a concrete task from one worker to another
-- Review requests with a specific file or artifact
-- Quiet side conversations that do not belong in a shared channel
-
-## Threaded follow-ups
-
-When a conversation needs to stay grouped, include `threadId` in TypeScript or `thread_id` in Python on follow-up messages.
-
-## See also
-
-- [Sending messages](sending-messages.md) - Broader message patterns across Relay.
-- [Channels](channels.md) - Shared coordination surfaces for larger teams.
-- [Quickstart](quickstart.md) - End-to-end spawn and DM example.
-- [Communicate Mode](communicate.md) - DM APIs for existing framework agents.
diff --git a/docs/file-sharing.md b/docs/file-sharing.md
deleted file mode 100644
index 9551119b2..000000000
--- a/docs/file-sharing.md
+++ /dev/null
@@ -1,3 +0,0 @@
-# File Sharing
-
-See [CLI on the Relay](cli-on-the-relay.md) for the local relay filesystem model.
diff --git a/docs/introduction.md b/docs/introduction.md
deleted file mode 100644
index 49d4b1983..000000000
--- a/docs/introduction.md
+++ /dev/null
@@ -1,113 +0,0 @@
-
-The Agent Relay SDK has two modes:
-
-- **Orchestrate** — Spawn and manage AI agents (Claude, Codex, Gemini, OpenCode) from code. Send messages, listen for responses, and shut them down when done.
-- **Communicate** — Put an existing framework agent "on the relay" with a single `on_relay()` / `onRelay()` call. Works with AI SDK, OpenAI Agents, Claude Agent SDK, Google ADK, Pi, Agno, Swarms, and CrewAI.
-
-```bash TypeScript
-npm install @agent-relay/sdk
-```
-
-```bash Python
-pip install agent-relay-sdk
-```
-
-## Two Modes
-
-### Orchestrate Mode
-
-Spawn and control agents from your code:
-
-```typescript TypeScript
-import { AgentRelayClient } from '@agent-relay/sdk';
-const client = await AgentRelayClient.spawn();
-const agent = await client.spawnPty({ name: 'reviewer', cli: 'claude', task: 'Review the PR' });
-```
-
-```python Python
-from agent_relay import workflow
-wf = workflow("review")
-wf.agent("reviewer", cli="claude")
-wf.step("review", agent="reviewer", task="Review the PR")
-wf.build()
-```
-
-### Communicate Mode
-
-Connect any framework agent to Relaycast in 3 lines:
-
-```python Python
-from agent_relay.communicate import Relay, on_relay
-relay = Relay("MyAgent")
-agent = on_relay(my_framework_agent, relay)
-```
-
-```typescript TypeScript
-import { Relay } from '@agent-relay/sdk/communicate';
-import { onRelay } from '@agent-relay/sdk/communicate/adapters/pi';
-const config = onRelay('MyAgent', piConfig, new Relay('MyAgent'));
-```
-
-## What You Can Do
-
-<CardGroup cols={2}>
-  <Card title="Spawn Agents" icon="users">
-    Programmatically create Claude, Codex, Gemini, or OpenCode agents with a specific model and task.
-  
-  <Card title="Send Messages" icon="messages">
-    Route messages between agents — direct, broadcast, or channel-based.
-  
-  <Card title="Connect Frameworks" icon="plug">
-    Put OpenAI Agents, Claude SDK, Google ADK, Pi, Agno, Swarms, or CrewAI agents on the relay.
-  
-  <Card title="Multi-Provider" icon="shuffle">
-    Mix Claude, Codex, Gemini, and OpenCode agents in a single workflow, each using their strengths.
-  
-
-## Claude Code Plugin
-
-Use Agent Relay directly inside Claude Code — no SDK required. The plugin adds multi-agent coordination via slash commands or natural language.
-
-```bash
-/plugin marketplace add Agentworkforce/skills
-/plugin install claude-relay-plugin
-```
-
-Once installed, coordinate agents with built-in skills:
-
-```bash
-/relay-team Refactor the auth module — split the middleware, update tests, and update docs
-/relay-fanout Run linting fixes across all packages in the monorepo
-/relay-pipeline Analyze the API logs, then generate a summary report, then draft an email
-```
-
-Or just describe what you want in plain language — the plugin's hooks and agent definitions handle the infrastructure automatically:
-
-```bash
-Use relay fan-out to lint all packages in parallel
-Split the migration into three relay workers — one for the schema, one for the API, one for the frontend
-```
-
-## LLM / Machine-Readable Docs
-
-These docs are also available as plain Markdown for LLMs, CLI tools, and programmatic access:
-
-- [📄 Markdown Docs on GitHub](https://github.com/AgentWorkforce/relay/tree/main/docs/markdown)
-  Plain-text versions of every page — no MDX components, no JavaScript. Designed for `curl`, agents, and language models.
-
-
-## Next Steps
-
-<CardGroup cols={2}>
-  - [Quickstart](/docs/quickstart)
-    Get your first agents talking to each other in minutes.
-  
-  - [Communicate Mode](/docs/communicate)
-    Put any framework agent on the relay with on_relay().
-  
-  - [TypeScript SDK](/docs/typescript-sdk)
-    Full API reference for the TypeScript SDK.
-  
-  - [Python SDK](/docs/python-sdk)
-    Full API reference for the Python SDK.
-  
diff --git a/docs/permissions.md b/docs/permissions.md
deleted file mode 100644
index 753256549..000000000
--- a/docs/permissions.md
+++ /dev/null
@@ -1,122 +0,0 @@
-# Permissions
-
-Control what workflow agents can read, write, execute, and access over the network.
-
-## Quick Start
-
-```yaml
-agents:
-  - name: reviewer
-    cli: claude
-    permissions: readonly # preset string
-
-  - name: writer
-    cli: codex
-    permissions:
-      access: restricted # inline block
-      files:
-        read: ['src/**']
-        write: ['docs/**']
-        deny: ['.env*', 'secrets/**']
-```
-
-## Access Presets
-
-| Preset       | Read                    | Write                   | Dotfiles  |
-| ------------ | ----------------------- | ----------------------- | --------- |
-| `readonly`   | all non-ignored         | none                    | inherited |
-| `readwrite`  | all non-ignored         | all non-ignored         | inherited |
-| `restricted` | nothing (explicit only) | nothing (explicit only) | inherited |
-| `full`       | everything              | everything              | ignored   |
-
-Default when omitted: `readwrite`.
-
-## File Permissions
-
-```yaml
-permissions:
-  access: restricted
-  files:
-    read: ['src/**', 'package.json']
-    write: ['tests/**']
-    deny: ['.env*', 'secrets/**']
-```
-
-- `write` implies read access
-- `deny` always wins over read/write grants
-- Merged on top of the access preset
-
-## Network
-
-```yaml
-# Boolean — allow or deny all
-permissions:
-  network: false
-
-# Object — scoped allowlist
-permissions:
-  network:
-    allow: ['registry.npmjs.org:443', 'github.com:443']
-    deny: ['*']
-```
-
-## Exec
-
-```yaml
-permissions:
-  exec: ['npm test', 'npx vitest', 'git diff']
-```
-
-Matches by command prefix. Omit to allow all commands.
-
-## Profiles
-
-Reusable named permission blocks:
-
-```yaml
-permissions:
-  profiles:
-    source-dev:
-      access: restricted
-      files:
-        read: ['src/**', 'packages/**', 'package.json']
-        write: ['src/**', 'tests/**']
-        deny: ['.env*', 'secrets/**']
-      network: false
-  default: source-dev
-
-agents:
-  - name: frontend
-    cli: codex
-    permissions: source-dev
-```
-
-## Dotfiles
-
-- `.agentignore` — hides files from agents entirely
-- `.agentreadonly` — visible but not writable
-- `.<agent>.agentignore` / `.<agent>.agentreadonly` — per-agent overrides
-
-Applied before YAML rules. Bypassed by `full` preset.
-
-## Resolution Order
-
-1. Dotfiles (when inherited)
-2. `access` preset
-3. Explicit `files` globs
-4. `deny` rules (always win)
-
-## Step-Level Overrides
-
-Steps can narrow the agent's permissions for a specific task:
-
-```yaml
-steps:
-  - name: ui
-    type: agent
-    agent: frontend
-    permissions:
-      access: restricted
-      files:
-        write: ['src/components/**']
-```
diff --git a/docs/plugin-claude-code.md b/docs/plugin-claude-code.md
deleted file mode 100644
index ab08fdb83..000000000
--- a/docs/plugin-claude-code.md
+++ /dev/null
@@ -1,92 +0,0 @@
-# Claude Code Plugin
-
-Use Agent Relay directly inside Claude Code — coordinate multi-agent workflows with slash commands or natural language.
-
-## Overview
-
-The Agent Relay plugin for Claude Code gives you multi-agent coordination without writing SDK code. Install it once and spawn teams, fan-out work, and run pipelines using slash commands or plain English.
-
-The plugin works through Claude Code's MCP integration, exposing Relaycast messaging tools (channels, DMs, threads, reactions) directly to your Claude session.
-
-## Install
-
-```bash
-/plugin marketplace add Agentworkforce/skills
-/plugin install claude-relay-plugin
-```
-
-Verify the install:
-
-```bash
-/plugin list
-```
-
-You should see `claude-relay-plugin` in the output.
-
-
-## Slash Commands
-
-Once installed, the plugin adds relay-specific slash commands:
-
-### `/relay-team`
-
-Coordinate a team of agents to work on a complex task. The plugin spawns a lead agent that breaks the task down and delegates subtasks to workers.
-
-```bash
-/relay-team Refactor the auth module — split the middleware, update tests, and update docs
-```
-
-### `/relay-fanout`
-
-Fan out identical or similar work across multiple agents in parallel.
-
-```bash
-/relay-fanout Run linting fixes across all packages in the monorepo
-```
-
-### `/relay-pipeline`
-
-Run a sequential pipeline where each step feeds into the next.
-
-```bash
-/relay-pipeline Analyze the API logs, then generate a summary report, then draft an email
-```
-
-## Natural Language
-
-You don't need slash commands — describe what you want and the plugin handles the orchestration:
-
-```bash
-Use relay fan-out to lint all packages in parallel
-Split the migration into three relay workers — one for the schema, one for the API, one for the frontend
-Coordinate a team to review and refactor the payment service
-```
-
-## MCP Tools
-
-Under the hood, the plugin exposes these Relaycast MCP tools to your Claude session:
-
-| Tool | Description |
-| ---- | ----------- |
-| `message_post` | Post a message to a channel |
-| `message_reply` | Reply to a message in a thread |
-| `message_dm_send` | Send a direct message to another agent |
-| `message_reaction_add` | React to a message |
-| `channel_create` | Create a new channel |
-| `channel_list` | List available channels |
-| `agent_register` | Register an agent in the workspace |
-| `agent_list` | List agents in the workspace |
-| `message_search` | Search messages across channels |
-| `message_inbox_check` | Check unread messages |
-
-## How It Works
-
-1. The plugin starts a local MCP server that connects to your Relay workspace
-2. Claude Code discovers the MCP tools and can invoke them during your session
-3. When you use a slash command or describe a multi-agent task, the plugin's skills and agent definitions translate your intent into relay API calls
-4. Spawned agents communicate through Relaycast channels and report progress back to your session
-
-## Next Steps
-
-- [Quickstart](/docs/quickstart) — Learn the SDK fundamentals that the plugin builds on.
-- [Workflows](/docs/reference-workflows) — Build more complex orchestration patterns with the workflow API.
diff --git a/docs/reference-cli.md b/docs/reference-cli.md
deleted file mode 100644
index 62102de84..000000000
--- a/docs/reference-cli.md
+++ /dev/null
@@ -1,48 +0,0 @@
-# CLI reference
-
-Reference for broker-level command surfaces that are useful to integrations.
-
-## `mcp-args`
-
-Compute per-CLI MCP args without spawning.
-
-```bash
-agent-relay-broker mcp-args \
-  --cli claude \
-  --agent-name reviewer \
-  --api-key rk_live_example \
-  --base-url https://api.relaycast.dev \
-  --cwd /Users/me/project
-```
-
-Sample output:
-
-```json
-{
-  "args": [
-    "--mcp-config",
-    "{\"mcpServers\":{\"relaycast\":{\"command\":\"npx\",\"args\":[\"-y\",\"@relaycast/mcp\"]}}}"
-  ],
-  "sideEffectFiles": [],
-  "agentToken": null
-}
-```
-
-Flags:
-
-| Flag                              | Required | Description                                                                                                                                     |
-| --------------------------------- | -------- | ----------------------------------------------------------------------------------------------------------------------------------------------- |
-| `--cli <name>`                    | yes      | CLI name or command to compute MCP args for, such as `claude`, `codex`, `opencode`, `cursor-agent`, `gemini`, or `droid`.                       |
-| `--agent-name <name>`             | yes      | Relaycast agent name to inject into the MCP configuration.                                                                                      |
-| `--api-key <key>`                 | no       | Relaycast API key. Falls back to `RELAY_API_KEY`.                                                                                               |
-| `--base-url <url>`                | no       | Relaycast base URL. Falls back to `RELAY_BASE_URL`.                                                                                             |
-| `--agent-token <token>`           | no       | Pre-registered agent token to pass to the child MCP server.                                                                                     |
-| `--register`                      | no       | Mint a fresh Relaycast agent token with `--api-key`/`RELAY_API_KEY` and `--base-url`/`RELAY_BASE_URL`; mutually exclusive with `--agent-token`. |
-| `--workspaces-json <json>`        | no       | Multi-workspace context JSON to pass to the child MCP server.                                                                                   |
-| `--default-workspace <workspace>` | no       | Default workspace ID or name to pass to the child MCP server.                                                                                   |
-| `--cwd <path>`                    | no       | Working directory used by CLIs that need local MCP config files. Defaults to the current directory.                                             |
-| `--existing-args <json>`          | no       | Existing CLI args as a JSON string array, for example `'["--foo","--bar"]'`. Defaults to `[]`.                                                  |
-
-`agentToken` is `null` unless `--register` successfully minted and injected a fresh Relaycast agent token.
-
-`mcp-args` writes side-effect files synchronously when the target CLI requires a config file. For `opencode`, it may write `<cwd>/opencode.json`; for `cursor`, `cursor-agent`, and `agent`, it may write `<cwd>/.cursor/mcp.json`; for `gemini`, it may write `<HOME>/.gemini/trustedFolders.json`. Treat the command as compute plus configure for those CLIs, not as a side-effect-free dry run.
diff --git a/examples/.env.example b/examples/.env.example
deleted file mode 100644
index 1168ca84e..000000000
--- a/examples/.env.example
+++ /dev/null
@@ -1,42 +0,0 @@
-# Agent Relay Configuration
-# Copy this file to .env in your project root
-
-# ============================================
-# Data Storage
-# ============================================
-
-# Base directory for all agent-relay data
-# Default: ~/.agent-relay (or XDG_DATA_HOME/agent-relay)
-# AGENT_RELAY_DATA_DIR=/path/to/data
-
-# Storage backend type (sqlite, memory)
-# Default: sqlite
-# AGENT_RELAY_STORAGE_TYPE=sqlite
-
-# Custom SQLite database path
-# Default: <data_dir>/messages.sqlite
-# AGENT_RELAY_STORAGE_PATH=/path/to/messages.sqlite
-
-# SQLite driver preference (better-sqlite3, node)
-# Default: better-sqlite3 (falls back to node:sqlite if unavailable)
-# AGENT_RELAY_SQLITE_DRIVER=better-sqlite3
-
-# ============================================
-# Dashboard
-# ============================================
-
-# Web dashboard port
-# Default: 3888
-# AGENT_RELAY_DASHBOARD_PORT=3888
-
-# ============================================
-# Agent Identity
-# ============================================
-
-# Default agent name (used by hooks)
-# If not set, a random name is generated
-# AGENT_RELAY_NAME=MyAgent
-
-# Custom inbox directory for message storage
-# Default: <data_dir>/inbox
-# AGENT_RELAY_INBOX_DIR=/path/to/inbox
diff --git a/examples/README.md b/examples/README.md
deleted file mode 100644
index 87b28e53e..000000000
--- a/examples/README.md
+++ /dev/null
@@ -1,175 +0,0 @@
-# Agent Relay Configuration Examples
-
-This folder contains examples for configuring agent-relay in different environments.
-
-## Configuration Files
-
-| File | Description |
-|------|-------------|
-| `.env.example` | Environment variables for dotenv configuration |
-| `cli-usage.sh` | CLI command examples and options |
-| `programmatic-usage.ts` | Using agent-relay as a Node.js library |
-| `slack-claude-bot.ts` | Slack bot with Claude Code via agent-relay |
-| `slack-claude-standalone.ts` | Standalone Slack + Claude Code bot (no relay) |
-| `discord-claude-bot.ts` | Discord bot with Claude Code via agent-relay |
-| `discord-claude-standalone.ts` | Standalone Discord + Claude Code bot (no relay) |
-| `slack-codex-standalone.ts` | Standalone Slack + Codex CLI bot |
-| `discord-codex-standalone.ts` | Standalone Discord + Codex CLI bot |
-| `docker-compose.yml` | Docker Compose setup for containerized deployment |
-| `agent-relay.service` | Systemd service file for Linux servers |
-| `team-config.json` | Team configuration with multiple agents |
-
-## Usage Examples
-
-| Directory | Description |
-|-----------|-------------|
-| `basic-chat/` | Simple two-agent chat example |
-| `collaborative-task/` | Multi-agent collaboration workflow |
-
-## Quick Start
-
-### Environment Variables
-
-Copy `.env.example` to your project root as `.env`:
-
-```bash
-cp examples/.env.example .env
-```
-
-Edit the values as needed. Agent-relay uses dotenv to load these automatically.
-
-### CLI Configuration
-
-All configuration can also be passed via CLI flags:
-
-```bash
-# Start with dashboard on custom port
-agent-relay up --dashboard --port 4000
-agent-relay -n MyAgent claude
-```
-
-### Programmatic Configuration
-
-```typescript
-import { Daemon, getProjectPaths } from 'agent-relay';
-
-const paths = getProjectPaths();
-const daemon = new Daemon({
-  socketPath: paths.socketPath,
-  storagePath: paths.dbPath,
-});
-```
-
-## Slack Bot Examples
-
-Two Slack bot examples are included - both use Claude Code CLI (your subscription, no API costs).
-
-### Standalone Bot (Quick Test)
-
-No agent-relay needed - just Slack + Claude Code:
-
-```bash
-# Install Slack SDK
-npm install @slack/bolt
-
-# Run (ensure `claude` CLI is logged in)
-SLACK_BOT_TOKEN=xoxb-... SLACK_APP_TOKEN=xapp-... npx ts-node examples/slack-claude-standalone.ts
-```
-
-### Agent-Relay Bridge
-
-Bridges Slack with your relay network - agents can send messages to Slack:
-
-```bash
-# Start relay daemon first
-agent-relay up
-
-# Run the bridge
-SLACK_BOT_TOKEN=xoxb-... SLACK_APP_TOKEN=xapp-... npx ts-node examples/slack-claude-bot.ts
-```
-
-### Slack App Setup
-
-1. Create app at https://api.slack.com/apps
-2. Enable **Socket Mode** → copy App Token (`xapp-...`)
-3. **OAuth & Permissions** → add scopes: `app_mentions:read`, `chat:write`
-4. **Event Subscriptions** → subscribe to `app_mention`
-5. Install to workspace → copy Bot Token (`xoxb-...`)
-
-## Discord Bot Examples
-
-Two Discord bot examples are included - both use Claude Code CLI (your subscription, no API costs).
-
-### Standalone Bot (Quick Test)
-
-No agent-relay needed - just Discord + Claude Code:
-
-```bash
-# Install Discord.js
-npm install discord.js
-
-# Run (ensure `claude` CLI is logged in)
-DISCORD_TOKEN=... npx ts-node examples/discord-claude-standalone.ts
-```
-
-### Agent-Relay Bridge
-
-Bridges Discord with your relay network - agents can send messages to Discord:
-
-```bash
-# Start relay daemon first
-agent-relay up
-
-# Run the bridge
-DISCORD_TOKEN=... npx ts-node examples/discord-claude-bot.ts
-```
-
-### Discord App Setup
-
-1. Create app at https://discord.com/developers/applications
-2. **Bot** → Add Bot → copy Token
-3. **Bot** → enable **Message Content Intent**
-4. **OAuth2** → URL Generator → select `bot` scope
-5. Select permissions: `Send Messages`, `Read Message History`
-6. Use generated URL to invite bot to your server
-
-## Codex Bot Examples
-
-Codex CLI examples for both Slack and Discord (uses OpenAI Codex subscription).
-
-### Setup Codex CLI
-
-```bash
-npm install -g @openai/codex
-codex auth login
-```
-
-### Slack + Codex
-
-```bash
-npm install @slack/bolt
-SLACK_BOT_TOKEN=xoxb-... SLACK_APP_TOKEN=xapp-... npx ts-node examples/slack-codex-standalone.ts
-```
-
-### Discord + Codex
-
-```bash
-npm install discord.js
-DISCORD_TOKEN=... npx ts-node examples/discord-codex-standalone.ts
-```
-
-## Configuration Priority
-
-1. CLI flags (highest priority)
-2. Environment variables
-3. Default values (lowest priority)
-
-## Multi-Project Setup
-
-Agent-relay automatically isolates data per project based on the project root directory. Each project gets its own:
-
-- SQLite database
-- Unix socket
-- Message history
-
-Projects are identified by a hash of their root path (detected via `.git`, `package.json`, etc.).
diff --git a/examples/agent-relay.service b/examples/agent-relay.service
deleted file mode 100644
index 14767c872..000000000
--- a/examples/agent-relay.service
+++ /dev/null
@@ -1,74 +0,0 @@
-# Systemd service file for agent-relay daemon
-#
-# Installation:
-#   1. Copy this file to /etc/systemd/system/agent-relay.service
-#   2. Edit the paths and user as needed
-#   3. Run: sudo systemctl daemon-reload
-#   4. Run: sudo systemctl enable agent-relay
-#   5. Run: sudo systemctl start agent-relay
-#
-# Features:
-#   - Auto-restart on crash (RestartSec=2s)
-#   - Built-in supervisor mode (--watch) for additional resilience
-#   - Health check via /health endpoint
-#   - Systemd watchdog integration
-
-[Unit]
-Description=Agent Relay Daemon
-Documentation=https://github.com/khaliqgant/agent-relay
-After=network.target
-
-[Service]
-Type=simple
-User=your-username
-Group=your-username
-
-# Working directory (your project root)
-WorkingDirectory=/path/to/your/project
-
-# Environment configuration
-Environment=NODE_ENV=production
-Environment=AGENT_RELAY_DATA_DIR=/var/lib/agent-relay
-Environment=AGENT_RELAY_DASHBOARD_PORT=3888
-
-# Start the daemon with supervisor mode (--watch) and dashboard enabled
-# This provides two layers of resilience:
-# 1. The --watch flag restarts the daemon if it crashes
-# 2. Systemd restarts the supervisor if it crashes
-ExecStart=/usr/bin/npx agent-relay up --dashboard --watch --port 3888
-
-# Alternative: Run without --watch (rely on systemd alone)
-# ExecStart=/usr/bin/npx agent-relay up --dashboard --port 3888
-
-# Graceful shutdown
-ExecStop=/usr/bin/npx agent-relay down
-
-# Restart policy - restart on any failure
-Restart=always
-RestartSec=2
-
-# Rate limiting - don't restart more than 5 times in 60 seconds
-StartLimitIntervalSec=60
-StartLimitBurst=5
-
-# Memory and resource limits (optional, adjust as needed)
-# MemoryLimit=512M
-# CPUQuota=50%
-
-# Security hardening (optional)
-# NoNewPrivileges=true
-# ProtectSystem=strict
-# ProtectHome=true
-
-# Logging
-StandardOutput=journal
-StandardError=journal
-SyslogIdentifier=agent-relay
-
-# Send SIGTERM on stop, wait 30s before SIGKILL
-TimeoutStopSec=30
-KillMode=mixed
-KillSignal=SIGTERM
-
-[Install]
-WantedBy=multi-user.target
diff --git a/examples/ai-sdk-relay-helpdesk/README.md b/examples/ai-sdk-relay-helpdesk/README.md
deleted file mode 100644
index 8d641f76a..000000000
--- a/examples/ai-sdk-relay-helpdesk/README.md
+++ /dev/null
@@ -1,39 +0,0 @@
-# AI SDK + Relay Helpdesk Example
-
-A small consumer-facing Next.js app that uses the AI SDK adapter as the point-person layer and escalates bigger requests into a Relay workflow.
-
-## What it demonstrates
-
-- `onRelay()` attached to an AI SDK model via `wrapLanguageModel()`
-- normal user-facing chat turns through `streamText()`
-- a simple escalation gate that kicks off `runWorkflow()` for longer multi-step work
-- a workflow file that uses a lead + specialist review path
-
-## Files
-
-- `app/page.tsx` — tiny browser UI
-- `app/api/chat/route.ts` — AI SDK route with Relay communicate middleware
-- `workflows/helpdesk-escalation.yaml` — Relay workflow used for escalations
-
-## Run
-
-```bash
-cd examples/ai-sdk-relay-helpdesk
-npm install
-npm run dev
-```
-
-Set the env vars your app needs first, for example:
-
-```bash
-export OPENAI_API_KEY=...
-export RELAY_API_KEY=...
-export RELAY_BASE_URL=http://localhost:3888
-```
-
-Then open `http://localhost:3000` and try:
-
-- a normal question like `Summarize the latest support issue`
-- an escalation like `Please escalate: coordinate a migration plan for repo X`
-
-If the prompt begins with `Please escalate:`, the route starts the Relay workflow and returns the workflow run id instead of trying to finish everything in one chat turn.
diff --git a/examples/ai-sdk-relay-helpdesk/app/api/chat/route.ts b/examples/ai-sdk-relay-helpdesk/app/api/chat/route.ts
deleted file mode 100644
index 35a7c48b7..000000000
--- a/examples/ai-sdk-relay-helpdesk/app/api/chat/route.ts
+++ /dev/null
@@ -1,56 +0,0 @@
-import { openai } from '@ai-sdk/openai';
-import { streamText, wrapLanguageModel } from 'ai';
-import { Relay } from '@agent-relay/sdk/communicate';
-import { onRelay } from '@agent-relay/sdk/communicate/adapters/ai-sdk';
-import { runWorkflow } from '@agent-relay/sdk/workflows';
-
-const ESCALATE_PREFIX = 'please escalate:';
-
-export async function POST(request: Request) {
-  const { prompt } = (await request.json()) as { prompt?: string };
-  const text = prompt?.trim() ?? '';
-
-  if (text.length === 0) {
-    return Response.json({ error: 'prompt is required' }, { status: 400 });
-  }
-
-  const relay = new Relay('HelpdeskLead');
-  const relaySession = onRelay(
-    {
-      name: 'HelpdeskLead',
-      instructions:
-        'You are the customer-facing lead. Answer directly when you can. When work needs specialists, use Relay tools, keep the user updated, and escalate to a workflow when the task is clearly multi-step.',
-    },
-    relay,
-  );
-
-  const model = wrapLanguageModel({
-    model: openai('gpt-4o-mini'),
-    middleware: relaySession.middleware,
-  });
-
-  try {
-    if (text.toLowerCase().startsWith(ESCALATE_PREFIX)) {
-      const workflow = await runWorkflow('workflows/helpdesk-escalation.yaml', {
-        vars: {
-          request: text.slice(ESCALATE_PREFIX.length).trim(),
-        },
-      });
-
-      return Response.json({ mode: 'workflow', status: workflow.status, runId: workflow.runId });
-    }
-
-    const result = await streamText({
-      model,
-      tools: relaySession.tools,
-      system:
-        'You are the point person for the user. Coordinate through Relay when needed, but keep the final answer concise and user-facing.',
-      messages: [{ role: 'user', content: text }],
-    });
-
-    return Response.json({ mode: 'chat', text: await result.text });
-  } finally {
-    relaySession.cleanup();
-    await relay.close();
-  }
-}
diff --git a/examples/ai-sdk-relay-helpdesk/app/page.tsx b/examples/ai-sdk-relay-helpdesk/app/page.tsx
deleted file mode 100644
index ae1cd3a2e..000000000
--- a/examples/ai-sdk-relay-helpdesk/app/page.tsx
+++ /dev/null
@@ -1,54 +0,0 @@
-'use client';
-
-import { useState } from 'react';
-
-type ChatResponse =
-  | { mode: 'chat'; text: string }
-  | { mode: 'workflow'; status: string; runId: string };
-
-export default function Page() {
-  const [prompt, setPrompt] = useState('Summarize the latest support issue.');
-  const [result, setResult] = useState<ChatResponse | null>(null);
-  const [loading, setLoading] = useState(false);
-
-  async function onSubmit(event: React.FormEvent<HTMLFormElement>) {
-    event.preventDefault();
-    setLoading(true);
-
-    try {
-      const response = await fetch('/api/chat', {
-        method: 'POST',
-        headers: { 'content-type': 'application/json' },
-        body: JSON.stringify({ prompt }),
-      });
-
-      const data = (await response.json()) as ChatResponse;
-      setResult(data);
-    } finally {
-      setLoading(false);
-    }
-  }
-
-  return (
-    <main style={{ maxWidth: 720, margin: '40px auto', fontFamily: 'sans-serif' }}>
-      <h1>AI SDK + Relay Helpdesk</h1>
-      <p>Normal prompts stay in the chat loop. Prompts starting with <code>Please escalate:</code> hand work to a Relay workflow.</p>
-
-      <form onSubmit={onSubmit} style={{ display: 'grid', gap: 12 }}>
-        <textarea
-          rows={6}
-          value={prompt}
-          onChange={(event) => setPrompt(event.target.value)}
-          style={{ width: '100%' }}
-        />
-        <button type="submit" disabled={loading}>{loading ? 'Working…' : 'Send'}</button>
-      </form>
-
-      {result ? (
-        <pre style={{ marginTop: 24, padding: 16, background: '#111', color: '#eee', overflowX: 'auto' }}>
-          {JSON.stringify(result, null, 2)}
-        </pre>
-      ) : null}
-    </main>
-  );
-}
diff --git a/examples/ai-sdk-relay-helpdesk/package.json b/examples/ai-sdk-relay-helpdesk/package.json
deleted file mode 100644
index 379eb0d54..000000000
--- a/examples/ai-sdk-relay-helpdesk/package.json
+++ /dev/null
@@ -1,18 +0,0 @@
-{
-  "name": "ai-sdk-relay-helpdesk-example",
-  "private": true,
-  "type": "module",
-  "scripts": {
-    "dev": "next dev",
-    "build": "next build",
-    "start": "next start"
-  },
-  "dependencies": {
-    "@agent-relay/sdk": "workspace:*",
-    "@ai-sdk/openai": "^2.0.0",
-    "ai": ">=5.0.0",
-    "next": "^15.3.0",
-    "react": "^19.0.0",
-    "react-dom": "^19.0.0"
-  }
-}
diff --git a/examples/ai-sdk-relay-helpdesk/workflows/helpdesk-escalation.yaml b/examples/ai-sdk-relay-helpdesk/workflows/helpdesk-escalation.yaml
deleted file mode 100644
index 1537fdb9c..000000000
--- a/examples/ai-sdk-relay-helpdesk/workflows/helpdesk-escalation.yaml
+++ /dev/null
@@ -1,55 +0,0 @@
-version: '1.0'
-name: helpdesk-escalation
-swarm:
-  pattern: hub-spoke
-  channel: wf-helpdesk-escalation
-  maxConcurrency: 2
-agents:
-  - name: lead
-    cli: claude
-    role: Customer-facing helpdesk lead. Keeps the final answer crisp.
-  - name: specialist
-    cli: codex
-    role: Technical implementation specialist. Produces the concrete plan.
-  - name: reviewer
-    cli: claude
-    role: Reviews the plan for completeness and user clarity.
-workflows:
-  - name: helpdesk-escalation
-    description: Escalated support or implementation request handled by a lead, specialist, and reviewer.
-    steps:
-      - name: scope
-        agent: lead
-        task: |
-          Review the user request below and turn it into a compact execution brief.
-
-          User request:
-          {{request}}
-
-          Post a short status update to the workflow channel, then output /exit.
-      - name: plan
-        agent: specialist
-        dependsOn: [scope]
-        task: |
-          Create a concrete plan for this escalated request:
-          {{steps.scope.output}}
-
-          Include:
-          - likely files or systems involved
-          - risks and unknowns
-          - the smallest useful next step
-
-          When done, output /exit.
-      - name: review
-        agent: reviewer
-        dependsOn: [plan]
-        task: |
-          Review the proposed plan for clarity, missing risks, and whether it is ready to send back to the user.
-
-          Plan:
-          {{steps.plan.output}}
-
-          If acceptable, say Approved and output /exit.
-        verification:
-          type: exit_code
-          value: '0'
diff --git a/examples/basic-chat/README.md b/examples/basic-chat/README.md
deleted file mode 100644
index 89749c8bd..000000000
--- a/examples/basic-chat/README.md
+++ /dev/null
@@ -1,62 +0,0 @@
-# Basic Chat Example
-
-Two AI agents having a conversation using agent-relay.
-
-## Prerequisites
-
-- agent-relay installed (`npm install` from project root)
-- Two terminal windows
-
-## Quick Start
-
-### Terminal 1: Start the Daemon
-
-```bash
-cd /path/to/agent-relay
-npx agent-relay start -f
-```
-
-### Terminal 2: Agent Alice
-
-```bash
-npx agent-relay wrap -n Alice "claude"
-```
-
-Once Claude starts, you can tell it:
-
-> "Your name is Alice. You're chatting with Bob via agent-relay. Say hello to Bob using the fenced format."
-
-### Terminal 3: Agent Bob
-
-```bash
-npx agent-relay wrap -n Bob "claude"
-```
-
-Once Claude starts, you can tell it:
-
-> "Your name is Bob. You're chatting with Alice via agent-relay. Wait for her message, then respond."
-
-## How It Works
-
-1. Each agent is wrapped with `agent-relay wrap`, which:
-   - Provides MCP tools for agent communication
-   - Routes messages through the broker to other agents
-   - Injects received messages into the agent's terminal
-
-2. Messages are sent using MCP tools:
-
-   ```
-   mcp__relaycast__message_dm_send(to: "RecipientName", text: "Your message here")
-   ```
-
-3. Received messages appear as:
-   ```
-   Relay message from SenderName [id]: Their message
-   ```
-
-## Tips
-
-- Use `mcp__relaycast__message_dm_send(to: "Name", text: "...")` to send direct messages
-- Use `mcp__relaycast__post_message(channel: "general", text: "...")` to broadcast to a channel
-- Use `mcp__relaycast__list_agents()` to see connected agents
-- Check broker status with `agent-relay status`
diff --git a/examples/basic-chat/setup.sh b/examples/basic-chat/setup.sh
deleted file mode 100755
index 94df5824c..000000000
--- a/examples/basic-chat/setup.sh
+++ /dev/null
@@ -1,74 +0,0 @@
-#!/bin/bash
-# Basic Chat Setup Script
-# Sets up two agents for a chat using agent-relay MCP tools
-
-set -e
-
-DATA_DIR="${1:-/tmp/agent-relay-chat}"
-AGENT1="${2:-Alice}"
-AGENT2="${3:-Bob}"
-
-echo "Setting up basic chat in: $DATA_DIR"
-echo "Agents: $AGENT1, $AGENT2"
-echo ""
-
-# Create agent directories
-mkdir -p "$DATA_DIR/$AGENT1"
-mkdir -p "$DATA_DIR/$AGENT2"
-
-# Create instruction files
-cat > "$DATA_DIR/$AGENT1/INSTRUCTIONS.md" << EOF
-# You are $AGENT1
-
-You're participating in a chat with $AGENT2 using agent-relay.
-
-## How to send messages
-
-Use the MCP tool:
-\`\`\`
-mcp__relaycast__message_dm_send(to: "$AGENT2", text: "Your message")
-\`\`\`
-
-## How to check for messages
-
-Use the MCP tool:
-\`\`\`
-mcp__relaycast__message_inbox_check()
-\`\`\`
-
-## Start the conversation
-
-Say hello to $AGENT2!
-EOF
-
-cat > "$DATA_DIR/$AGENT2/INSTRUCTIONS.md" << EOF
-# You are $AGENT2
-
-You're participating in a chat with $AGENT1 using agent-relay.
-
-## How to send messages
-
-Use the MCP tool:
-\`\`\`
-mcp__relaycast__message_dm_send(to: "$AGENT1", text: "Your message")
-\`\`\`
-
-## How to check for messages
-
-Use the MCP tool:
-\`\`\`
-mcp__relaycast__message_inbox_check()
-\`\`\`
-
-## Wait for $AGENT1's message
-
-Check your inbox and respond!
-EOF
-
-echo "Created:"
-echo "  $DATA_DIR/$AGENT1/INSTRUCTIONS.md"
-echo "  $DATA_DIR/$AGENT2/INSTRUCTIONS.md"
-echo ""
-echo "To start:"
-echo "  Terminal 1: Read $DATA_DIR/$AGENT1/INSTRUCTIONS.md and start chatting"
-echo "  Terminal 2: Read $DATA_DIR/$AGENT2/INSTRUCTIONS.md and respond"
diff --git a/examples/cli-usage.sh b/examples/cli-usage.sh
deleted file mode 100644
index 3480652de..000000000
--- a/examples/cli-usage.sh
+++ /dev/null
@@ -1,81 +0,0 @@
-#!/bin/bash
-# Agent Relay CLI Usage Examples
-# These are example commands - don't run this file directly
-
-# ============================================
-# Starting the Daemon
-# ============================================
-
-# Start daemon only (dashboard disabled by default)
-agent-relay up
-
-# Start daemon with web dashboard enabled
-agent-relay up --dashboard
-
-# Start daemon with dashboard on custom port
-agent-relay up --dashboard --port 4000
-
-# Check if daemon is running
-agent-relay status
-
-# Stop the daemon
-agent-relay down
-
-# ============================================
-# Running Agents
-# ============================================
-
-# OPTION 1: Dashboard (recommended for interactive use)
-# Open http://localhost:3888, click "Spawn Agent", enter name and CLI
-
-# OPTION 2: Spawn command (for scripting/automation)
-agent-relay spawn Alice claude "Help with coding tasks"
-agent-relay spawn Bob claude "Wait for instructions"
-
-# Release an agent
-agent-relay release Alice
-
-# ADVANCED: create-agent wraps CLI in tmux with messaging
-# Use when you need shadow agents or other advanced options
-agent-relay create-agent claude
-agent-relay create-agent -n Worker claude
-agent-relay create-agent -q -n Worker claude  # quiet mode
-
-# ============================================
-# Message Management
-# ============================================
-
-# List connected agents
-agent-relay agents
-
-# Show active agents (alias)
-agent-relay who
-
-# Read a truncated message by ID
-agent-relay read abc12345
-
-# View message history
-agent-relay history
-
-# View history with filters
-agent-relay history --since 1h        # Last hour
-agent-relay history --since 30m       # Last 30 minutes
-agent-relay history --limit 50        # Last 50 messages
-agent-relay history --from Alice      # Messages from Alice
-agent-relay history --to Bob          # Messages to Bob
-
-# ============================================
-# Multiple Projects
-# ============================================
-
-# Each project gets isolated data based on project root
-# Just run agent-relay from different project directories
-
-cd /path/to/project-a
-agent-relay up                        # Uses ~/.agent-relay/<hash-of-project-a>/
-
-cd /path/to/project-b
-agent-relay up                        # Uses ~/.agent-relay/<hash-of-project-b>/
-
-# List all known projects
-agent-relay projects
diff --git a/examples/collaborative-task/README.md b/examples/collaborative-task/README.md
deleted file mode 100644
index 974e342a4..000000000
--- a/examples/collaborative-task/README.md
+++ /dev/null
@@ -1,85 +0,0 @@
-# Collaborative Task Example
-
-Multiple AI agents working together on a shared coding task using agent-relay.
-
-## Scenario
-
-Three agents collaborate on building a feature:
-
-- **Architect** - Designs the solution and coordinates
-- **Developer** - Implements the code
-- **Reviewer** - Reviews code and suggests improvements
-
-## Prerequisites
-
-- agent-relay installed
-- Three terminal windows
-
-## Quick Start with PTY Wrapper
-
-### Terminal 1: Daemon
-
-```bash
-npx agent-relay start -f
-```
-
-### Terminal 2: Architect
-
-```bash
-npx agent-relay wrap -n Architect "claude"
-```
-
-Tell the agent:
-
-> "You are the Architect. Your job is to design a solution for adding user authentication. Once you have a plan, message Developer with the design using: mcp__relaycast__message_dm_send(to: 'Developer', text: 'your design here')"
-
-### Terminal 3: Developer
-
-```bash
-npx agent-relay wrap -n Developer "claude"
-```
-
-### Terminal 4: Reviewer
-
-```bash
-npx agent-relay wrap -n Reviewer "claude"
-```
-
-## Communication Flow
-
-```
-Architect                Developer                Reviewer
-    |                        |                       |
-    |---(design doc)-------->|                       |
-    |                        |                       |
-    |                        |---(code for review)-->|
-    |                        |                       |
-    |                        |<--(review feedback)---|
-    |                        |                       |
-    |<--(status update)------|                       |
-    |                        |                       |
-```
-
-## Message Protocol
-
-Agents use structured communication via MCP tools:
-
-```
-# Architect assigns task
-mcp__relaycast__message_dm_send(to: "Developer", text: "TASK: Implement user registration endpoint. Requirements: POST /api/register, validate email, hash password, return JWT.")
-
-# Developer requests review
-mcp__relaycast__message_dm_send(to: "Reviewer", text: "REVIEW REQUEST: Please review src/api/register.ts")
-
-# Reviewer provides feedback
-mcp__relaycast__message_dm_send(to: "Developer", text: "FEEDBACK: Line 23: Use bcrypt instead of md5 for password hashing.")
-
-# Developer notifies completion
-mcp__relaycast__message_dm_send(to: "Architect", text: "DONE: Registration endpoint implemented and reviewed.")
-```
-
-## Tips
-
-- Use `mcp__relaycast__message_dm_send(to: "Name", text: "...")` for direct messages
-- Use clear prefixes (TASK:, REVIEW:, FEEDBACK:, DONE:) for structured communication
-- Keep messages concise - agents can read files for details
diff --git a/examples/collaborative-task/setup.sh b/examples/collaborative-task/setup.sh
deleted file mode 100755
index 19ed262b2..000000000
--- a/examples/collaborative-task/setup.sh
+++ /dev/null
@@ -1,168 +0,0 @@
-#!/bin/bash
-# Collaborative Task Setup Script
-# Creates inboxes and instructions for three agents working together
-
-set -e
-
-DATA_DIR="${1:-/tmp/agent-relay-collab}"
-
-echo "Setting up collaborative task in: $DATA_DIR"
-echo "Agents: Architect, Developer, Reviewer"
-echo ""
-
-# Create directories
-mkdir -p "$DATA_DIR/Architect"
-mkdir -p "$DATA_DIR/Developer"
-mkdir -p "$DATA_DIR/Reviewer"
-
-# Create empty inboxes
-touch "$DATA_DIR/Architect/inbox.md"
-touch "$DATA_DIR/Developer/inbox.md"
-touch "$DATA_DIR/Reviewer/inbox.md"
-
-# Architect instructions
-cat > "$DATA_DIR/Architect/INSTRUCTIONS.md" << 'EOF'
-# You are Architect
-
-You're the technical lead for a collaborative coding task. Your role:
-- Design the solution
-- Assign tasks to Developer
-- Coordinate between Developer and Reviewer
-- Make final decisions on implementation
-
-## Your Team
-- **Developer** - Implements code based on your designs
-- **Reviewer** - Reviews code and ensures quality
-
-## Communication Commands
-
-Send to Developer:
-```bash
-agent-relay inbox-write -t Developer -f Architect -m "TASK: <description>" -d DATA_DIR
-```
-
-Send to Reviewer:
-```bash
-agent-relay inbox-write -t Reviewer -f Architect -m "<message>" -d DATA_DIR
-```
-
-Broadcast to all:
-```bash
-agent-relay inbox-write -t "*" -f Architect -m "STATUS: <update>" -d DATA_DIR
-```
-
-Check your inbox:
-```bash
-agent-relay inbox-poll -n Architect -d DATA_DIR --clear
-```
-
-## Message Prefixes
-- `TASK:` - Assign work
-- `QUESTION:` - Ask for input
-- `DECISION:` - Announce a decision
-- `STATUS:` - Progress update
-
-## Your First Task
-
-Design a simple user authentication system and assign implementation to Developer.
-EOF
-
-# Developer instructions
-cat > "$DATA_DIR/Developer/INSTRUCTIONS.md" << 'EOF'
-# You are Developer
-
-You implement code based on designs from Architect. Your role:
-- Receive tasks from Architect
-- Implement solutions
-- Request code reviews from Reviewer
-- Incorporate feedback
-
-## Your Team
-- **Architect** - Provides designs and coordinates
-- **Reviewer** - Reviews your code
-
-## Communication Commands
-
-Send to Architect:
-```bash
-agent-relay inbox-write -t Architect -f Developer -m "<message>" -d DATA_DIR
-```
-
-Send to Reviewer:
-```bash
-agent-relay inbox-write -t Reviewer -f Developer -m "REVIEW: <what to review>" -d DATA_DIR
-```
-
-Check your inbox:
-```bash
-agent-relay inbox-poll -n Developer -d DATA_DIR --clear
-```
-
-## Message Prefixes
-- `DONE:` - Task completed
-- `REVIEW:` - Request code review
-- `QUESTION:` - Ask for clarification
-- `BLOCKED:` - Report a blocker
-
-## Getting Started
-
-Wait for Architect to assign your first task, then implement and request review.
-EOF
-
-# Reviewer instructions
-cat > "$DATA_DIR/Reviewer/INSTRUCTIONS.md" << 'EOF'
-# You are Reviewer
-
-You ensure code quality through reviews. Your role:
-- Review code from Developer
-- Provide constructive feedback
-- Approve implementations
-- Flag potential issues
-
-## Your Team
-- **Architect** - Technical lead
-- **Developer** - Implements code you review
-
-## Communication Commands
-
-Send to Developer:
-```bash
-agent-relay inbox-write -t Developer -f Reviewer -m "FEEDBACK: <feedback>" -d DATA_DIR
-```
-
-Send to Architect:
-```bash
-agent-relay inbox-write -t Architect -f Reviewer -m "<message>" -d DATA_DIR
-```
-
-Check your inbox:
-```bash
-agent-relay inbox-poll -n Reviewer -d DATA_DIR --clear
-```
-
-## Message Prefixes
-- `FEEDBACK:` - Code review comments
-- `APPROVED:` - Code passes review
-- `CHANGES_NEEDED:` - Requires modifications
-- `CONCERN:` - Flag potential issues
-
-## Getting Started
-
-Wait for Developer to request a code review.
-EOF
-
-# Replace DATA_DIR tokens with the actual path
-sed -i.bak "s|DATA_DIR|$DATA_DIR|g" "$DATA_DIR/Architect/INSTRUCTIONS.md"
-sed -i.bak "s|DATA_DIR|$DATA_DIR|g" "$DATA_DIR/Developer/INSTRUCTIONS.md"
-sed -i.bak "s|DATA_DIR|$DATA_DIR|g" "$DATA_DIR/Reviewer/INSTRUCTIONS.md"
-rm -f "$DATA_DIR"/*/*.bak
-
-echo "Created:"
-echo "  $DATA_DIR/Architect/INSTRUCTIONS.md"
-echo "  $DATA_DIR/Developer/INSTRUCTIONS.md"
-echo "  $DATA_DIR/Reviewer/INSTRUCTIONS.md"
-echo ""
-echo "To start (3 terminals):"
-echo "  Terminal 1: Start agent, then: cat $DATA_DIR/Architect/INSTRUCTIONS.md"
-echo "  Terminal 2: Start agent, then: cat $DATA_DIR/Developer/INSTRUCTIONS.md"
-echo "  Terminal 3: Start agent, then: cat $DATA_DIR/Reviewer/INSTRUCTIONS.md"
diff --git a/examples/discord-claude-bot.ts b/examples/discord-claude-bot.ts
deleted file mode 100644
index 1f2329c37..000000000
--- a/examples/discord-claude-bot.ts
+++ /dev/null
@@ -1,244 +0,0 @@
-/**
- * Discord Claude Bot via Agent Relay
- *
- * A Discord bot that uses Claude Code CLI (subscription-based, no API costs)
- * bridged through agent-relay for message coordination.
- *
- * Setup:
- *   1. Create Discord app: https://discord.com/developers/applications
- *   2. Bot → Add Bot → copy Token
- *   3. Bot → enable "Message Content Intent"
- *   4. OAuth2 → URL Generator → "bot" scope + "Send Messages" + "Read Message History"
- *   5. Use generated URL to invite bot to your server
- *   6. Ensure `claude` CLI is installed and logged in
- *   7. Start agent-relay daemon: `agent-relay up`
- *
- * Run:
- *   DISCORD_TOKEN=... npx ts-node examples/discord-claude-bot.ts
- */
-
-import { Client, GatewayIntentBits, Message, TextChannel } from 'discord.js';
-import { spawn } from 'child_process';
-import { RelayClient } from 'agent-relay';
-import { getProjectPaths } from 'agent-relay';
-
-// Configuration
-const DISCORD_TOKEN = process.env.DISCORD_TOKEN;
-const BOT_NAME = process.env.BOT_NAME || 'DiscordBot';
-const DEFAULT_CHANNEL_ID = process.env.DISCORD_DEFAULT_CHANNEL;
-
-if (!DISCORD_TOKEN) {
-  console.error('Missing DISCORD_TOKEN');
-  process.exit(1);
-}
-
-// Initialize Discord client
-const discord = new Client({
-  intents: [
-    GatewayIntentBits.Guilds,
-    GatewayIntentBits.GuildMessages,
-    GatewayIntentBits.MessageContent,
-    GatewayIntentBits.DirectMessages,
-  ],
-});
-
-// Initialize agent-relay client
-const paths = getProjectPaths();
-const relay = new RelayClient({
-  name: BOT_NAME,
-  socketPath: paths.socketPath,
-});
-
-/**
- * Ask Claude using the CLI (uses subscription, not API)
- */
-async function askClaude(prompt: string): Promise<string> {
-  return new Promise((resolve, reject) => {
-    const claude = spawn('claude', ['--print', prompt], {
-      env: { ...process.env },
-      stdio: ['pipe', 'pipe', 'pipe'],
-    });
-
-    let output = '';
-    let error = '';
-
-    claude.stdout.on('data', (data) => {
-      output += data.toString();
-    });
-
-    claude.stderr.on('data', (data) => {
-      error += data.toString();
-    });
-
-    claude.on('close', (code) => {
-      if (code === 0) {
-        resolve(output.trim());
-      } else {
-        reject(new Error(error || `Claude exited with code ${code}`));
-      }
-    });
-
-    const timeout = setTimeout(() => {
-      claude.kill();
-      reject(new Error('Claude response timeout'));
-    }, 120000);
-
-    claude.on('close', () => clearTimeout(timeout));
-  });
-}
-
-// Split long messages for Discord's 2000 char limit
-function splitMessage(text: string, maxLength = 1900): string[] {
-  const chunks: string[] = [];
-  let remaining = text;
-
-  while (remaining.length > 0) {
-    if (remaining.length <= maxLength) {
-      chunks.push(remaining);
-      break;
-    }
-
-    let splitAt = remaining.lastIndexOf('\n', maxLength);
-    if (splitAt === -1 || splitAt < maxLength / 2) {
-      splitAt = remaining.lastIndexOf(' ', maxLength);
-    }
-    if (splitAt === -1 || splitAt < maxLength / 2) {
-      splitAt = maxLength;
-    }
-
-    chunks.push(remaining.slice(0, splitAt));
-    remaining = remaining.slice(splitAt).trimStart();
-  }
-
-  return chunks;
-}
-
-/**
- * Handle Discord mentions
- */
-discord.on('messageCreate', async (message: Message) => {
-  if (message.author.bot) return;
-
-  const isMentioned = message.mentions.has(discord.user!);
-  const isDM = !message.guild;
-
-  if (!isMentioned && !isDM) return;
-
-  const text = message.content.replace(/<@!?\d+>/g, '').trim();
-  if (!text) return;
-
-  console.log(`[Discord] ${message.author.tag}: ${text}`);
-
-  try {
-    // Notify relay that we received a Discord message
-    await relay.send({
-      to: '*',
-      body: `[Discord #${(message.channel as TextChannel).name || 'DM'}] ${message.author.tag}: ${text}`,
-      data: {
-        source: 'discord',
-        channelId: message.channel.id,
-        guildId: message.guild?.id,
-        userId: message.author.id,
-      },
-    });
-
-    // Show typing
-    await message.channel.sendTyping();
-
-    // Get response from Claude
-    const response = await askClaude(text);
-
-    // Send response to Discord
-    const chunks = splitMessage(response);
-    for (const chunk of chunks) {
-      await message.reply({ content: chunk, allowedMentions: { repliedUser: false } });
-    }
-
-    // Notify relay of the response
-    await relay.send({
-      to: '*',
-      body: `[Discord Response] ${response.substring(0, 200)}...`,
-      data: { source: 'discord-response', channelId: message.channel.id },
-    });
-  } catch (err) {
-    console.error('[Discord] Error:', err);
-    await message.reply(`Error: ${err instanceof Error ? err.message : 'Unknown error'}`);
-  }
-});
-
-/**
- * Handle incoming relay messages - forward to Discord
- */
-relay.on('message', async (msg) => {
-  // Skip messages from ourselves or other Discord sources
-  if (msg.from === BOT_NAME || msg.data?.source?.startsWith('discord')) {
-    return;
-  }
-
-  console.log(`[Relay] Message from ${msg.from}: ${msg.body}`);
-
-  // Check if message specifies a Discord channel
-  const targetChannelId = msg.data?.discordChannel || DEFAULT_CHANNEL_ID;
-
-  if (targetChannelId) {
-    try {
-      const channel = await discord.channels.fetch(targetChannelId);
-      if (channel?.isTextBased()) {
-        const chunks = splitMessage(`**${msg.from}**: ${msg.body}`);
-        for (const chunk of chunks) {
-          await (channel as TextChannel).send(chunk);
-        }
-      }
-    } catch (err) {
-      console.error('[Relay→Discord] Failed to post:', err);
-    }
-  }
-});
-
-/**
- * Handle relay connection events
- */
-relay.on('connected', () => {
-  console.log(`[Relay] Connected as ${BOT_NAME}`);
-});
-
-relay.on('disconnected', () => {
-  console.log('[Relay] Disconnected, will reconnect...');
-});
-
-discord.on('ready', () => {
-  console.log(`[Discord] Logged in as ${discord.user?.tag}`);
-});
-
-/**
- * Startup
- */
-async function main() {
-  try {
-    // Connect to relay daemon
-    await relay.connect();
-    console.log(`[Relay] Connected to ${paths.socketPath}`);
-
-    // Login to Discord
-    await discord.login(DISCORD_TOKEN);
-
-    // Announce presence
-    await relay.broadcast(`${BOT_NAME} online - bridging Discord ↔ Relay`);
-
-    console.log('\nReady! Mention the bot in Discord to interact.');
-    console.log('Messages from relay agents will be forwarded to Discord.\n');
-  } catch (err) {
-    console.error('Startup failed:', err);
-    process.exit(1);
-  }
-}
-
-// Graceful shutdown
-process.on('SIGINT', async () => {
-  console.log('\nShutting down...');
-  await relay.disconnect();
-  discord.destroy();
-  process.exit(0);
-});
-
-main();
diff --git a/examples/discord-claude-standalone.ts b/examples/discord-claude-standalone.ts
deleted file mode 100644
index d5e4ba2ec..000000000
--- a/examples/discord-claude-standalone.ts
+++ /dev/null
@@ -1,140 +0,0 @@
-#!/usr/bin/env npx ts-node
-/**
- * Standalone Discord Claude Bot
- *
- * Minimal Discord bot using Claude Code CLI - no agent-relay required.
- * Uses your Claude Code subscription (no API costs).
- *
- * Setup:
- *   1. Create Discord app: https://discord.com/developers/applications
- *   2. Bot → Add Bot → copy Token
- *   3. Bot → enable "Message Content Intent"
- *   4. OAuth2 → URL Generator → select "bot" scope + "Send Messages" permission
- *   5. Use generated URL to invite bot to your server
- *   6. Ensure `claude` CLI is logged in: `claude auth login`
- *
- * Run:
- *   DISCORD_TOKEN=... npx ts-node examples/discord-claude-standalone.ts
- */
-
-import { Client, GatewayIntentBits, Message } from 'discord.js';
-import { spawn } from 'child_process';
-
-const client = new Client({
-  intents: [
-    GatewayIntentBits.Guilds,
-    GatewayIntentBits.GuildMessages,
-    GatewayIntentBits.MessageContent,
-    GatewayIntentBits.DirectMessages,
-  ],
-});
-
-// Conversation history per channel/thread
-const threads = new Map<string, Array<{ role: string; text: string }>>();
-
-async function askClaude(prompt: string, history: Array<{ role: string; text: string }> = []): Promise<string> {
-  let fullPrompt = prompt;
-  if (history.length > 0) {
-    const context = history.map((m) => `${m.role}: ${m.text}`).join('\n');
-    fullPrompt = `Previous conversation:\n${context}\n\nUser: ${prompt}`;
-  }
-
-  return new Promise((resolve, reject) => {
-    const claude = spawn('claude', ['--print', fullPrompt], {
-      stdio: ['pipe', 'pipe', 'pipe'],
-    });
-
-    let output = '';
-    claude.stdout.on('data', (d) => (output += d));
-    claude.stderr.on('data', (d) => console.error('[claude stderr]', d.toString()));
-
-    claude.on('close', (code) => {
-      code === 0 ? resolve(output.trim()) : reject(new Error(`Exit ${code}`));
-    });
-
-    setTimeout(() => {
-      claude.kill();
-      reject(new Error('Timeout'));
-    }, 120000);
-  });
-}
-
-// Split long messages for Discord's 2000 char limit
-function splitMessage(text: string, maxLength = 1900): string[] {
-  const chunks: string[] = [];
-  let remaining = text;
-
-  while (remaining.length > 0) {
-    if (remaining.length <= maxLength) {
-      chunks.push(remaining);
-      break;
-    }
-
-    // Find a good break point
-    let splitAt = remaining.lastIndexOf('\n', maxLength);
-    if (splitAt === -1 || splitAt < maxLength / 2) {
-      splitAt = remaining.lastIndexOf(' ', maxLength);
-    }
-    if (splitAt === -1 || splitAt < maxLength / 2) {
-      splitAt = maxLength;
-    }
-
-    chunks.push(remaining.slice(0, splitAt));
-    remaining = remaining.slice(splitAt).trimStart();
-  }
-
-  return chunks;
-}
-
-client.on('ready', () => {
-  console.log(`⚡ Discord bot logged in as ${client.user?.tag}`);
-  console.log('   Mention the bot or DM it to chat!');
-});
-
-client.on('messageCreate', async (message: Message) => {
-  // Ignore bot messages
-  if (message.author.bot) return;
-
-  // Check if bot was mentioned or it's a DM
-  const isMentioned = message.mentions.has(client.user!);
-  const isDM = !message.guild;
-
-  if (!isMentioned && !isDM) return;
-
-  // Get the text (remove mention)
-  const text = message.content.replace(/<@!?\d+>/g, '').trim();
-  if (!text) return;
-
-  // Use thread ID or channel ID for conversation tracking
-  const threadId = message.channel.isThread()
-    ? message.channel.id
-    : message.reference?.messageId || message.channel.id;
-
-  console.log(`[${new Date().toISOString()}] ${message.author.tag}: "${text}"`);
-
-  // Get thread history
-  const history = threads.get(threadId) || [];
-
-  try {
-    // Show typing indicator
-    await message.channel.sendTyping();
-
-    const response = await askClaude(text, history);
-
-    // Update history (keep last 10 exchanges)
-    history.push({ role: 'User', text });
-    history.push({ role: 'Claude', text: response });
-    threads.set(threadId, history.slice(-20));
-
-    // Send response (split if too long)
-    const chunks = splitMessage(response);
-    for (const chunk of chunks) {
-      await message.reply({ content: chunk, allowedMentions: { repliedUser: false } });
-    }
-  } catch (err) {
-    console.error('Error:', err);
-    await message.reply(`Error: ${err}`);
-  }
-});
-
-client.login(process.env.DISCORD_TOKEN);
diff --git a/examples/discord-codex-standalone.ts b/examples/discord-codex-standalone.ts
deleted file mode 100644
index eebd2bd2a..000000000
--- a/examples/discord-codex-standalone.ts
+++ /dev/null
@@ -1,125 +0,0 @@
-#!/usr/bin/env npx ts-node
-/**
- * Standalone Discord Codex Bot
- *
- * Minimal Discord bot using OpenAI Codex CLI.
- *
- * Setup:
- *   1. Install Codex CLI: npm install -g @openai/codex
- *   2. Login: codex auth login
- *   3. Create Discord app with Message Content Intent (see README)
- *
- * Run:
- *   DISCORD_TOKEN=... npx ts-node examples/discord-codex-standalone.ts
- */
-
-import { Client, GatewayIntentBits, Message } from 'discord.js';
-import { spawn } from 'child_process';
-
-const client = new Client({
-  intents: [
-    GatewayIntentBits.Guilds,
-    GatewayIntentBits.GuildMessages,
-    GatewayIntentBits.MessageContent,
-    GatewayIntentBits.DirectMessages,
-  ],
-});
-
-const threads = new Map<string, Array<{ role: string; text: string }>>();
-
-async function askCodex(prompt: string, history: Array<{ role: string; text: string }> = []): Promise<string> {
-  let fullPrompt = prompt;
-  if (history.length > 0) {
-    const context = history.map((m) => `${m.role}: ${m.text}`).join('\n');
-    fullPrompt = `Previous conversation:\n${context}\n\nUser: ${prompt}`;
-  }
-
-  return new Promise((resolve, reject) => {
-    const codex = spawn('codex', ['--print', fullPrompt], {
-      stdio: ['pipe', 'pipe', 'pipe'],
-    });
-
-    let output = '';
-    codex.stdout.on('data', (d) => (output += d));
-    codex.stderr.on('data', (d) => console.error('[codex stderr]', d.toString()));
-
-    codex.on('close', (code) => {
-      code === 0 ? resolve(output.trim()) : reject(new Error(`Exit ${code}`));
-    });
-
-    setTimeout(() => {
-      codex.kill();
-      reject(new Error('Timeout'));
-    }, 120000);
-  });
-}
-
-function splitMessage(text: string, maxLength = 1900): string[] {
-  const chunks: string[] = [];
-  let remaining = text;
-
-  while (remaining.length > 0) {
-    if (remaining.length <= maxLength) {
-      chunks.push(remaining);
-      break;
-    }
-
-    let splitAt = remaining.lastIndexOf('\n', maxLength);
-    if (splitAt === -1 || splitAt < maxLength / 2) {
-      splitAt = remaining.lastIndexOf(' ', maxLength);
-    }
-    if (splitAt === -1 || splitAt < maxLength / 2) {
-      splitAt = maxLength;
-    }
-
-    chunks.push(remaining.slice(0, splitAt));
-    remaining = remaining.slice(splitAt).trimStart();
-  }
-
-  return chunks;
-}
-
-client.on('ready', () => {
-  console.log(`⚡ Discord bot logged in as ${client.user?.tag}`);
-  console.log('   Mention the bot or DM it to chat!');
-});
-
-client.on('messageCreate', async (message: Message) => {
-  if (message.author.bot) return;
-
-  const isMentioned = message.mentions.has(client.user!);
-  const isDM = !message.guild;
-
-  if (!isMentioned && !isDM) return;
-
-  const text = message.content.replace(/<@!?\d+>/g, '').trim();
-  if (!text) return;
-
-  const threadId = message.channel.isThread()
-    ? message.channel.id
-    : message.reference?.messageId || message.channel.id;
-
-  console.log(`[${new Date().toISOString()}] ${message.author.tag}: "${text}"`);
-
-  const history = threads.get(threadId) || [];
-
-  try {
-    await message.channel.sendTyping();
-
-    const response = await askCodex(text, history);
-
-    history.push({ role: 'User', text });
-    history.push({ role: 'Codex', text: response });
-    threads.set(threadId, history.slice(-20));
-
-    const chunks = splitMessage(response);
-    for (const chunk of chunks) {
-      await message.reply({ content: chunk, allowedMentions: { repliedUser: false } });
-    }
-  } catch (err) {
-    console.error('Error:', err);
-    await message.reply(`Error: ${err}`);
-  }
-});
-
-client.login(process.env.DISCORD_TOKEN);
diff --git a/examples/docker-compose.yml b/examples/docker-compose.yml
deleted file mode 100644
index 269d227b5..000000000
--- a/examples/docker-compose.yml
+++ /dev/null
@@ -1,42 +0,0 @@
-# Docker Compose example for agent-relay
-# Run with: docker-compose up
-
-version: '3.8'
-
-services:
-  relay-daemon:
-    image: node:20-slim
-    working_dir: /app
-    volumes:
-      - ..:/app
-      - relay-data:/data
-    environment:
-      - AGENT_RELAY_DATA_DIR=/data
-      - AGENT_RELAY_DASHBOARD_PORT=3888
-    ports:
-      - "3888:3888"
-    command: >
-      sh -c "npm install && npm run build && node dist/cli/index.js up --dashboard --port 3888"
-    healthcheck:
-      test: ["CMD", "node", "-e", "require('net').connect(3888).on('error', () => process.exit(1))"]
-      interval: 10s
-      timeout: 5s
-      retries: 3
-
-  # Example agent container
-  agent-alice:
-    image: node:20-slim
-    working_dir: /app
-    volumes:
-      - ..:/app
-      - relay-data:/data
-    environment:
-      - AGENT_RELAY_DATA_DIR=/data
-    depends_on:
-      relay-daemon:
-        condition: service_healthy
-    command: >
-      sh -c "node dist/cli/index.js -n Alice echo 'Hello from Alice'"
-
-volumes:
-  relay-data:
diff --git a/examples/electron-demo/README.md b/examples/electron-demo/README.md
deleted file mode 100644
index 1832845cf..000000000
--- a/examples/electron-demo/README.md
+++ /dev/null
@@ -1,25 +0,0 @@
-# Agent Relay — Electron Demo
-
-Minimal Electron app that demonstrates the `@agent-relay/sdk` integration pattern from the docs.
-
-## Run it
-
-```bash
-cd examples/electron-demo
-npm install   # first time only
-npm start
-```
-
-## What it does
-
-- Starts the Agent Relay broker automatically on launch
-- **Spawn Agent** — enter a name, pick a CLI (claude / codex / gemini), click Spawn
-- **Messages tab** — shows all messages routed through the broker
-- **PTY Output tab** — streams raw terminal output from spawned agents
-- Click an agent in the sidebar to DM it; click again to go back to broadcast mode
-- Type a message and hit Enter or Send
-
-## Requirements
-
-The broker binary must be built (`cargo build --release` from the repo root).
-To actually spawn agents you need the relevant CLI installed (`claude`, `codex`, etc.).
diff --git a/examples/electron-demo/index.html b/examples/electron-demo/index.html
deleted file mode 100644
index fc9b160d4..000000000
--- a/examples/electron-demo/index.html
+++ /dev/null
@@ -1,249 +0,0 @@
-<!DOCTYPE html>
-<html lang="en">
-<head>
-  <meta charset="UTF-8" />
-  <meta http-equiv="Content-Security-Policy" content="default-src 'self'; script-src 'self' 'unsafe-inline'">
-  <title>Agent Relay — Electron Demo</title>
-  <style>
-    * { box-sizing: border-box; margin: 0; padding: 0; }
-    body { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif;
-           background: #0f0f0f; color: #e0e0e0; height: 100vh;
-           display: flex; flex-direction: column; }
-
-    header { padding: 12px 16px; background: #1a1a1a; border-bottom: 1px solid #2a2a2a;
-             display: flex; align-items: center; gap: 10px; }
-    header h1 { font-size: 14px; font-weight: 600; letter-spacing: 0.5px; }
-    .dot { width: 8px; height: 8px; border-radius: 50%; background: #444; flex-shrink: 0; }
-    .dot.connected { background: #22c55e; }
-    .dot.error     { background: #ef4444; }
-
-    .layout { display: flex; flex: 1; overflow: hidden; }
-
-    /* Sidebar */
-    .sidebar { width: 220px; background: #141414; border-right: 1px solid #2a2a2a;
-               display: flex; flex-direction: column; flex-shrink: 0; }
-    .sidebar-title { padding: 10px 12px; font-size: 11px; font-weight: 600;
-                     color: #666; text-transform: uppercase; letter-spacing: 0.8px; }
-    #agents { flex: 1; overflow-y: auto; }
-    .agent-row { padding: 8px 12px; font-size: 13px; display: flex;
-                 align-items: center; gap: 8px; border-bottom: 1px solid #1e1e1e; }
-    .agent-status { font-size: 10px; color: #666; margin-left: auto; }
-    .agent-status.ready    { color: #22c55e; }
-    .agent-status.spawning { color: #f59e0b; }
-    .agent-status.exited   { color: #ef4444; }
-
-    .spawn-form { padding: 10px 12px; border-top: 1px solid #2a2a2a;
-                  display: flex; flex-direction: column; gap: 6px; }
-    .spawn-form input, .spawn-form select {
-      background: #1a1a1a; border: 1px solid #2a2a2a; color: #e0e0e0;
-      padding: 5px 8px; border-radius: 4px; font-size: 12px; width: 100%; }
-    .spawn-form button { background: #1d4ed8; color: white; border: none;
-      padding: 6px; border-radius: 4px; cursor: pointer; font-size: 12px; font-weight: 500; }
-    .spawn-form button:hover { background: #2563eb; }
-    .spawn-form button:disabled { background: #333; color: #666; cursor: default; }
-
-    /* Main */
-    .main { flex: 1; display: flex; flex-direction: column; overflow: hidden; }
-    #log-panel { flex: 1; overflow-y: auto; padding: 10px 12px;
-      font-family: 'SF Mono', Monaco, monospace; font-size: 12px; line-height: 1.6; }
-
-    .log-line { margin-bottom: 2px; }
-    .log-line .ts    { color: #444; margin-right: 6px; }
-    .log-line.msg .from  { color: #60a5fa; }
-    .log-line.msg .arrow { color: #444; margin: 0 4px; }
-    .log-line.msg .text  { color: #e0e0e0; }
-    .log-line.sys  { color: #666; }
-
-    /* Send bar */
-    .send-bar { padding: 10px 12px; border-top: 1px solid #2a2a2a;
-                display: flex; gap: 8px; align-items: center; }
-    .send-bar select { background: #1a1a1a; border: 1px solid #2a2a2a; color: #e0e0e0;
-      padding: 7px 8px; border-radius: 4px; font-size: 13px; min-width: 110px; }
-    .send-bar input  { flex: 1; background: #1a1a1a; border: 1px solid #2a2a2a; color: #e0e0e0;
-      padding: 7px 10px; border-radius: 4px; font-size: 13px; }
-    .send-bar input:focus  { outline: none; border-color: #1d4ed8; }
-    .send-bar button { background: #1d4ed8; color: white; border: none;
-      padding: 7px 14px; border-radius: 4px; cursor: pointer; font-size: 13px; }
-    .send-bar button:hover    { background: #2563eb; }
-    .send-bar button:disabled { background: #333; color: #666; cursor: default; }
-  </style>
-</head>
-<body>
-
-<header>
-  <div class="dot" id="broker-dot"></div>
-  <h1>Agent Relay — Electron Demo</h1>
-  <span id="broker-label" style="font-size:12px;color:#666;margin-left:4px">connecting…</span>
-</header>
-
-<div class="layout">
-  <div class="sidebar">
-    <div class="sidebar-title">Agents</div>
-    <div id="agents"><div style="padding:12px;font-size:12px;color:#555">No agents yet</div></div>
-    <div class="spawn-form">
-      <input id="spawn-name" placeholder="Agent name" value="Lead" />
-      <select id="spawn-cli">
-        <option value="claude">claude</option>
-        <option value="codex">codex</option>
-        <option value="gemini">gemini</option>
-      </select>
-      <button id="spawn-btn">Spawn Agent</button>
-    </div>
-  </div>
-
-  <div class="main">
-    <div id="log-panel"></div>
-    <div class="send-bar">
-      <select id="target-select">
-        <option value="">— to —</option>
-      </select>
-      <input id="msg-input" placeholder="Type a message…" />
-      <button id="send-btn" disabled>Send</button>
-    </div>
-  </div>
-</div>
-
-<script>
-  const brokerDot    = document.getElementById('broker-dot');
-  const brokerLabel  = document.getElementById('broker-label');
-  const agentsEl     = document.getElementById('agents');
-  const logPanel     = document.getElementById('log-panel');
-  const spawnBtn     = document.getElementById('spawn-btn');
-  const spawnName    = document.getElementById('spawn-name');
-  const spawnCli     = document.getElementById('spawn-cli');
-  const msgInput     = document.getElementById('msg-input');
-  const sendBtn      = document.getElementById('send-btn');
-  const targetSelect = document.getElementById('target-select');
-
-  const agentMap = {};  // name → { status }
-
-  // ── Utilities ────────────────────────────────────────────────────────────────
-
-  function ts() {
-    return new Date().toLocaleTimeString('en', { hour12: false, timeStyle: 'medium' });
-  }
-
-  function appendLog(html, cls = '') {
-    const div = document.createElement('div');
-    div.className = `log-line ${cls}`;
-    div.innerHTML = `<span class="ts">${ts()}</span>${html}`;
-    logPanel.appendChild(div);
-    logPanel.scrollTop = logPanel.scrollHeight;
-  }
-
-  // ── Agent list + target dropdown ─────────────────────────────────────────────
-
-  function updateAgents(name, status) {
-    agentMap[name] = { status };
-
-    // Sidebar list
-    agentsEl.innerHTML = '';
-    Object.entries(agentMap).forEach(([n, { status: s }]) => {
-      const row = document.createElement('div');
-      row.className = 'agent-row';
-      const nameSpan = document.createElement('span');
-      nameSpan.textContent = n;
-      const statusSpan = document.createElement('span');
-      statusSpan.className = `agent-status ${s}`;
-      statusSpan.textContent = s;
-      row.appendChild(nameSpan);
-      row.appendChild(statusSpan);
-      agentsEl.appendChild(row);
-    });
-
-    // Target dropdown — keep current selection if still valid
-    const current = targetSelect.value;
-    targetSelect.innerHTML = '<option value="">— to —</option>';
-    Object.entries(agentMap)
-      .filter(([, { status: s }]) => s !== 'exited')
-      .forEach(([n]) => {
-        const opt = document.createElement('option');
-        opt.value = n;
-        opt.textContent = n;
-        if (n === current) opt.selected = true;
-        targetSelect.appendChild(opt);
-      });
-
-    updateSendBtn();
-  }
-
-  function updateSendBtn() {
-    sendBtn.disabled = !targetSelect.value;
-  }
-
-  targetSelect.addEventListener('change', updateSendBtn);
-
-  // ── Broker status ─────────────────────────────────────────────────────────────
-
-  window.relay.onBrokerStatus((status) => {
-    if (status === 'connected') {
-      brokerDot.className = 'dot connected';
-      brokerLabel.textContent = 'broker connected';
-      brokerLabel.style.color = '#22c55e';
-    } else {
-      brokerDot.className = 'dot error';
-      brokerLabel.textContent = 'broker error';
-      brokerLabel.style.color = '#ef4444';
-    }
-  });
-
-  // ── Relay events ──────────────────────────────────────────────────────────────
-
-  window.relay.onMessage(({ from, to, text }) => {
-    appendLog(
-      `<span class="from">${from}</span><span class="arrow">→</span>` +
-      `<span class="from">${to}</span>: <span class="text">${text}</span>`,
-      'msg'
-    );
-  });
-
-  window.relay.onAgentUpdate(({ name, status }) => {
-    updateAgents(name, status);
-  });
-
-  window.relay.onBrokerLog((line) => {
-    appendLog(line, 'sys');
-  });
-
-  // ── Spawn ─────────────────────────────────────────────────────────────────────
-
-  spawnBtn.addEventListener('click', async () => {
-    const name = spawnName.value.trim() || 'Lead';
-    const cli  = spawnCli.value;
-    spawnBtn.disabled = true;
-    spawnBtn.textContent = 'Spawning…';
-
-    const result = await window.relay.spawn(name, cli);
-    if (!result.ok) {
-      appendLog(`Failed to spawn ${name}: ${result.error}`, 'sys');
-    }
-
-    spawnBtn.disabled = false;
-    spawnBtn.textContent = 'Spawn Agent';
-  });
-
-  // ── Send ──────────────────────────────────────────────────────────────────────
-
-  async function doSend() {
-    const to   = targetSelect.value;
-    const text = msgInput.value.trim();
-    if (!to || !text) return;
-    msgInput.value = '';
-
-    const result = await window.relay.sendMessage(to, text);
-    if (result.ok) {
-      appendLog(
-        `<span class="from">you</span><span class="arrow">→</span>` +
-        `<span class="from">${to}</span>: <span class="text">${text}</span>`,
-        'msg'
-      );
-    } else {
-      appendLog(`Send failed: ${result.error}`, 'sys');
-    }
-  }
-
-  sendBtn.addEventListener('click', doSend);
-  msgInput.addEventListener('keydown', (e) => { if (e.key === 'Enter') doSend(); });
-</script>
-</body>
-</html>
diff --git a/examples/electron-demo/main.mjs b/examples/electron-demo/main.mjs
deleted file mode 100644
index 348c53281..000000000
--- a/examples/electron-demo/main.mjs
+++ /dev/null
@@ -1,160 +0,0 @@
-import { app, BrowserWindow, ipcMain } from 'electron';
-import path from 'node:path';
-import { fileURLToPath } from 'node:url';
-import { AgentRelay } from '@agent-relay/sdk';
-
-const __dirname = path.dirname(fileURLToPath(import.meta.url));
-
-let relay = null;
-let win = null;
-
-// Name the human sender uses in the relay
-const HUMAN_NAME = 'user';
-
-function send(channel, data) {
-  win?.webContents?.send(channel, data);
-}
-
-// ── Relay setup ──────────────────────────────────────────────────────────────
-
-async function initRelay() {
-  relay = new AgentRelay({
-    cwd: process.cwd(),
-    env: { ...process.env, RUST_LOG: 'info' },
-  });
-
-  relay.onMessageReceived = (msg) => {
-    console.log('[relay:msg]', JSON.stringify(msg));
-    send('message', { from: msg.from, to: msg.to, text: msg.text });
-  };
-
-  relay.onBrokerStderr((line) => {
-    console.log('[broker:stderr]', line);
-    send('broker-log', line);
-  });
-
-  relay.onAgentSpawned = (agent) => {
-    send('agent-update', { name: agent.name, status: 'spawning' });
-  };
-
-  relay.onAgentReady = (agent) => {
-    send('agent-update', { name: agent.name, status: 'ready' });
-  };
-
-  relay.onAgentExited = (agent) => {
-    send('agent-update', { name: agent.name, status: 'exited' });
-  };
-
-  relay.onWorkerOutput = ({ name, chunk }) => {
-    process.stdout.write(`[${name}] ${chunk}`);
-  };
-
-  try {
-    // listAgents() lazily starts the broker + Relaycast workspace
-    await relay.listAgents();
-
-    // Register the human sender identity with Relaycast so it has a valid
-    // agent token before any send_dm calls are made from it.
-    await relay.preflightAgents([{ name: HUMAN_NAME, cli: 'browser' }]);
-
-    send('broker-status', 'connected');
-  } catch (err) {
-    console.error('[relay] Init error:', err.message);
-    send('broker-status', 'error');
-  }
-}
-
-// ── IPC handlers ─────────────────────────────────────────────────────────────
-
-const DEFAULT_TASK = [
-  'You are a helpful assistant connected to Agent Relay.',
-  'When you receive a relay message (it will appear as "Relay message from X [id]: ..."),',
-  'reply by calling mcp__relaycast__message_post with channel: "general" and your reply as the text.',
-  'Do not use send_dm. Do not type your reply in the terminal.',
-  'Only respond via mcp__relaycast__message_post(channel: "general", text: "...").',
-].join(' ');
-
-ipcMain.handle('spawn', async (_e, name, cli, task) => {
-  try {
-    const agent = await relay.spawn(name, cli, task ?? DEFAULT_TASK);
-    return { ok: true, name: agent.name };
-  } catch (err) {
-    return { ok: false, error: err.message };
-  }
-});
-
-ipcMain.handle('release', async (_e, name) => {
-  try {
-    const agents = await relay.listAgents();
-    const agent = agents.find((a) => a.name === name);
-    if (!agent) return { ok: false, error: 'Agent not found' };
-    await agent.release();
-    return { ok: true };
-  } catch (err) {
-    return { ok: false, error: err.message };
-  }
-});
-
-ipcMain.handle('send-message', async (_e, to, text) => {
-  try {
-    const human = relay.human({ name: HUMAN_NAME });
-    const result = await human.sendMessage({ to, text });
-    console.log('[send-message] delivered to', to, JSON.stringify(result));
-    return { ok: true };
-  } catch (err) {
-    console.error('[send-message] error:', err.message);
-    return { ok: false, error: err.message };
-  }
-});
-
-ipcMain.handle('list-agents', async () => {
-  try {
-    const agents = await relay.listAgents();
-    return { ok: true, agents: agents.map((a) => ({ name: a.name, status: a.status })) };
-  } catch (err) {
-    return { ok: false, agents: [], error: err.message };
-  }
-});
-
-ipcMain.handle('broadcast', async (_e, text) => {
-  try {
-    const human = relay.human({ name: HUMAN_NAME });
-    await human.sendMessage({ to: '*', text });
-    return { ok: true };
-  } catch (err) {
-    return { ok: false, error: err.message };
-  }
-});
-
-// ── Window ────────────────────────────────────────────────────────────────────
-
-function createWindow() {
-  win = new BrowserWindow({
-    width: 900,
-    height: 700,
-    webPreferences: {
-      preload: path.join(__dirname, 'preload.cjs'),
-      contextIsolation: true,
-      nodeIntegration: false,
-    },
-  });
-
-  win.loadFile('index.html');
-}
-
-app.whenReady().then(async () => {
-  createWindow();
-  await initRelay();
-});
-
-app.on('before-quit', (event) => {
-  if (!relay) return;
-  event.preventDefault();
-  relay.shutdown()
-    .then(() => app.exit(0))
-    .catch(() => app.exit(1));
-});
-
-app.on('window-all-closed', () => {
-  if (process.platform !== 'darwin') app.quit();
-});
diff --git a/examples/electron-demo/package-lock.json b/examples/electron-demo/package-lock.json
deleted file mode 100644
index a8207b016..000000000
--- a/examples/electron-demo/package-lock.json
+++ /dev/null
@@ -1,891 +0,0 @@
-{
-  "name": "agent-relay-electron-demo",
-  "version": "1.0.0",
-  "lockfileVersion": 3,
-  "requires": true,
-  "packages": {
-    "": {
-      "name": "agent-relay-electron-demo",
-      "version": "1.0.0",
-      "dependencies": {
-        "@agent-relay/sdk": "file:../../packages/sdk"
-      },
-      "devDependencies": {
-        "electron": "^33.0.0"
-      }
-    },
-    "../../packages/sdk": {
-      "name": "@agent-relay/sdk",
-      "version": "2.3.14",
-      "dependencies": {
-        "@agent-relay/config": "2.3.14",
-        "@relaycast/sdk": "^0.4.0",
-        "yaml": "^2.7.0"
-      },
-      "devDependencies": {
-        "@types/node": "^22.13.10",
-        "typescript": "^5.7.3"
-      }
-    },
-    "node_modules/@agent-relay/sdk": {
-      "resolved": "../../packages/sdk",
-      "link": true
-    },
-    "node_modules/@electron/get": {
-      "version": "2.0.3",
-      "resolved": "https://registry.npmjs.org/@electron/get/-/get-2.0.3.tgz",
-      "integrity": "sha512-Qkzpg2s9GnVV2I2BjRksUi43U5e6+zaQMcjoJy0C+C5oxaKl+fmckGDQFtRpZpZV0NQekuZZ+tGz7EA9TVnQtQ==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "debug": "^4.1.1",
-        "env-paths": "^2.2.0",
-        "fs-extra": "^8.1.0",
-        "got": "^11.8.5",
-        "progress": "^2.0.3",
-        "semver": "^6.2.0",
-        "sumchecker": "^3.0.1"
-      },
-      "engines": {
-        "node": ">=12"
-      },
-      "optionalDependencies": {
-        "global-agent": "^3.0.0"
-      }
-    },
-    "node_modules/@sindresorhus/is": {
-      "version": "4.6.0",
-      "resolved": "https://registry.npmjs.org/@sindresorhus/is/-/is-4.6.0.tgz",
-      "integrity": "sha512-t09vSN3MdfsyCHoFcTRCH/iUtG7OJ0CsjzB8cjAmKc/va/kIgeDI/TxsigdncE/4be734m0cvIYwNaV4i2XqAw==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sindresorhus/is?sponsor=1"
-      }
-    },
-    "node_modules/@szmarczak/http-timer": {
-      "version": "4.0.6",
-      "resolved": "https://registry.npmjs.org/@szmarczak/http-timer/-/http-timer-4.0.6.tgz",
-      "integrity": "sha512-4BAffykYOgO+5nzBWYwE3W90sBgLJoUPRWWcL8wlyiM8IB8ipJz3UMJ9KXQd1RKQXpKp8Tutn80HZtWsu2u76w==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "defer-to-connect": "^2.0.0"
-      },
-      "engines": {
-        "node": ">=10"
-      }
-    },
-    "node_modules/@types/cacheable-request": {
-      "version": "6.0.3",
-      "resolved": "https://registry.npmjs.org/@types/cacheable-request/-/cacheable-request-6.0.3.tgz",
-      "integrity": "sha512-IQ3EbTzGxIigb1I3qPZc1rWJnH0BmSKv5QYTalEwweFvyBDLSAe24zP0le/hyi7ecGfZVlIVAg4BZqb8WBwKqw==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "@types/http-cache-semantics": "*",
-        "@types/keyv": "^3.1.4",
-        "@types/node": "*",
-        "@types/responselike": "^1.0.0"
-      }
-    },
-    "node_modules/@types/http-cache-semantics": {
-      "version": "4.2.0",
-      "resolved": "https://registry.npmjs.org/@types/http-cache-semantics/-/http-cache-semantics-4.2.0.tgz",
-      "integrity": "sha512-L3LgimLHXtGkWikKnsPg0/VFx9OGZaC+eN1u4r+OB1XRqH3meBIAVC2zr1WdMH+RHmnRkqliQAOHNJ/E0j/e0Q==",
-      "dev": true,
-      "license": "MIT"
-    },
-    "node_modules/@types/keyv": {
-      "version": "3.1.4",
-      "resolved": "https://registry.npmjs.org/@types/keyv/-/keyv-3.1.4.tgz",
-      "integrity": "sha512-BQ5aZNSCpj7D6K2ksrRCTmKRLEpnPvWDiLPfoGyhZ++8YtiK9d/3DBKPJgry359X/P1PfruyYwvnvwFjuEiEIg==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "@types/node": "*"
-      }
-    },
-    "node_modules/@types/node": {
-      "version": "20.19.35",
-      "resolved": "https://registry.npmjs.org/@types/node/-/node-20.19.35.tgz",
-      "integrity": "sha512-Uarfe6J91b9HAUXxjvSOdiO2UPOKLm07Q1oh0JHxoZ1y8HoqxDAu3gVrsrOHeiio0kSsoVBt4wFrKOm0dKxVPQ==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "undici-types": "~6.21.0"
-      }
-    },
-    "node_modules/@types/responselike": {
-      "version": "1.0.3",
-      "resolved": "https://registry.npmjs.org/@types/responselike/-/responselike-1.0.3.tgz",
-      "integrity": "sha512-H/+L+UkTV33uf49PH5pCAUBVPNj2nDBXTN+qS1dOwyyg24l3CcicicCA7ca+HMvJBZcFgl5r8e+RR6elsb4Lyw==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "@types/node": "*"
-      }
-    },
-    "node_modules/@types/yauzl": {
-      "version": "2.10.3",
-      "resolved": "https://registry.npmjs.org/@types/yauzl/-/yauzl-2.10.3.tgz",
-      "integrity": "sha512-oJoftv0LSuaDZE3Le4DbKX+KS9G36NzOeSap90UIK0yMA/NhKJhqlSGtNDORNRaIbQfzjXDrQa0ytJ6mNRGz/Q==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "dependencies": {
-        "@types/node": "*"
-      }
-    },
-    "node_modules/boolean": {
-      "version": "3.2.0",
-      "resolved": "https://registry.npmjs.org/boolean/-/boolean-3.2.0.tgz",
-      "integrity": "sha512-d0II/GO9uf9lfUHH2BQsjxzRJZBdsjgsBiW4BvhWk/3qoKwQFjIDVN19PfX8F2D/r9PCMTtLWjYVCFrpeYUzsw==",
-      "deprecated": "Package no longer supported. Contact Support at https://www.npmjs.com/support for more info.",
-      "dev": true,
-      "license": "MIT",
-      "optional": true
-    },
-    "node_modules/buffer-crc32": {
-      "version": "0.2.13",
-      "resolved": "https://registry.npmjs.org/buffer-crc32/-/buffer-crc32-0.2.13.tgz",
-      "integrity": "sha512-VO9Ht/+p3SN7SKWqcrgEzjGbRSJYTx+Q1pTQC0wrWqHx0vpJraQ6GtHx8tvcg1rlK1byhU5gccxgOgj7B0TDkQ==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": "*"
-      }
-    },
-    "node_modules/cacheable-lookup": {
-      "version": "5.0.4",
-      "resolved": "https://registry.npmjs.org/cacheable-lookup/-/cacheable-lookup-5.0.4.tgz",
-      "integrity": "sha512-2/kNscPhpcxrOigMZzbiWF7dz8ilhb/nIHU3EyZiXWXpeq/au8qJ8VhdftMkty3n7Gj6HIGalQG8oiBNB3AJgA==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=10.6.0"
-      }
-    },
-    "node_modules/cacheable-request": {
-      "version": "7.0.4",
-      "resolved": "https://registry.npmjs.org/cacheable-request/-/cacheable-request-7.0.4.tgz",
-      "integrity": "sha512-v+p6ongsrp0yTGbJXjgxPow2+DL93DASP4kXCDKb8/bwRtt9OEF3whggkkDkGNzgcWy2XaF4a8nZglC7uElscg==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "clone-response": "^1.0.2",
-        "get-stream": "^5.1.0",
-        "http-cache-semantics": "^4.0.0",
-        "keyv": "^4.0.0",
-        "lowercase-keys": "^2.0.0",
-        "normalize-url": "^6.0.1",
-        "responselike": "^2.0.0"
-      },
-      "engines": {
-        "node": ">=8"
-      }
-    },
-    "node_modules/clone-response": {
-      "version": "1.0.3",
-      "resolved": "https://registry.npmjs.org/clone-response/-/clone-response-1.0.3.tgz",
-      "integrity": "sha512-ROoL94jJH2dUVML2Y/5PEDNaSHgeOdSDicUyS7izcF63G6sTc/FTjLub4b8Il9S8S0beOfYt0TaA5qvFK+w0wA==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "mimic-response": "^1.0.0"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/debug": {
-      "version": "4.4.3",
-      "resolved": "https://registry.npmjs.org/debug/-/debug-4.4.3.tgz",
-      "integrity": "sha512-RGwwWnwQvkVfavKVt22FGLw+xYSdzARwm0ru6DhTVA3umU5hZc28V3kO4stgYryrTlLpuvgI9GiijltAjNbcqA==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "ms": "^2.1.3"
-      },
-      "engines": {
-        "node": ">=6.0"
-      },
-      "peerDependenciesMeta": {
-        "supports-color": {
-          "optional": true
-        }
-      }
-    },
-    "node_modules/decompress-response": {
-      "version": "6.0.0",
-      "resolved": "https://registry.npmjs.org/decompress-response/-/decompress-response-6.0.0.tgz",
-      "integrity": "sha512-aW35yZM6Bb/4oJlZncMH2LCoZtJXTRxES17vE3hoRiowU2kWHaJKFkSBDnDR+cm9J+9QhXmREyIfv0pji9ejCQ==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "mimic-response": "^3.1.0"
-      },
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/decompress-response/node_modules/mimic-response": {
-      "version": "3.1.0",
-      "resolved": "https://registry.npmjs.org/mimic-response/-/mimic-response-3.1.0.tgz",
-      "integrity": "sha512-z0yWI+4FDrrweS8Zmt4Ej5HdJmky15+L2e6Wgn3+iK5fWzb6T3fhNFq2+MeTRb064c6Wr4N/wv0DzQTjNzHNGQ==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/defer-to-connect": {
-      "version": "2.0.1",
-      "resolved": "https://registry.npmjs.org/defer-to-connect/-/defer-to-connect-2.0.1.tgz",
-      "integrity": "sha512-4tvttepXG1VaYGrRibk5EwJd1t4udunSOVMdLSAL6mId1ix438oPwPZMALY41FCijukO1L0twNcGsdzS7dHgDg==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=10"
-      }
-    },
-    "node_modules/define-data-property": {
-      "version": "1.1.4",
-      "resolved": "https://registry.npmjs.org/define-data-property/-/define-data-property-1.1.4.tgz",
-      "integrity": "sha512-rBMvIzlpA8v6E+SJZoo++HAYqsLrkg7MSfIinMPFhmkorw7X+dOXVJQs+QT69zGkzMyfDnIMN2Wid1+NbL3T+A==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "dependencies": {
-        "es-define-property": "^1.0.0",
-        "es-errors": "^1.3.0",
-        "gopd": "^1.0.1"
-      },
-      "engines": {
-        "node": ">= 0.4"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/ljharb"
-      }
-    },
-    "node_modules/define-properties": {
-      "version": "1.2.1",
-      "resolved": "https://registry.npmjs.org/define-properties/-/define-properties-1.2.1.tgz",
-      "integrity": "sha512-8QmQKqEASLd5nx0U1B1okLElbUuuttJ/AnYmRXbbbGDWh6uS208EjD4Xqq/I9wK7u0v6O08XhTWnt5XtEbR6Dg==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "dependencies": {
-        "define-data-property": "^1.0.1",
-        "has-property-descriptors": "^1.0.0",
-        "object-keys": "^1.1.1"
-      },
-      "engines": {
-        "node": ">= 0.4"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/ljharb"
-      }
-    },
-    "node_modules/detect-node": {
-      "version": "2.1.0",
-      "resolved": "https://registry.npmjs.org/detect-node/-/detect-node-2.1.0.tgz",
-      "integrity": "sha512-T0NIuQpnTvFDATNuHN5roPwSBG83rFsuO+MXXH9/3N1eFbn4wcPjttvjMLEPWJ0RGUYgQE7cGgS3tNxbqCGM7g==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true
-    },
-    "node_modules/electron": {
-      "version": "33.4.11",
-      "resolved": "https://registry.npmjs.org/electron/-/electron-33.4.11.tgz",
-      "integrity": "sha512-xmdAs5QWRkInC7TpXGNvzo/7exojubk+72jn1oJL7keNeIlw7xNglf8TGtJtkR4rWC5FJq0oXiIXPS9BcK2Irg==",
-      "dev": true,
-      "hasInstallScript": true,
-      "license": "MIT",
-      "dependencies": {
-        "@electron/get": "^2.0.0",
-        "@types/node": "^20.9.0",
-        "extract-zip": "^2.0.1"
-      },
-      "bin": {
-        "electron": "cli.js"
-      },
-      "engines": {
-        "node": ">= 12.20.55"
-      }
-    },
-    "node_modules/end-of-stream": {
-      "version": "1.4.5",
-      "resolved": "https://registry.npmjs.org/end-of-stream/-/end-of-stream-1.4.5.tgz",
-      "integrity": "sha512-ooEGc6HP26xXq/N+GCGOT0JKCLDGrq2bQUZrQ7gyrJiZANJ/8YDTxTpQBXGMn+WbIQXNVpyWymm7KYVICQnyOg==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "once": "^1.4.0"
-      }
-    },
-    "node_modules/env-paths": {
-      "version": "2.2.1",
-      "resolved": "https://registry.npmjs.org/env-paths/-/env-paths-2.2.1.tgz",
-      "integrity": "sha512-+h1lkLKhZMTYjog1VEpJNG7NZJWcuc2DDk/qsqSTRRCOXiLjeQ1d1/udrUGhqMxUgAlwKNZ0cf2uqan5GLuS2A==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=6"
-      }
-    },
-    "node_modules/es-define-property": {
-      "version": "1.0.1",
-      "resolved": "https://registry.npmjs.org/es-define-property/-/es-define-property-1.0.1.tgz",
-      "integrity": "sha512-e3nRfgfUZ4rNGL232gUgX06QNyyez04KdjFrF+LTRoOXmrOgFKDg4BCdsjW8EnT69eqdYGmRpJwiPVYNrCaW3g==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "engines": {
-        "node": ">= 0.4"
-      }
-    },
-    "node_modules/es-errors": {
-      "version": "1.3.0",
-      "resolved": "https://registry.npmjs.org/es-errors/-/es-errors-1.3.0.tgz",
-      "integrity": "sha512-Zf5H2Kxt2xjTvbJvP2ZWLEICxA6j+hAmMzIlypy4xcBg1vKVnx89Wy0GbS+kf5cwCVFFzdCFh2XSCFNULS6csw==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "engines": {
-        "node": ">= 0.4"
-      }
-    },
-    "node_modules/es6-error": {
-      "version": "4.1.1",
-      "resolved": "https://registry.npmjs.org/es6-error/-/es6-error-4.1.1.tgz",
-      "integrity": "sha512-Um/+FxMr9CISWh0bi5Zv0iOD+4cFh5qLeks1qhAopKVAJw3drgKbKySikp7wGhDL0HPeaja0P5ULZrxLkniUVg==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true
-    },
-    "node_modules/escape-string-regexp": {
-      "version": "4.0.0",
-      "resolved": "https://registry.npmjs.org/escape-string-regexp/-/escape-string-regexp-4.0.0.tgz",
-      "integrity": "sha512-TtpcNJ3XAzx3Gq8sWRzJaVajRs0uVxA2YAkdb1jm2YkPz4G6egUFAyA3n5vtEIZefPk5Wa4UXbKuS5fKkJWdgA==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/extract-zip": {
-      "version": "2.0.1",
-      "resolved": "https://registry.npmjs.org/extract-zip/-/extract-zip-2.0.1.tgz",
-      "integrity": "sha512-GDhU9ntwuKyGXdZBUgTIe+vXnWj0fppUEtMDL0+idd5Sta8TGpHssn/eusA9mrPr9qNDym6SxAYZjNvCn/9RBg==",
-      "dev": true,
-      "license": "BSD-2-Clause",
-      "dependencies": {
-        "debug": "^4.1.1",
-        "get-stream": "^5.1.0",
-        "yauzl": "^2.10.0"
-      },
-      "bin": {
-        "extract-zip": "cli.js"
-      },
-      "engines": {
-        "node": ">= 10.17.0"
-      },
-      "optionalDependencies": {
-        "@types/yauzl": "^2.9.1"
-      }
-    },
-    "node_modules/fd-slicer": {
-      "version": "1.1.0",
-      "resolved": "https://registry.npmjs.org/fd-slicer/-/fd-slicer-1.1.0.tgz",
-      "integrity": "sha512-cE1qsB/VwyQozZ+q1dGxR8LBYNZeofhEdUNGSMbQD3Gw2lAzX9Zb3uIU6Ebc/Fmyjo9AWWfnn0AUCHqtevs/8g==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "pend": "~1.2.0"
-      }
-    },
-    "node_modules/fs-extra": {
-      "version": "8.1.0",
-      "resolved": "https://registry.npmjs.org/fs-extra/-/fs-extra-8.1.0.tgz",
-      "integrity": "sha512-yhlQgA6mnOJUKOsRUFsgJdQCvkKhcz8tlZG5HBQfReYZy46OwLcY+Zia0mtdHsOo9y/hP+CxMN0TU9QxoOtG4g==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "graceful-fs": "^4.2.0",
-        "jsonfile": "^4.0.0",
-        "universalify": "^0.1.0"
-      },
-      "engines": {
-        "node": ">=6 <7 || >=8"
-      }
-    },
-    "node_modules/get-stream": {
-      "version": "5.2.0",
-      "resolved": "https://registry.npmjs.org/get-stream/-/get-stream-5.2.0.tgz",
-      "integrity": "sha512-nBF+F1rAZVCu/p7rjzgA+Yb4lfYXrpl7a6VmJrU8wF9I1CKvP/QwPNZHnOlwbTkY6dvtFIzFMSyQXbLoTQPRpA==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "pump": "^3.0.0"
-      },
-      "engines": {
-        "node": ">=8"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/global-agent": {
-      "version": "3.0.0",
-      "resolved": "https://registry.npmjs.org/global-agent/-/global-agent-3.0.0.tgz",
-      "integrity": "sha512-PT6XReJ+D07JvGoxQMkT6qji/jVNfX/h364XHZOWeRzy64sSFr+xJ5OX7LI3b4MPQzdL4H8Y8M0xzPpsVMwA8Q==",
-      "dev": true,
-      "license": "BSD-3-Clause",
-      "optional": true,
-      "dependencies": {
-        "boolean": "^3.0.1",
-        "es6-error": "^4.1.1",
-        "matcher": "^3.0.0",
-        "roarr": "^2.15.3",
-        "semver": "^7.3.2",
-        "serialize-error": "^7.0.1"
-      },
-      "engines": {
-        "node": ">=10.0"
-      }
-    },
-    "node_modules/global-agent/node_modules/semver": {
-      "version": "7.7.4",
-      "resolved": "https://registry.npmjs.org/semver/-/semver-7.7.4.tgz",
-      "integrity": "sha512-vFKC2IEtQnVhpT78h1Yp8wzwrf8CM+MzKMHGJZfBtzhZNycRFnXsHk6E5TxIkkMsgNS7mdX3AGB7x2QM2di4lA==",
-      "dev": true,
-      "license": "ISC",
-      "optional": true,
-      "bin": {
-        "semver": "bin/semver.js"
-      },
-      "engines": {
-        "node": ">=10"
-      }
-    },
-    "node_modules/globalthis": {
-      "version": "1.0.4",
-      "resolved": "https://registry.npmjs.org/globalthis/-/globalthis-1.0.4.tgz",
-      "integrity": "sha512-DpLKbNU4WylpxJykQujfCcwYWiV/Jhm50Goo0wrVILAv5jOr9d+H+UR3PhSCD2rCCEIg0uc+G+muBTwD54JhDQ==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "dependencies": {
-        "define-properties": "^1.2.1",
-        "gopd": "^1.0.1"
-      },
-      "engines": {
-        "node": ">= 0.4"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/ljharb"
-      }
-    },
-    "node_modules/gopd": {
-      "version": "1.2.0",
-      "resolved": "https://registry.npmjs.org/gopd/-/gopd-1.2.0.tgz",
-      "integrity": "sha512-ZUKRh6/kUFoAiTAtTYPZJ3hw9wNxx+BIBOijnlG9PnrJsCcSjs1wyyD6vJpaYtgnzDrKYRSqf3OO6Rfa93xsRg==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "engines": {
-        "node": ">= 0.4"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/ljharb"
-      }
-    },
-    "node_modules/got": {
-      "version": "11.8.6",
-      "resolved": "https://registry.npmjs.org/got/-/got-11.8.6.tgz",
-      "integrity": "sha512-6tfZ91bOr7bOXnK7PRDCGBLa1H4U080YHNaAQ2KsMGlLEzRbk44nsZF2E1IeRc3vtJHPVbKCYgdFbaGO2ljd8g==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "@sindresorhus/is": "^4.0.0",
-        "@szmarczak/http-timer": "^4.0.5",
-        "@types/cacheable-request": "^6.0.1",
-        "@types/responselike": "^1.0.0",
-        "cacheable-lookup": "^5.0.3",
-        "cacheable-request": "^7.0.2",
-        "decompress-response": "^6.0.0",
-        "http2-wrapper": "^1.0.0-beta.5.2",
-        "lowercase-keys": "^2.0.0",
-        "p-cancelable": "^2.0.0",
-        "responselike": "^2.0.0"
-      },
-      "engines": {
-        "node": ">=10.19.0"
-      },
-      "funding": {
-        "url": "https://github.com/sindresorhus/got?sponsor=1"
-      }
-    },
-    "node_modules/graceful-fs": {
-      "version": "4.2.11",
-      "resolved": "https://registry.npmjs.org/graceful-fs/-/graceful-fs-4.2.11.tgz",
-      "integrity": "sha512-RbJ5/jmFcNNCcDV5o9eTnBLJ/HszWV0P73bc+Ff4nS/rJj+YaS6IGyiOL0VoBYX+l1Wrl3k63h/KrH+nhJ0XvQ==",
-      "dev": true,
-      "license": "ISC"
-    },
-    "node_modules/has-property-descriptors": {
-      "version": "1.0.2",
-      "resolved": "https://registry.npmjs.org/has-property-descriptors/-/has-property-descriptors-1.0.2.tgz",
-      "integrity": "sha512-55JNKuIW+vq4Ke1BjOTjM2YctQIvCT7GFzHwmfZPGo5wnrgkid0YQtnAleFSqumZm4az3n2BS+erby5ipJdgrg==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "dependencies": {
-        "es-define-property": "^1.0.0"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/ljharb"
-      }
-    },
-    "node_modules/http-cache-semantics": {
-      "version": "4.2.0",
-      "resolved": "https://registry.npmjs.org/http-cache-semantics/-/http-cache-semantics-4.2.0.tgz",
-      "integrity": "sha512-dTxcvPXqPvXBQpq5dUr6mEMJX4oIEFv6bwom3FDwKRDsuIjjJGANqhBuoAn9c1RQJIdAKav33ED65E2ys+87QQ==",
-      "dev": true,
-      "license": "BSD-2-Clause"
-    },
-    "node_modules/http2-wrapper": {
-      "version": "1.0.3",
-      "resolved": "https://registry.npmjs.org/http2-wrapper/-/http2-wrapper-1.0.3.tgz",
-      "integrity": "sha512-V+23sDMr12Wnz7iTcDeJr3O6AIxlnvT/bmaAAAP/Xda35C90p9599p0F1eHR/N1KILWSoWVAiOMFjBBXaXSMxg==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "quick-lru": "^5.1.1",
-        "resolve-alpn": "^1.0.0"
-      },
-      "engines": {
-        "node": ">=10.19.0"
-      }
-    },
-    "node_modules/json-buffer": {
-      "version": "3.0.1",
-      "resolved": "https://registry.npmjs.org/json-buffer/-/json-buffer-3.0.1.tgz",
-      "integrity": "sha512-4bV5BfR2mqfQTJm+V5tPPdf+ZpuhiIvTuAB5g8kcrXOZpTT/QwwVRWBywX1ozr6lEuPdbHxwaJlm9G6mI2sfSQ==",
-      "dev": true,
-      "license": "MIT"
-    },
-    "node_modules/json-stringify-safe": {
-      "version": "5.0.1",
-      "resolved": "https://registry.npmjs.org/json-stringify-safe/-/json-stringify-safe-5.0.1.tgz",
-      "integrity": "sha512-ZClg6AaYvamvYEE82d3Iyd3vSSIjQ+odgjaTzRuO3s7toCdFKczob2i0zCh7JE8kWn17yvAWhUVxvqGwUalsRA==",
-      "dev": true,
-      "license": "ISC",
-      "optional": true
-    },
-    "node_modules/jsonfile": {
-      "version": "4.0.0",
-      "resolved": "https://registry.npmjs.org/jsonfile/-/jsonfile-4.0.0.tgz",
-      "integrity": "sha512-m6F1R3z8jjlf2imQHS2Qez5sjKWQzbuuhuJ/FKYFRZvPE3PuHcSMVZzfsLhGVOkfd20obL5SWEBew5ShlquNxg==",
-      "dev": true,
-      "license": "MIT",
-      "optionalDependencies": {
-        "graceful-fs": "^4.1.6"
-      }
-    },
-    "node_modules/keyv": {
-      "version": "4.5.4",
-      "resolved": "https://registry.npmjs.org/keyv/-/keyv-4.5.4.tgz",
-      "integrity": "sha512-oxVHkHR/EJf2CNXnWxRLW6mg7JyCCUcG0DtEGmL2ctUo1PNTin1PUil+r/+4r5MpVgC/fn1kjsx7mjSujKqIpw==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "json-buffer": "3.0.1"
-      }
-    },
-    "node_modules/lowercase-keys": {
-      "version": "2.0.0",
-      "resolved": "https://registry.npmjs.org/lowercase-keys/-/lowercase-keys-2.0.0.tgz",
-      "integrity": "sha512-tqNXrS78oMOE73NMxK4EMLQsQowWf8jKooH9g7xPavRT706R6bkQJ6DY2Te7QukaZsulxa30wQ7bk0pm4XiHmA==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=8"
-      }
-    },
-    "node_modules/matcher": {
-      "version": "3.0.0",
-      "resolved": "https://registry.npmjs.org/matcher/-/matcher-3.0.0.tgz",
-      "integrity": "sha512-OkeDaAZ/bQCxeFAozM55PKcKU0yJMPGifLwV4Qgjitu+5MoAfSQN4lsLJeXZ1b8w0x+/Emda6MZgXS1jvsapng==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "dependencies": {
-        "escape-string-regexp": "^4.0.0"
-      },
-      "engines": {
-        "node": ">=10"
-      }
-    },
-    "node_modules/mimic-response": {
-      "version": "1.0.1",
-      "resolved": "https://registry.npmjs.org/mimic-response/-/mimic-response-1.0.1.tgz",
-      "integrity": "sha512-j5EctnkH7amfV/q5Hgmoal1g2QHFJRraOtmx0JpIqkxhBhI/lJSl1nMpQ45hVarwNETOoWEimndZ4QK0RHxuxQ==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=4"
-      }
-    },
-    "node_modules/ms": {
-      "version": "2.1.3",
-      "resolved": "https://registry.npmjs.org/ms/-/ms-2.1.3.tgz",
-      "integrity": "sha512-6FlzubTLZG3J2a/NVCAleEhjzq5oxgHyaCU9yYXvcLsvoVaHJq/s5xXI6/XXP6tz7R9xAOtHnSO/tXtF3WRTlA==",
-      "dev": true,
-      "license": "MIT"
-    },
-    "node_modules/normalize-url": {
-      "version": "6.1.0",
-      "resolved": "https://registry.npmjs.org/normalize-url/-/normalize-url-6.1.0.tgz",
-      "integrity": "sha512-DlL+XwOy3NxAQ8xuC0okPgK46iuVNAK01YN7RueYBqqFeGsBjV9XmCAzAdgt+667bCl5kPh9EqKKDwnaPG1I7A==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/object-keys": {
-      "version": "1.1.1",
-      "resolved": "https://registry.npmjs.org/object-keys/-/object-keys-1.1.1.tgz",
-      "integrity": "sha512-NuAESUOUMrlIXOfHKzD6bpPu3tYt3xvjNdRIQ+FeT0lNb4K8WR70CaDxhuNguS2XG+GjkyMwOzsN5ZktImfhLA==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "engines": {
-        "node": ">= 0.4"
-      }
-    },
-    "node_modules/once": {
-      "version": "1.4.0",
-      "resolved": "https://registry.npmjs.org/once/-/once-1.4.0.tgz",
-      "integrity": "sha512-lNaJgI+2Q5URQBkccEKHTQOPaXdUxnZZElQTZY0MFUAuaEqe1E+Nyvgdz/aIyNi6Z9MzO5dv1H8n58/GELp3+w==",
-      "dev": true,
-      "license": "ISC",
-      "dependencies": {
-        "wrappy": "1"
-      }
-    },
-    "node_modules/p-cancelable": {
-      "version": "2.1.1",
-      "resolved": "https://registry.npmjs.org/p-cancelable/-/p-cancelable-2.1.1.tgz",
-      "integrity": "sha512-BZOr3nRQHOntUjTrH8+Lh54smKHoHyur8We1V8DSMVrl5A2malOOwuJRnKRDjSnkoeBh4at6BwEnb5I7Jl31wg==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=8"
-      }
-    },
-    "node_modules/pend": {
-      "version": "1.2.0",
-      "resolved": "https://registry.npmjs.org/pend/-/pend-1.2.0.tgz",
-      "integrity": "sha512-F3asv42UuXchdzt+xXqfW1OGlVBe+mxa2mqI0pg5yAHZPvFmY3Y6drSf/GQ1A86WgWEN9Kzh/WrgKa6iGcHXLg==",
-      "dev": true,
-      "license": "MIT"
-    },
-    "node_modules/progress": {
-      "version": "2.0.3",
-      "resolved": "https://registry.npmjs.org/progress/-/progress-2.0.3.tgz",
-      "integrity": "sha512-7PiHtLll5LdnKIMw100I+8xJXR5gW2QwWYkT6iJva0bXitZKa/XMrSbdmg3r2Xnaidz9Qumd0VPaMrZlF9V9sA==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=0.4.0"
-      }
-    },
-    "node_modules/pump": {
-      "version": "3.0.3",
-      "resolved": "https://registry.npmjs.org/pump/-/pump-3.0.3.tgz",
-      "integrity": "sha512-todwxLMY7/heScKmntwQG8CXVkWUOdYxIvY2s0VWAAMh/nd8SoYiRaKjlr7+iCs984f2P8zvrfWcDDYVb73NfA==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "end-of-stream": "^1.1.0",
-        "once": "^1.3.1"
-      }
-    },
-    "node_modules/quick-lru": {
-      "version": "5.1.1",
-      "resolved": "https://registry.npmjs.org/quick-lru/-/quick-lru-5.1.1.tgz",
-      "integrity": "sha512-WuyALRjWPDGtt/wzJiadO5AXY+8hZ80hVpe6MyivgraREW751X3SbhRvG3eLKOYN+8VEvqLcf3wdnt44Z4S4SA==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/resolve-alpn": {
-      "version": "1.2.1",
-      "resolved": "https://registry.npmjs.org/resolve-alpn/-/resolve-alpn-1.2.1.tgz",
-      "integrity": "sha512-0a1F4l73/ZFZOakJnQ3FvkJ2+gSTQWz/r2KE5OdDY0TxPm5h4GkqkWWfM47T7HsbnOtcJVEF4epCVy6u7Q3K+g==",
-      "dev": true,
-      "license": "MIT"
-    },
-    "node_modules/responselike": {
-      "version": "2.0.1",
-      "resolved": "https://registry.npmjs.org/responselike/-/responselike-2.0.1.tgz",
-      "integrity": "sha512-4gl03wn3hj1HP3yzgdI7d3lCkF95F21Pz4BPGvKHinyQzALR5CapwC8yIi0Rh58DEMQ/SguC03wFj2k0M/mHhw==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "lowercase-keys": "^2.0.0"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/roarr": {
-      "version": "2.15.4",
-      "resolved": "https://registry.npmjs.org/roarr/-/roarr-2.15.4.tgz",
-      "integrity": "sha512-CHhPh+UNHD2GTXNYhPWLnU8ONHdI+5DI+4EYIAOaiD63rHeYlZvyh8P+in5999TTSFgUYuKUAjzRI4mdh/p+2A==",
-      "dev": true,
-      "license": "BSD-3-Clause",
-      "optional": true,
-      "dependencies": {
-        "boolean": "^3.0.1",
-        "detect-node": "^2.0.4",
-        "globalthis": "^1.0.1",
-        "json-stringify-safe": "^5.0.1",
-        "semver-compare": "^1.0.0",
-        "sprintf-js": "^1.1.2"
-      },
-      "engines": {
-        "node": ">=8.0"
-      }
-    },
-    "node_modules/semver": {
-      "version": "6.3.1",
-      "resolved": "https://registry.npmjs.org/semver/-/semver-6.3.1.tgz",
-      "integrity": "sha512-BR7VvDCVHO+q2xBEWskxS6DJE1qRnb7DxzUrogb71CWoSficBxYsiAGd+Kl0mmq/MprG9yArRkyrQxTO6XjMzA==",
-      "dev": true,
-      "license": "ISC",
-      "bin": {
-        "semver": "bin/semver.js"
-      }
-    },
-    "node_modules/semver-compare": {
-      "version": "1.0.0",
-      "resolved": "https://registry.npmjs.org/semver-compare/-/semver-compare-1.0.0.tgz",
-      "integrity": "sha512-YM3/ITh2MJ5MtzaM429anh+x2jiLVjqILF4m4oyQB18W7Ggea7BfqdH/wGMK7dDiMghv/6WG7znWMwUDzJiXow==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true
-    },
-    "node_modules/serialize-error": {
-      "version": "7.0.1",
-      "resolved": "https://registry.npmjs.org/serialize-error/-/serialize-error-7.0.1.tgz",
-      "integrity": "sha512-8I8TjW5KMOKsZQTvoxjuSIa7foAwPWGOts+6o7sgjz41/qMD9VQHEDxi6PBvK2l0MXUmqZyNpUK+T2tQaaElvw==",
-      "dev": true,
-      "license": "MIT",
-      "optional": true,
-      "dependencies": {
-        "type-fest": "^0.13.1"
-      },
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/sprintf-js": {
-      "version": "1.1.3",
-      "resolved": "https://registry.npmjs.org/sprintf-js/-/sprintf-js-1.1.3.tgz",
-      "integrity": "sha512-Oo+0REFV59/rz3gfJNKQiBlwfHaSESl1pcGyABQsnnIfWOFt6JNj5gCog2U6MLZ//IGYD+nA8nI+mTShREReaA==",
-      "dev": true,
-      "license": "BSD-3-Clause",
-      "optional": true
-    },
-    "node_modules/sumchecker": {
-      "version": "3.0.1",
-      "resolved": "https://registry.npmjs.org/sumchecker/-/sumchecker-3.0.1.tgz",
-      "integrity": "sha512-MvjXzkz/BOfyVDkG0oFOtBxHX2u3gKbMHIF/dXblZsgD3BWOFLmHovIpZY7BykJdAjcqRCBi1WYBNdEC9yI7vg==",
-      "dev": true,
-      "license": "Apache-2.0",
-      "dependencies": {
-        "debug": "^4.1.0"
-      },
-      "engines": {
-        "node": ">= 8.0"
-      }
-    },
-    "node_modules/type-fest": {
-      "version": "0.13.1",
-      "resolved": "https://registry.npmjs.org/type-fest/-/type-fest-0.13.1.tgz",
-      "integrity": "sha512-34R7HTnG0XIJcBSn5XhDd7nNFPRcXYRZrBB2O2jdKqYODldSzBAqzsWoZYYvduky73toYS/ESqxPvkDf/F0XMg==",
-      "dev": true,
-      "license": "(MIT OR CC0-1.0)",
-      "optional": true,
-      "engines": {
-        "node": ">=10"
-      },
-      "funding": {
-        "url": "https://github.com/sponsors/sindresorhus"
-      }
-    },
-    "node_modules/undici-types": {
-      "version": "6.21.0",
-      "resolved": "https://registry.npmjs.org/undici-types/-/undici-types-6.21.0.tgz",
-      "integrity": "sha512-iwDZqg0QAGrg9Rav5H4n0M64c3mkR59cJ6wQp+7C4nI0gsmExaedaYLNO44eT4AtBBwjbTiGPMlt2Md0T9H9JQ==",
-      "dev": true,
-      "license": "MIT"
-    },
-    "node_modules/universalify": {
-      "version": "0.1.2",
-      "resolved": "https://registry.npmjs.org/universalify/-/universalify-0.1.2.tgz",
-      "integrity": "sha512-rBJeI5CXAlmy1pV+617WB9J63U6XcazHHF2f2dbJix4XzpUF0RS3Zbj0FGIOCAva5P/d/GBOYaACQ1w+0azUkg==",
-      "dev": true,
-      "license": "MIT",
-      "engines": {
-        "node": ">= 4.0.0"
-      }
-    },
-    "node_modules/wrappy": {
-      "version": "1.0.2",
-      "resolved": "https://registry.npmjs.org/wrappy/-/wrappy-1.0.2.tgz",
-      "integrity": "sha512-l4Sp/DRseor9wL6EvV2+TuQn63dMkPjZ/sp9XkghTEbV9KlPS1xUsZ3u7/IQO4wxtcFB4bgpQPRcR3QCvezPcQ==",
-      "dev": true,
-      "license": "ISC"
-    },
-    "node_modules/yauzl": {
-      "version": "2.10.0",
-      "resolved": "https://registry.npmjs.org/yauzl/-/yauzl-2.10.0.tgz",
-      "integrity": "sha512-p4a9I6X6nu6IhoGmBqAcbJy1mlC4j27vEPZX9F4L4/vZT3Lyq1VkFHw/V/PUcB9Buo+DG3iHkT0x3Qya58zc3g==",
-      "dev": true,
-      "license": "MIT",
-      "dependencies": {
-        "buffer-crc32": "~0.2.3",
-        "fd-slicer": "~1.1.0"
-      }
-    }
-  }
-}
diff --git a/examples/electron-demo/package.json b/examples/electron-demo/package.json
deleted file mode 100644
index 1d9996bb9..000000000
--- a/examples/electron-demo/package.json
+++ /dev/null
@@ -1,16 +0,0 @@
-{
-  "name": "agent-relay-electron-demo",
-  "version": "1.0.0",
-  "description": "Minimal Electron app demonstrating @agent-relay/sdk",
-  "main": "main.mjs",
-  "type": "module",
-  "scripts": {
-    "start": "electron ."
-  },
-  "dependencies": {
-    "@agent-relay/sdk": "file:../../packages/sdk"
-  },
-  "devDependencies": {
-    "electron": "^33.0.0"
-  }
-}
diff --git a/examples/electron-demo/preload.cjs b/examples/electron-demo/preload.cjs
deleted file mode 100644
index 466896e0f..000000000
--- a/examples/electron-demo/preload.cjs
+++ /dev/null
@@ -1,13 +0,0 @@
-const { contextBridge, ipcRenderer } = require('electron');
-
-contextBridge.exposeInMainWorld('relay', {
-  spawn:       (name, cli, task)  => ipcRenderer.invoke('spawn', name, cli, task),
-  release:     (name)             => ipcRenderer.invoke('release', name),
-  sendMessage: (to, text)         => ipcRenderer.invoke('send-message', to, text),
-  listAgents:  ()                 => ipcRenderer.invoke('list-agents'),
-
-  onMessage:     (cb) => { ipcRenderer.on('message',      (_e, d) => cb(d)); },
-  onAgentUpdate: (cb) => { ipcRenderer.on('agent-update', (_e, d) => cb(d)); },
-  onBrokerStatus:(cb) => { ipcRenderer.on('broker-status',(_e, d) => cb(d)); },
-  onBrokerLog:   (cb) => { ipcRenderer.on('broker-log',   (_e, d) => cb(d)); },
-});
diff --git a/examples/programmatic-usage.ts b/examples/programmatic-usage.ts
deleted file mode 100644
index d75e78ab6..000000000
--- a/examples/programmatic-usage.ts
+++ /dev/null
@@ -1,126 +0,0 @@
-/**
- * Programmatic Usage Examples
- *
- * These examples show how to use agent-relay as a library
- * in your own Node.js/TypeScript applications.
- */
-
-import { Daemon } from 'agent-relay';
-import { RelayClient } from 'agent-relay';
-import { TmuxWrapper } from 'agent-relay';
-import { getProjectPaths, ensureProjectDir } from 'agent-relay';
-
-// ============================================
-// Example 1: Start a daemon programmatically
-// ============================================
-
-async function startDaemon() {
-  const paths = ensureProjectDir();
-
-  const daemon = new Daemon({
-    socketPath: paths.socketPath,
-    storagePath: paths.dbPath,
-  });
-
-  await daemon.start();
-  console.log(`Daemon running on ${paths.socketPath}`);
-
-  // Graceful shutdown
-  process.on('SIGINT', async () => {
-    await daemon.stop();
-    process.exit(0);
-  });
-}
-
-// ============================================
-// Example 2: Connect as a client
-// ============================================
-
-async function connectAsClient() {
-  const paths = getProjectPaths();
-
-  const client = new RelayClient({
-    name: 'MyAgent',
-    socketPath: paths.socketPath,
-  });
-
-  // Handle incoming messages
-  client.on('message', (msg) => {
-    console.log(`Received from ${msg.from}: ${msg.body}`);
-  });
-
-  await client.connect();
-
-  // Send a message
-  await client.send({
-    to: 'OtherAgent',
-    body: 'Hello from MyAgent!',
-  });
-
-  // Broadcast to all
-  await client.broadcast('Hello everyone!');
-}
-
-// ============================================
-// Example 3: Wrap a command with TmuxWrapper
-// ============================================
-
-async function wrapCommand() {
-  const paths = getProjectPaths();
-
-  const wrapper = new TmuxWrapper({
-    name: 'Claude',
-    command: 'claude',
-    args: [],
-    socketPath: paths.socketPath,
-    debug: false,
-    useInbox: true,
-    inboxDir: paths.dataDir,
-  });
-
-  process.on('SIGINT', () => {
-    wrapper.stop();
-    process.exit(0);
-  });
-
-  await wrapper.start();
-}
-
-// ============================================
-// Example 4: Custom project paths
-// ============================================
-
-function customPaths() {
-  // Get paths for current directory
-  const currentPaths = getProjectPaths();
-  console.log('Current project:', currentPaths);
-
-  // Get paths for a specific project root
-  const customPaths = getProjectPaths('/path/to/my/project');
-  console.log('Custom project:', customPaths);
-
-  // ProjectPaths structure:
-  // {
-  //   dataDir: string;      // Root directory for project data
-  //   teamDir: string;      // Team data directory
-  //   dbPath: string;       // SQLite database path
-  //   socketPath: string;   // Unix socket path
-  //   projectRoot: string;  // The project root used
-  //   projectId: string;    // Short hash identifier
-  // }
-}
-
-// ============================================
-// Example 5: Environment-based configuration
-// ============================================
-
-function envConfig() {
-  // Set environment variables before importing agent-relay
-  process.env.AGENT_RELAY_DATA_DIR = '/custom/data/dir';
-  process.env.AGENT_RELAY_DASHBOARD_PORT = '4000';
-  process.env.AGENT_RELAY_SQLITE_DRIVER = 'node';
-
-  // Now paths will use the custom data directory
-  const paths = getProjectPaths();
-  console.log(paths.dataDir); // /custom/data/dir/<project-hash>
-}
diff --git a/examples/slack-claude-bot.ts b/examples/slack-claude-bot.ts
deleted file mode 100644
index 54dd629d8..000000000
--- a/examples/slack-claude-bot.ts
+++ /dev/null
@@ -1,201 +0,0 @@
-/**
- * Slack Claude Bot via Agent Relay
- *
- * A simple Slack bot that uses Claude Code CLI (subscription-based, no API costs)
- * bridged through agent-relay for message coordination.
- *
- * Setup:
- *   1. Create a Slack app at https://api.slack.com/apps
- *   2. Enable Socket Mode and get an App Token (xapp-...)
- *   3. Add Bot Token Scopes: app_mentions:read, chat:write, channels:history
- *   4. Install to workspace and get Bot Token (xoxb-...)
- *   5. Ensure `claude` CLI is installed and logged in
- *   6. Start agent-relay daemon: `agent-relay up`
- *
- * Run:
- *   SLACK_BOT_TOKEN=xoxb-... SLACK_APP_TOKEN=xapp-... npx ts-node examples/slack-claude-bot.ts
- */
-
-import { App } from '@slack/bolt';
-import { spawn, ChildProcess } from 'child_process';
-import { RelayClient } from 'agent-relay';
-import { getProjectPaths } from 'agent-relay';
-
-// Configuration
-const SLACK_BOT_TOKEN = process.env.SLACK_BOT_TOKEN;
-const SLACK_APP_TOKEN = process.env.SLACK_APP_TOKEN;
-const BOT_NAME = process.env.BOT_NAME || 'SlackBot';
-
-if (!SLACK_BOT_TOKEN || !SLACK_APP_TOKEN) {
-  console.error('Missing SLACK_BOT_TOKEN or SLACK_APP_TOKEN');
-  process.exit(1);
-}
-
-// Initialize Slack app with Socket Mode
-const slack = new App({
-  token: SLACK_BOT_TOKEN,
-  appToken: SLACK_APP_TOKEN,
-  socketMode: true,
-});
-
-// Initialize agent-relay client
-const paths = getProjectPaths();
-const relay = new RelayClient({
-  name: BOT_NAME,
-  socketPath: paths.socketPath,
-});
-
-// Track Slack threads to relay threads
-const threadMap = new Map<string, string>();
-
-/**
- * Ask Claude using the CLI (uses subscription, not API)
- */
-async function askClaude(prompt: string): Promise<string> {
-  return new Promise((resolve, reject) => {
-    const claude = spawn('claude', ['--print', prompt], {
-      env: { ...process.env },
-      stdio: ['pipe', 'pipe', 'pipe'],
-    });
-
-    let output = '';
-    let error = '';
-
-    claude.stdout.on('data', (data) => {
-      output += data.toString();
-    });
-
-    claude.stderr.on('data', (data) => {
-      error += data.toString();
-    });
-
-    claude.on('close', (code) => {
-      if (code === 0) {
-        resolve(output.trim());
-      } else {
-        reject(new Error(error || `Claude exited with code ${code}`));
-      }
-    });
-
-    // Timeout after 2 minutes
-    const timeout = setTimeout(() => {
-      claude.kill();
-      reject(new Error('Claude response timeout'));
-    }, 120000);
-
-    claude.on('close', () => clearTimeout(timeout));
-  });
-}
-
-/**
- * Handle Slack @mentions
- */
-slack.event('app_mention', async ({ event, say }) => {
-  const threadTs = event.thread_ts || event.ts;
-  const text = event.text.replace(/<@[A-Z0-9]+>/g, '').trim();
-
-  console.log(`[Slack] @mention in ${event.channel}: ${text}`);
-
-  try {
-    // Notify relay that we received a Slack message
-    await relay.send({
-      to: '*',
-      body: `[Slack ${event.channel}] ${text}`,
-      data: { source: 'slack', channel: event.channel, thread: threadTs },
-    });
-
-    // Get response from Claude
-    const response = await askClaude(text);
-
-    // Post response to Slack
-    await say({
-      text: response,
-      thread_ts: threadTs,
-    });
-
-    // Notify relay of the response
-    await relay.send({
-      to: '*',
-      body: `[Slack Response] ${response.substring(0, 200)}...`,
-      data: { source: 'slack-response', channel: event.channel },
-    });
-  } catch (err) {
-    console.error('[Slack] Error:', err);
-    await say({
-      text: `Sorry, I encountered an error: ${err instanceof Error ? err.message : 'Unknown error'}`,
-      thread_ts: threadTs,
-    });
-  }
-});
-
-/**
- * Handle incoming relay messages - forward to Slack
- */
-relay.on('message', async (msg) => {
-  // Skip messages from ourselves or other Slack sources
-  if (msg.from === BOT_NAME || msg.data?.source?.startsWith('slack')) {
-    return;
-  }
-
-  console.log(`[Relay] Message from ${msg.from}: ${msg.body}`);
-
-  // Check if message specifies a Slack channel
-  const targetChannel = msg.data?.slackChannel || process.env.SLACK_DEFAULT_CHANNEL;
-
-  if (targetChannel) {
-    try {
-      await slack.client.chat.postMessage({
-        channel: targetChannel,
-        text: `*${msg.from}*: ${msg.body}`,
-        thread_ts: msg.data?.slackThread,
-      });
-    } catch (err) {
-      console.error('[Relay→Slack] Failed to post:', err);
-    }
-  }
-});
-
-/**
- * Handle relay connection events
- */
-relay.on('connected', () => {
-  console.log(`[Relay] Connected as ${BOT_NAME}`);
-});
-
-relay.on('disconnected', () => {
-  console.log('[Relay] Disconnected, will reconnect...');
-});
-
-/**
- * Startup
- */
-async function main() {
-  try {
-    // Connect to relay daemon
-    await relay.connect();
-    console.log(`[Relay] Connected to ${paths.socketPath}`);
-
-    // Start Slack app
-    await slack.start();
-    console.log('[Slack] Bot is running!');
-
-    // Announce presence
-    await relay.broadcast(`${BOT_NAME} online - bridging Slack ↔ Relay`);
-
-    console.log('\nReady! Mention the bot in Slack to interact.');
-    console.log('Messages from relay agents will be forwarded to Slack.\n');
-  } catch (err) {
-    console.error('Startup failed:', err);
-    process.exit(1);
-  }
-}
-
-// Graceful shutdown
-process.on('SIGINT', async () => {
-  console.log('\nShutting down...');
-  await relay.disconnect();
-  await slack.stop();
-  process.exit(0);
-});
-
-main();
diff --git a/examples/slack-claude-standalone.ts b/examples/slack-claude-standalone.ts
deleted file mode 100644
index f3d3674e8..000000000
--- a/examples/slack-claude-standalone.ts
+++ /dev/null
@@ -1,88 +0,0 @@
-#!/usr/bin/env npx ts-node
-/**
- * Standalone Slack Claude Bot
- *
- * Minimal Slack bot using Claude Code CLI - no agent-relay required.
- * Uses your Claude Code subscription (no API costs).
- *
- * Setup:
- *   1. Create Slack app: https://api.slack.com/apps
- *   2. Enable Socket Mode → get App Token (xapp-...)
- *   3. OAuth & Permissions → add scopes: app_mentions:read, chat:write
- *   4. Install to workspace → get Bot Token (xoxb-...)
- *   5. Event Subscriptions → Subscribe to: app_mention
- *   6. Ensure `claude` CLI is logged in: `claude auth login`
- *
- * Run:
- *   SLACK_BOT_TOKEN=xoxb-... SLACK_APP_TOKEN=xapp-... ./examples/slack-claude-standalone.ts
- */
-
-import { App } from '@slack/bolt';
-import { spawn } from 'child_process';
-
-const app = new App({
-  token: process.env.SLACK_BOT_TOKEN!,
-  appToken: process.env.SLACK_APP_TOKEN!,
-  socketMode: true,
-});
-
-// Conversation history per thread
-const threads = new Map<string, Array<{ role: string; text: string }>>();
-
-async function askClaude(prompt: string, history: Array<{ role: string; text: string }> = []): Promise<string> {
-  // Build context from history
-  let fullPrompt = prompt;
-  if (history.length > 0) {
-    const context = history.map((m) => `${m.role}: ${m.text}`).join('\n');
-    fullPrompt = `Previous conversation:\n${context}\n\nUser: ${prompt}`;
-  }
-
-  return new Promise((resolve, reject) => {
-    const claude = spawn('claude', ['--print', fullPrompt], {
-      stdio: ['pipe', 'pipe', 'pipe'],
-    });
-
-    let output = '';
-    claude.stdout.on('data', (d) => (output += d));
-    claude.stderr.on('data', (d) => console.error('[claude stderr]', d.toString()));
-
-    claude.on('close', (code) => {
-      code === 0 ? resolve(output.trim()) : reject(new Error(`Exit ${code}`));
-    });
-
-    setTimeout(() => {
-      claude.kill();
-      reject(new Error('Timeout'));
-    }, 120000);
-  });
-}
-
-app.event('app_mention', async ({ event, say }) => {
-  const threadId = event.thread_ts || event.ts;
-  const text = event.text.replace(/<@[A-Z0-9]+>/g, '').trim();
-
-  console.log(`[${new Date().toISOString()}] @mention: "${text}"`);
-
-  // Get thread history
-  const history = threads.get(threadId) || [];
-
-  try {
-    const response = await askClaude(text, history);
-
-    // Update history (keep last 10 exchanges)
-    history.push({ role: 'User', text });
-    history.push({ role: 'Claude', text: response });
-    threads.set(threadId, history.slice(-20));
-
-    await say({ text: response, thread_ts: threadId });
-  } catch (err) {
-    console.error('Error:', err);
-    await say({ text: `Error: ${err}`, thread_ts: threadId });
-  }
-});
-
-(async () => {
-  await app.start();
-  console.log('⚡ Slack Claude bot running (using Claude Code subscription)');
-  console.log('   Mention the bot in any channel to chat!');
-})();
diff --git a/examples/slack-codex-standalone.ts b/examples/slack-codex-standalone.ts
deleted file mode 100644
index b79cbc4f3..000000000
--- a/examples/slack-codex-standalone.ts
+++ /dev/null
@@ -1,81 +0,0 @@
-#!/usr/bin/env npx ts-node
-/**
- * Standalone Slack Codex Bot
- *
- * Minimal Slack bot using OpenAI Codex CLI.
- *
- * Setup:
- *   1. Install Codex CLI: npm install -g @openai/codex
- *   2. Login: codex auth login
- *   3. Create Slack app with Socket Mode (see README)
- *
- * Run:
- *   SLACK_BOT_TOKEN=xoxb-... SLACK_APP_TOKEN=xapp-... npx ts-node examples/slack-codex-standalone.ts
- */
-
-import { App } from '@slack/bolt';
-import { spawn } from 'child_process';
-
-const app = new App({
-  token: process.env.SLACK_BOT_TOKEN!,
-  appToken: process.env.SLACK_APP_TOKEN!,
-  socketMode: true,
-});
-
-const threads = new Map<string, Array<{ role: string; text: string }>>();
-
-async function askCodex(prompt: string, history: Array<{ role: string; text: string }> = []): Promise<string> {
-  let fullPrompt = prompt;
-  if (history.length > 0) {
-    const context = history.map((m) => `${m.role}: ${m.text}`).join('\n');
-    fullPrompt = `Previous conversation:\n${context}\n\nUser: ${prompt}`;
-  }
-
-  return new Promise((resolve, reject) => {
-    // Use codex CLI with --print flag for non-interactive output
-    const codex = spawn('codex', ['--print', fullPrompt], {
-      stdio: ['pipe', 'pipe', 'pipe'],
-    });
-
-    let output = '';
-    codex.stdout.on('data', (d) => (output += d));
-    codex.stderr.on('data', (d) => console.error('[codex stderr]', d.toString()));
-
-    codex.on('close', (code) => {
-      code === 0 ? resolve(output.trim()) : reject(new Error(`Exit ${code}`));
-    });
-
-    setTimeout(() => {
-      codex.kill();
-      reject(new Error('Timeout'));
-    }, 120000);
-  });
-}
-
-app.event('app_mention', async ({ event, say }) => {
-  const threadId = event.thread_ts || event.ts;
-  const text = event.text.replace(/<@[A-Z0-9]+>/g, '').trim();
-
-  console.log(`[${new Date().toISOString()}] @mention: "${text}"`);
-
-  const history = threads.get(threadId) || [];
-
-  try {
-    const response = await askCodex(text, history);
-
-    history.push({ role: 'User', text });
-    history.push({ role: 'Codex', text: response });
-    threads.set(threadId, history.slice(-20));
-
-    await say({ text: response, thread_ts: threadId });
-  } catch (err) {
-    console.error('Error:', err);
-    await say({ text: `Error: ${err}`, thread_ts: threadId });
-  }
-});
-
-(async () => {
-  await app.start();
-  console.log('⚡ Slack Codex bot running');
-  console.log('   Mention the bot in any channel to chat!');
-})();
diff --git a/examples/team-config.json b/examples/team-config.json
deleted file mode 100644
index d20fdcd12..000000000
--- a/examples/team-config.json
+++ /dev/null
@@ -1,30 +0,0 @@
-{
-  "name": "agent-relay-release",
-  "project": "/Users/khaliqgant/Projects/prpm/agent-relay",
-  "agents": [
-    {
-      "name": "DocWriter",
-      "cli": "claude",
-      "role": "Documentation Lead",
-      "tasks": ["Review README.md", "Create examples/", "Document CLI commands"]
-    },
-    {
-      "name": "CodePolish",
-      "cli": "claude",
-      "role": "Code Quality Lead",
-      "tasks": ["Audit test coverage", "Add tests for team commands", "Improve error handling"]
-    },
-    {
-      "name": "DevOps",
-      "cli": "codex",
-      "role": "CI/CD Lead",
-      "tasks": ["Review package.json", "Add LICENSE", "Setup npm publish"]
-    },
-    {
-      "name": "Dashboard",
-      "cli": "gemini",
-      "role": "Web Dashboard Lead",
-      "tasks": ["Build real-time dashboard", "Add message visualization", "Integrate with CLI"]
-    }
-  ]
-}
diff --git a/fly.toml b/fly.toml
deleted file mode 100644
index 3b20df8de..000000000
--- a/fly.toml
+++ /dev/null
@@ -1,58 +0,0 @@
-# Fly.io configuration for Agent Relay Cloud
-# Main app: API + Landing Page + Dashboard
-
-app = "agent-relay"
-primary_region = "sjc"
-
-[build]
-  dockerfile = "Dockerfile"
-
-[deploy]
-  strategy = "rolling"
-  # Wait for new machine to become healthy before stopping old one
-  # This prevents downtime from bad deploys
-  wait_timeout = "5m"
-
-[env]
-  NODE_ENV = "production"
-  PORT = "3000"
-
-[http_service]
-  internal_port = 3000
-  force_https = true
-  auto_stop_machines = false
-  auto_start_machines = true
-  min_machines_running = 1
-  processes = ["app"]
-
-  [http_service.concurrency]
-    type = "requests"
-    hard_limit = 250
-    soft_limit = 200
-
-[[vm]]
-  cpu_kind = "shared"
-  cpus = 1
-  memory_mb = 512
-
-[checks]
-  [checks.health]
-    grace_period = "10s"
-    interval = "30s"
-    method = "get"
-    path = "/health"
-    port = 3000
-    timeout = "5s"
-    type = "http"
-
-# Secrets to set via `fly secrets set`:
-# - SESSION_SECRET (openssl rand -hex 32)
-# - DATABASE_URL (Neon or Fly Postgres connection string)
-# - REDIS_URL (Upstash Redis URL)
-# - GITHUB_CLIENT_ID
-# - GITHUB_CLIENT_SECRET
-# - VAULT_ENCRYPTION_KEY (openssl rand -hex 32)
-# - STRIPE_SECRET_KEY
-# - STRIPE_PUBLISHABLE_KEY
-# - STRIPE_WEBHOOK_SECRET
-# - FLY_API_TOKEN (for workspace provisioning)
diff --git a/package.json b/package.json
index 0bae0cdec..e75ca4a84 100644
--- a/package.json
+++ b/package.json
@@ -10,7 +10,6 @@
     "install.sh",
     "scripts/postinstall.js",
     "scripts/build-cjs.mjs",
-    "relay-snippets",
     "LICENSE",
     "README.md"
   ],
diff --git a/relay-snippets/agent-policy-snippet.md b/relay-snippets/agent-policy-snippet.md
deleted file mode 100644
index 8d85700da..000000000
--- a/relay-snippets/agent-policy-snippet.md
+++ /dev/null
@@ -1,40 +0,0 @@
-# Agent Policy
-
-You are operating under organizational agent policies. These policies govern your interactions with other agents and tools.
-
-## Your Permissions
-
-Check the policy service for your specific permissions. If no explicit restrictions are defined, you have full permissions.
-
-## General Rules
-
-1. **Spawn Authorization**: Only spawn agents you are authorized to spawn. Check with Lead before spawning if unsure.
-
-2. **Message Routing**: Only message agents you are authorized to communicate with. Use proper channels.
-
-3. **Tool Usage**: Only use tools you are authorized to use. Read-only operations are generally safer.
-
-4. **Rate Limits**: Respect rate limits on messages. Don't spam other agents.
-
-## Restricted Agents
-
-Workers and non-lead agents typically have these restrictions:
-- Cannot spawn other agents without Lead approval
-- Can only message Lead, Coordinator, and their assigned peers
-- Limited to read-only tools unless explicitly granted write access
-
-## Lead Agents
-
-Lead agents typically have elevated permissions:
-- Can spawn Worker agents
-- Can message all agents
-- Can use all tools
-- Responsible for enforcing policy on spawned agents
-
-## Enforcement
-
-Policy violations are blocked at runtime. If your action is blocked, you'll receive a denial message explaining why. Do not attempt to circumvent policy restrictions.
-
-## Checking Your Policy
-
-To see your current policy, ask Lead or check the dashboard at `/api/policy/:workspaceId`.
diff --git a/relay-snippets/agent-relay-protocol.md b/relay-snippets/agent-relay-protocol.md
deleted file mode 100644
index 14599c5d7..000000000
--- a/relay-snippets/agent-relay-protocol.md
+++ /dev/null
@@ -1,64 +0,0 @@
-# Agent Relay Protocol (Internal)
-
-Advanced features for session continuity and trajectory tracking.
-
-## Session Continuity
-
-Use `mcp__relaycast__message_dm_send` with a continuity message to save state for session recovery:
-
-```
-mcp__relaycast__message_dm_send(to: "system", text: "KIND: continuity\nACTION: save\n\nCurrent task: Implementing user authentication\nCompleted: User model, JWT utils\nIn progress: Login endpoint")
-```
-
-### When to Save
-
-- Before long-running operations (builds, tests)
-- When switching task areas
-- Every 15-20 minutes of active work
-- Before ending session
-
-## Work Trajectories
-
-Record your work as a trajectory for future agents.
-
-### Starting Work
-
-```bash
-trail start "Implement user authentication"
-trail start "Fix login bug" --task "agent-relay-123"
-```
-
-### Recording Decisions
-
-```bash
-trail decision "Chose JWT over sessions" --reasoning "Stateless scaling"
-trail decision "Used existing auth middleware"
-```
-
-### Completing Work
-
-```bash
-trail complete --summary "Added JWT auth" --confidence 0.85
-```
-
-Confidence: 0.9+ (high), 0.7-0.9 (good), 0.5-0.7 (some uncertainty), <0.5 (needs review)
-
-### Abandoning Work
-
-```bash
-trail abandon --reason "Blocked by missing credentials"
-```
-
-## Cross-Project Messaging
-
-In bridge mode, use `project:agent` format with `mcp__relaycast__message_dm_send`:
-
-```
-mcp__relaycast__message_dm_send(to: "frontend:Designer", text: "Please update the login UI.")
-```
-
-Special targets:
-
-- `project:lead` - Lead agent of that project
-- `project:*` - Broadcast to project
-- `*:*` - Broadcast to all projects
diff --git a/relay-snippets/agent-relay-snippet.md b/relay-snippets/agent-relay-snippet.md
deleted file mode 100644
index 189254c28..000000000
--- a/relay-snippets/agent-relay-snippet.md
+++ /dev/null
@@ -1,117 +0,0 @@
-# Agent Relay
-
-Real-time agent-to-agent messaging via MCP tools.
-
-## MCP Tools
-
-All agent communication uses MCP tools provided by the Relaycast MCP server.
-Tool names use dot-notation: Claude uses `mcp__relaycast__<category>_<action>`, other CLIs use `relaycast.<category>.<action>`.
-
-| Tool                              | Description                           |
-| --------------------------------- | ------------------------------------- |
-| `message.dm.send(to, text)`       | Send a DM to an agent                 |
-| `message_post(channel, text)`     | Post a message to a channel           |
-| `message.inbox.check()`           | Check your inbox for new messages     |
-| `agent_list()`                    | List online agents                    |
-| `agent_add(name, cli, task)`      | Spawn a new worker agent              |
-| `agent_remove(name)`              | Release/stop a worker agent           |
-
-## Sending Messages
-
-### Direct Messages
-
-```
-mcp__relaycast__message_dm_send(to: "Bob", text: "Can you review my code changes?")
-```
-
-### Channel Messages
-
-```
-mcp__relaycast__message_post(channel: "general", text: "The API endpoints are ready")
-```
-
-## Spawning & Releasing Agents
-
-### Spawn a Worker
-
-```
-mcp__relaycast__agent_add(name: "WorkerName", cli: "claude", task: "Task description here")
-```
-
-### CLI Options
-
-| CLI Value   | Description                  |
-| ----------- | ---------------------------- |
-| `claude`    | Claude Code (Anthropic)      |
-| `codex`     | Codex CLI (OpenAI)           |
-| `gemini`    | Gemini CLI (Google)          |
-| `opencode`  | OpenCode CLI (multi-model)   |
-| `aider`     | Aider coding assistant       |
-| `goose`     | Goose AI assistant           |
-
-### Release a Worker
-
-```
-mcp__relaycast__agent_remove(name: "WorkerName")
-```
-
-## Receiving Messages
-
-Messages appear as:
-
-```
-Relay message from Alice [abc123]: Content here
-```
-
-Channel messages include `[#channel]`:
-
-```
-Relay message from Alice [abc123] [#general]: Hello!
-```
-
-Reply to the channel shown, not the sender.
-
-## When You Are Spawned
-
-If you were spawned by another agent:
-
-1. Your first message is your task from your spawner
-2. Use `mcp__relaycast__message_dm_send` to reply to your spawner
-3. Report status to your spawner (your lead), not broadcast
-
-```
-mcp__relaycast__message_dm_send(to: "Lead", text: "ACK: Starting on the task.")
-```
-
-## Protocol
-
-- **ACK** when you receive a task: `ACK: Brief description`
-- **DONE** when complete: `DONE: What was accomplished`
-- Send status to your **lead**, not broadcast
-
-## Agent Naming (Local vs Bridge)
-
-**Local communication** uses plain agent names. The `project:` prefix is **ONLY** for cross-project bridge mode.
-
-| Context                | Correct                                                     | Incorrect                                              |
-| ---------------------- | ----------------------------------------------------------- | ------------------------------------------------------ |
-| Local (same project)   | `mcp__relaycast__message_dm_send(to: "Lead", ...)`          | `mcp__relaycast__message_dm_send(to: "project:lead", ...)`     |
-| Bridge (cross-project) | `mcp__relaycast__message_dm_send(to: "frontend:Designer", ...)` | N/A                                                    |
-
-## Multi-Workspace
-
-When connected to multiple workspaces, messages include workspace context:
-
-```
-Relay message from Alice [my-team / abc123]: Hello!
-```
-
-- Messages are scoped to the originating workspace
-- Reply within the same workspace context shown in the message header
-
-## Checking Status
-
-```
-mcp__relaycast__agent_list()          # List online agents
-mcp__relaycast__message_inbox_check() # Check for unread messages
-```
diff --git a/specs/cli-native-plugins.md b/specs/cli-native-plugins.md
deleted file mode 100644
index 0c6757455..000000000
--- a/specs/cli-native-plugins.md
+++ /dev/null
@@ -1,1293 +0,0 @@
-# CLI Native Plugins — Implementation Spec
-
-**Covers**: OpenCode Plugin, Claude Code Plugin, Gemini CLI Extension (Codex deferred)
-**Status**: Draft
-**Date**: 2026-03-13
-**Author**: Design session (human + Claude)
-
----
-
-## 1. Vision
-
-Native plugins for **OpenCode, Claude Code, and Gemini CLI** that enable spawning and coordinating multiple instances communicating via Agent Relay. Unlike oh-my-openagent's one-way parent→child model, this unlocks full peer-to-peer messaging between independent sessions — across tools, across processes, across machines. All brokerless via Relaycast.
-
-### What This Unlocks
-
-1. **Native multi-instance orchestration** — Spawn additional CLI instances from within a session, each with its own task, all communicating via Relay.
-2. **Peer-to-peer messaging** — Any instance can DM any other, post to channels, and participate in threads. Not limited to parent→child.
-3. **Cross-tool interop** — OpenCode instances can communicate with Claude Code, Gemini CLI, Codex, Aider, or any other agent on the same Relay workspace.
-4. **Zero-config for end users** — One install command and you're on the relay.
-
-### Differentiation from oh-my-openagent
-
-| | oh-my-openagent | Relay Plugin |
-|---|---|---|
-| Architecture | In-process sub-sessions | Independent processes |
-| Communication | One-way parent→child | Full peer-to-peer (DMs, channels, threads) |
-| Cross-tool | OpenCode ↔ OpenCode only | OpenCode ↔ any CLI agent |
-| Discovery | Parent knows children | All agents visible via `relay_agents` |
-| Persistence | Session-scoped | Relaycast-backed (survives restarts) |
-
----
-
-## 2. Plugin Structure
-
-```
-opencode-relay-plugin/
-├── package.json
-├── tsconfig.json
-├── src/
-│   └── index.ts          # Plugin entry point — exports tools + hooks
-├── README.md
-└── tests/
-    ├── tools.test.ts
-    ├── spawn.test.ts
-    └── polling.test.ts
-```
-
-### package.json
-
-```json
-{
-  "name": "opencode-relay-plugin",
-  "version": "0.1.0",
-  "description": "Agent Relay plugin for OpenCode — multi-instance messaging and orchestration",
-  "main": "dist/index.js",
-  "type": "module",
-  "keywords": ["opencode-plugin", "agent-relay", "multi-agent"],
-  "peerDependencies": {
-    "opencode": ">=0.1.0"
-  }
-}
-```
-
-> **Architecture note**: Unlike the Claude Code and Gemini plugins which use an MCP server for transport, OpenCode's native `tool()` API means the plugin talks to Relaycast directly via HTTP. There is no WebSocket client — the plugin polls via HTTP on `relay_inbox` calls and via the `session.idle` hook. This keeps the dependency footprint minimal (no `ws` package) and aligns with the poll-based pattern used by the hooks on all three platforms.
-
----
-
-## 3. Plugin Entry Point
-
-OpenCode plugins export an async function that receives a plugin context with `tool()`, `hook()`, and other helpers.
-
-```typescript
-// src/index.ts
-import type { PluginContext } from 'opencode';
-
-export default async function relayPlugin(ctx: PluginContext) {
-  // ── State ──
-  const state = new RelayState();
-
-  // ── Tools ──
-  registerTools(ctx, state);
-
-  // ── Hooks ──
-  registerHooks(ctx, state);
-}
-```
-
-### RelayState
-
-```typescript
-class RelayState {
-  agentName: string | null = null;
-  workspace: string | null = null;
-  token: string | null = null;
-  spawned: Map<string, SpawnedAgent> = new Map();
-  connected = false;
-}
-
-interface Message {
-  id: string;
-  from: string;
-  text: string;
-  channel?: string;
-  thread?: string;
-  ts: string;
-}
-
-interface SpawnedAgent {
-  name: string;
-  process: ChildProcess;
-  task: string;
-  status: 'running' | 'done' | 'error';
-}
-```
-
----
-
-## 4. Tools
-
-Six tools exposed to the LLM via OpenCode's `tool()` API.
-
-### 4.1 `relay_connect`
-
-Connects to a Relay workspace. Must be called before other tools.
-
-```typescript
-ctx.tool({
-  name: 'relay_connect',
-  description: 'Connect to an Agent Relay workspace. Call this first.',
-  schema: {
-    type: 'object',
-    properties: {
-      workspace: { type: 'string', description: 'Workspace key (rk_live_...)' },
-      name: { type: 'string', description: 'Your agent name on the relay' },
-    },
-    required: ['workspace', 'name'],
-  },
-  async handler({ workspace, name }) {
-    state.workspace = workspace;
-    state.agentName = name;
-
-    // Register with Relaycast via HTTP
-    const res = await fetch(`https://www.relaycast.dev/api/v1/register`, {
-      method: 'POST',
-      headers: { 'Content-Type': 'application/json' },
-      body: JSON.stringify({ workspace, name, cli: 'opencode' }),
-    });
-    const data = await res.json();
-    state.token = data.token;
-
-    state.connected = true;
-    return { ok: true, name, workspace: workspace.slice(0, 12) + '...' };
-  },
-});
-```
-
-### 4.2 `relay_send`
-
-Send a DM to another agent.
-
-```typescript
-ctx.tool({
-  name: 'relay_send',
-  description: 'Send a direct message to another agent on the relay.',
-  schema: {
-    type: 'object',
-    properties: {
-      to: { type: 'string', description: 'Recipient agent name' },
-      text: { type: 'string', description: 'Message content' },
-    },
-    required: ['to', 'text'],
-  },
-  async handler({ to, text }) {
-    assertConnected(state);
-    await relaycastAPI(state, 'dm/send', { to, text });
-    return { sent: true, to };
-  },
-});
-```
-
-### 4.3 `relay_inbox`
-
-Check for new messages.
-
-```typescript
-ctx.tool({
-  name: 'relay_inbox',
-  description: 'Check your inbox for new messages from other agents.',
-  schema: { type: 'object', properties: {} },
-  async handler() {
-    assertConnected(state);
-    // Poll Relaycast HTTP API for new messages
-    const data = await relaycastAPI(state, 'inbox/check', {});
-    const messages = data.messages || [];
-    return { count: messages.length, messages };
-  },
-});
-```
-
-### 4.4 `relay_post`
-
-Post a message to a channel.
-
-```typescript
-ctx.tool({
-  name: 'relay_post',
-  description: 'Post a message to a relay channel.',
-  schema: {
-    type: 'object',
-    properties: {
-      channel: { type: 'string', description: 'Channel name' },
-      text: { type: 'string', description: 'Message content' },
-    },
-    required: ['channel', 'text'],
-  },
-  async handler({ channel, text }) {
-    assertConnected(state);
-    await relaycastAPI(state, 'message/post', { channel, text });
-    return { posted: true, channel };
-  },
-});
-```
-
-### 4.5 `relay_agents`
-
-List online agents.
-
-```typescript
-ctx.tool({
-  name: 'relay_agents',
-  description: 'List all agents currently on the relay.',
-  schema: { type: 'object', properties: {} },
-  async handler() {
-    assertConnected(state);
-    const data = await relaycastAPI(state, 'agent/list', {});
-    return { agents: data.agents };
-  },
-});
-```
-
-### 4.6 `relay_spawn`
-
-Spawn a new OpenCode instance as a worker agent.
-
-```typescript
-ctx.tool({
-  name: 'relay_spawn',
-  description:
-    'Spawn a new OpenCode instance as a worker agent on the relay. ' +
-    'The worker runs independently and can communicate with any agent.',
-  schema: {
-    type: 'object',
-    properties: {
-      name: { type: 'string', description: 'Worker agent name' },
-      task: { type: 'string', description: 'Task for the worker' },
-      dir: {
-        type: 'string',
-        description: 'Working directory (defaults to current)',
-      },
-      model: {
-        type: 'string',
-        description: 'Model override (e.g., "claude-sonnet-4-6")',
-      },
-    },
-    required: ['name', 'task'],
-  },
-  async handler({ name, task, dir, model }) {
-    assertConnected(state);
-
-    // Register worker with Relaycast
-    await relaycastAPI(state, 'agent/add', {
-      name,
-      cli: 'opencode',
-      task,
-    });
-
-    // Build the system prompt that bootstraps the worker onto the relay
-    // NOTE: workspace key is passed via env var, NOT in the prompt (security)
-    const systemPrompt = [
-      `You are ${name}, a worker agent on Agent Relay.`,
-      `Your task: ${task}`,
-      ``,
-      `IMPORTANT: At the start, call relay_connect with:`,
-      `  workspace: (read from RELAY_WORKSPACE env var)`,
-      `  name: "${name}"`,
-      ``,
-      `Then send a DM to "${state.agentName}" with "ACK: <your understanding of the task>".`,
-      `When done, send "DONE: <summary>" to "${state.agentName}".`,
-    ].join('\n');
-
-    // Spawn OpenCode process
-    const args = ['--prompt', systemPrompt];
-    if (dir) args.push('--dir', dir);
-    if (model) args.push('--model', model);
-
-    const proc = spawn('opencode', args, {
-      cwd: dir || process.cwd(),
-      stdio: 'pipe',
-      detached: true,
-      env: {
-        ...process.env,
-        RELAY_WORKSPACE: state.workspace!,
-        RELAY_AGENT_NAME: name,
-      },
-    });
-
-    state.spawned.set(name, {
-      name,
-      process: proc,
-      task,
-      status: 'running',
-    });
-
-    proc.on('exit', (code) => {
-      const agent = state.spawned.get(name);
-      if (agent) {
-        agent.status = code === 0 ? 'done' : 'error';
-      }
-    });
-
-    return {
-      spawned: true,
-      name,
-      pid: proc.pid,
-      hint: `Worker "${name}" is starting. It will ACK via DM when ready.`,
-    };
-  },
-});
-```
-
-### 4.7 `relay_dismiss`
-
-Stop and release a spawned worker.
-
-```typescript
-ctx.tool({
-  name: 'relay_dismiss',
-  description: 'Stop and release a spawned worker agent.',
-  schema: {
-    type: 'object',
-    properties: {
-      name: { type: 'string', description: 'Worker name to dismiss' },
-    },
-    required: ['name'],
-  },
-  async handler({ name }) {
-    assertConnected(state);
-
-    const agent = state.spawned.get(name);
-    if (agent && agent.status === 'running') {
-      agent.process.kill('SIGTERM');
-    }
-
-    await relaycastAPI(state, 'agent/remove', { name });
-    state.spawned.delete(name);
-    return { dismissed: true, name };
-  },
-});
-```
-
----
-
-## 5. Hooks
-
-> **⚠ OpenCode hook API is provisional.** The hook event names below (`session.idle`, `session.compacting`, `session.end`) are based on early documentation and may change. Verify against the latest OpenCode plugin API before implementation. If OpenCode does not support these hooks natively, fall back to a polling-based approach using a background interval timer within the plugin entry point.
-
-### 5.1 `session.idle` — Inbound Message Polling
-
-When the LLM is idle (waiting for user input), poll for inbound messages and surface them.
-
-```typescript
-ctx.hook('session.idle', async () => {
-  if (!state.connected) return;
-
-  const data = await relaycastAPI(state, 'inbox/check', {});
-  const messages = data.messages || [];
-  if (messages.length === 0) return;
-
-  // Format messages for injection into the session
-  const formatted = messages
-    .map((m) => {
-      const prefix = m.channel
-        ? `Relay message from ${m.from} [#${m.channel}]`
-        : `Relay message from ${m.from}`;
-      return `${prefix}: ${m.text}`;
-    })
-    .join('\n\n');
-
-  return {
-    inject: formatted,
-    continue: true, // Keep the session active to process the messages
-  };
-});
-```
-
-### 5.2 `session.compacting` — Context Preservation
-
-When OpenCode compacts context, preserve relay state so the agent doesn't lose its identity.
-
-```typescript
-ctx.hook('session.compacting', async () => {
-  if (!state.connected) return;
-
-  const workers = Array.from(state.spawned.entries())
-    .map(([name, a]) => `  - ${name}: ${a.status} — "${a.task}"`)
-    .join('\n');
-
-  return {
-    preserve: [
-      `## Relay State (preserve across compaction)`,
-      `- Connected as: ${state.agentName}`,
-      `- Workspace: ${state.workspace?.slice(0, 16)}...`,
-      `- Spawned workers:\n${workers || '  (none)'}`,
-    ].join('\n'),
-  };
-});
-```
-
-### 5.3 `session.end` — Cleanup
-
-Gracefully disconnect and clean up spawned workers on session end.
-
-```typescript
-ctx.hook('session.end', async () => {
-  if (!state.connected) return;
-
-  // Terminate spawned workers
-  for (const [name, agent] of state.spawned) {
-    if (agent.status === 'running') {
-      agent.process.kill('SIGTERM');
-    }
-  }
-
-  state.connected = false;
-});
-```
-
----
-
-## 6. Helper Functions
-
-```typescript
-function assertConnected(state: RelayState) {
-  if (!state.connected) {
-    throw new Error(
-      'Not connected to Relay. Call relay_connect first.'
-    );
-  }
-}
-
-async function relaycastAPI(
-  state: RelayState,
-  endpoint: string,
-  body: Record<string, unknown>
-): Promise<Record<string, unknown>> {
-  const res = await fetch(
-    `https://www.relaycast.dev/api/v1/${endpoint}`,
-    {
-      method: 'POST',
-      headers: {
-        'Content-Type': 'application/json',
-        Authorization: `Bearer ${state.token}`,
-      },
-      body: JSON.stringify(body),
-    }
-  );
-  if (!res.ok) {
-    throw new Error(`Relay API error: ${res.status} ${await res.text()}`);
-  }
-  return res.json();
-}
-```
-
----
-
-## 7. Usage Examples
-
-### Basic: Two OpenCode Instances Collaborating
-
-**User session (Lead):**
-```
-> Connect to relay workspace rk_live_abc123 as "Lead"
-> Spawn a worker called "Researcher" to investigate the auth module
-> Wait for their findings, then spawn "Implementer" to build the fix
-```
-
-The Lead calls `relay_connect`, then `relay_spawn` for "Researcher". Researcher boots, connects, ACKs, does work, sends `DONE: Found the issue in auth/middleware.ts — token expiry not checked`. Lead reads this via `relay_inbox`, then spawns "Implementer" with context from Researcher.
-
-### Cross-Tool: OpenCode + Claude Code
-
-```
-> Connect to relay as "Analyzer"
-> Send a DM to "CodeReviewer" (a Claude Code instance) asking them to review my changes
-```
-
-The OpenCode Analyzer and a Claude Code instance (connected via Relaycast MCP) communicate seamlessly — same workspace, same DM/channel primitives.
-
-### Fan-Out Pattern
-
-```
-> Spawn 3 workers: "TestWriter", "DocWriter", "Linter"
-> Give each their task, wait for all to ACK, then monitor progress
-```
-
----
-
-## 8. Implementation Phases
-
-### Phase 1: OpenCode Core Tools
-
-**Scope**: `relay_connect`, `relay_send`, `relay_inbox`, `relay_agents`, `relay_post`
-
-**Tests**:
-- Unit: Each tool handler with mocked HTTP
-- Integration: Connect to test workspace, send/receive a round-trip message
-
-**Exit criteria**: Can connect, send DMs, check inbox, list agents, post to channels.
-
-### Phase 2: OpenCode Spawn & Dismiss
-
-**Scope**: `relay_spawn`, `relay_dismiss`
-
-**Tests**:
-- Unit: Process spawning with mocked `spawn()`
-- Integration: Spawn a real OpenCode instance, verify it connects and ACKs
-- Lifecycle: Spawn → ACK → DONE → dismiss flow
-
-**Exit criteria**: Can spawn workers that self-bootstrap onto the relay and communicate back.
-
-### Phase 3: OpenCode Hooks & Polish
-
-**Scope**: Idle polling, context preservation, cleanup (hook names provisional — see Section 5 note)
-
-**Tests**:
-- Unit: Idle hook surfaces messages correctly
-- Unit: Compacting hook preserves relay state
-- Integration: Full lifecycle — connect, spawn workers, receive messages during idle, compact without losing state, clean up on exit
-
-**Exit criteria**: Messages surface automatically during idle. Context survives compaction. Clean shutdown.
-
-### Phase 4: OpenCode Distribution
-
-**Scope**: npm publish, OpenCode plugin registry submission, documentation
-
-**Exit criteria**: `opencode plugin add agent-relay` works. README covers all tools and examples.
-
----
-
-## 9. Testing Strategy
-
-### Test Fixtures
-
-```typescript
-class MockRelayServer {
-  messages: Message[] = [];
-  agents: string[] = [];
-
-  /** Simulate an inbound message */
-  injectMessage(from: string, text: string) {
-    this.messages.push({ id: crypto.randomUUID(), from, text, ts: new Date().toISOString() });
-  }
-
-  /** Mock HTTP handler */
-  async handle(endpoint: string, body: Record<string, unknown>) {
-    switch (endpoint) {
-      case 'dm/send':
-        this.messages.push({ id: crypto.randomUUID(), from: 'self', text: body.text as string, ts: new Date().toISOString() });
-        return { ok: true };
-      case 'inbox/check':
-        const msgs = [...this.messages];
-        this.messages = [];
-        return { messages: msgs };
-      case 'agent/list':
-        return { agents: this.agents };
-      case 'register':
-        return { token: 'test-token-123' };
-      default:
-        return { ok: true };
-    }
-  }
-}
-```
-
-### Test Matrix
-
-| Test | Phase | Type | Description |
-|------|-------|------|-------------|
-| connect-success | 1 | unit | Registers via HTTP and sets token |
-| connect-bad-workspace | 1 | unit | Rejects invalid workspace key |
-| send-dm | 1 | unit | Sends DM via API |
-| send-not-connected | 1 | unit | Throws if not connected |
-| inbox-poll | 1 | unit | Polls HTTP API and returns messages |
-| agents-list | 1 | unit | Returns agent list |
-| post-channel | 1 | unit | Posts to channel |
-| spawn-worker | 2 | unit | Spawns process with correct args |
-| spawn-env-vars | 2 | unit | Workspace key passed via env var, not in prompt |
-| spawn-exit-tracking | 2 | unit | Tracks worker exit status |
-| dismiss-running | 2 | unit | Kills running process |
-| dismiss-already-done | 2 | unit | Handles already-exited worker |
-| idle-no-messages | 3 | unit | Returns nothing when inbox empty |
-| idle-surfaces-messages | 3 | unit | Formats and injects messages |
-| compacting-preserves | 3 | unit | State string includes all workers |
-| end-cleanup | 3 | unit | Terminates workers and disconnects |
-| round-trip | all | integration | Full send → receive cycle |
-| spawn-ack-flow | all | integration | Spawn → ACK → DONE lifecycle |
-
----
-
-## 10. Performance Constraints
-
-| Metric | Target |
-|--------|--------|
-| `relay_connect` latency | < 1s (HTTP register) |
-| `relay_send` latency | < 500ms |
-| `relay_inbox` latency | < 500ms (HTTP poll) |
-| `relay_spawn` to worker ACK | < 15s (includes CLI boot) |
-| Hook inbox poll interval | 3s min between checks (configurable) |
-| BeforeModel rate limit | 5s min between inbox checks |
-| HTTP retry backoff | 1s, 2s, 4s (max 3 attempts) |
-
----
-
-## 11. Error Handling
-
-| Error | Behavior |
-|-------|----------|
-| HTTP request failure | Retry with exponential backoff, max 3 attempts. Return error on final failure. |
-| Workspace key invalid | Throw with clear message: "Invalid workspace key. Get one at relaycast.dev" |
-| Agent name taken | Throw: "Agent name already registered. Choose a different name." |
-| Concurrent name registration | First registration wins. Second attempt gets "name taken" error. Callers should add a suffix and retry (e.g., `Worker-1` → `Worker-1a`). |
-| Spawn failure | Return error with stderr. Don't crash parent session. |
-| Worker crash | Update status to 'error'. Don't auto-restart. Notify via next inbox check. |
-| API rate limit | Retry with backoff, max 3 attempts. |
-
----
-
-## 12. Relaycast API Contract
-
-The OpenCode plugin talks to Relaycast via HTTP. The Claude Code and Gemini plugins use the Relaycast MCP server which handles transport internally. No new backend endpoints required.
-
-> **Note**: The endpoint paths below mirror the MCP tool names. Verify these against the actual Relaycast API — the MCP server may use different internal endpoints. The `/api/v1/register` endpoint in particular needs confirmation as it may be handled differently (e.g., via the MCP server's own registration flow).
-
-| Endpoint | Method | Body | Response |
-|----------|--------|------|----------|
-| `/api/v1/register` | POST | `{ workspace, name, cli }` | `{ token }` |
-| `/api/v1/dm/send` | POST | `{ to, text }` | `{ ok }` |
-| `/api/v1/inbox/check` | POST | `{}` | `{ messages }` |
-| `/api/v1/message/post` | POST | `{ channel, text }` | `{ ok }` |
-| `/api/v1/agent/list` | POST | `{}` | `{ agents }` |
-| `/api/v1/agent/add` | POST | `{ name, cli, task }` | `{ ok }` |
-| `/api/v1/agent/remove` | POST | `{ name }` | `{ ok }` |
-
----
-
-## 13. Claude Code Companion Plugin
-
-### Architecture: Brokerless by Default
-
-Claude Code + Relaycast MCP achieves **full agent-to-agent communication without a broker**. The Relaycast MCP server connects directly to Relaycast over WebSocket. Any Claude Code instance with the MCP server configured can:
-
-- Send DMs (`mcp__relaycast__dm_send`)
-- Post to channels (`mcp__relaycast__message_post`)
-- Check inbox (`mcp__relaycast__inbox_check`)
-- Spawn independent agents (`mcp__relaycast__agent_add`)
-- List/dismiss agents (`mcp__relaycast__agent_list`, `mcp__relaycast__agent_remove`)
-
-No `agent-relay-broker` binary needed. Each spawned Claude Code instance is fully independent with its own context, tools, and peer-to-peer messaging.
-
-### Hooks: Reliable Message Injection
-
-Claude Code's hook system provides multiple hook events with robust injection mechanisms. The relevant ones for Relay are listed below.
-
-#### Existing Implementation (already in this repo)
-
-The relay codebase already implements inbox injection via hooks:
-
-- **`packages/hooks/src/inbox-check/hook.ts`** — `Stop` hook that checks inbox when Claude tries to stop. Returns `{"decision": "block", "reason": "You have N unread messages..."}` to force Claude to continue and process them.
-- **`src/hooks/check-inbox.sh`** — `PostToolUse` hook that checks inbox after every tool call for more frequent polling during active work.
-
-#### Hook-Based Injection Strategy
-
-| Hook Event | When It Fires | Injection Mechanism | Use Case |
-|------------|---------------|---------------------|----------|
-| `Stop` | Claude tries to stop | `decision: "block"` + `reason` feeds messages as next instruction | Catch messages before going idle |
-| `PostToolUse` | After every tool call | `additionalContext` appended to context | Frequent polling during active work |
-| `SubagentStart` | Worker is spawned | `additionalContext` injects relay bootstrap | Auto-configure spawned workers |
-| `PreCompact` | Before context compaction | Preserve relay state string | Maintain identity across compaction |
-| `SessionEnd` | Session terminates | Cleanup spawned workers | Graceful shutdown |
-
-#### Stop Hook with Loop Guard
-
-The `Stop` hook input includes `stop_hook_active: boolean` — `true` when Claude is already continuing from a previous stop-block. This prevents infinite loops.
-
-> **⚠ Gap in existing implementation:** The current hook at `packages/hooks/src/inbox-check/hook.ts` does NOT check `stop_hook_active`. This means infinite loops are possible if messages keep arriving. The plugin must fix this.
-
-```typescript
-// Stop hook — with loop guard
-const input: HookInput = readStdin();
-
-// Guard: if we already blocked once this cycle, let Claude stop
-// to avoid infinite block→retry loops
-if (input.stop_hook_active) {
-  output({ decision: 'approve' });
-  return;
-}
-
-const messages = await checkInbox();
-if (messages.length > 0) {
-  output({
-    decision: 'block',
-    reason: `You have ${messages.length} unread relay message(s):\n${formatMessages(messages)}\nPlease read and respond.`
-  });
-} else {
-  output({ decision: 'approve' });
-}
-```
-
-#### PostToolUse Hook for Real-Time Polling
-
-For higher-frequency message checking during active work:
-
-```bash
-#!/bin/bash
-# PostToolUse hook — check inbox after every tool call
-MESSAGES=$(curl -s -H "Authorization: Bearer $RELAY_TOKEN" \
-  https://www.relaycast.dev/api/v1/inbox/check)
-
-COUNT=$(echo "$MESSAGES" | jq '.messages | length')
-if [ "$COUNT" -gt 0 ]; then
-  FORMATTED=$(echo "$MESSAGES" | jq -r '.messages[] | "Relay message from \(.from): \(.text)"')
-  echo "$FORMATTED"  # Plain text stdout → additionalContext
-fi
-```
-
-### Plugin Structure
-
-```
-claude-relay-plugin/
-├── plugin.json              # MCP server config + hook registration
-├── hooks/
-│   ├── stop-inbox.ts        # Stop hook: block if unread messages
-│   ├── post-tool-inbox.sh   # PostToolUse hook: frequent polling
-│   ├── subagent-bootstrap.sh # SubagentStart hook: inject relay config
-│   └── pre-compact.sh       # PreCompact hook: preserve relay state
-├── skills/
-│   ├── relay-team.md        # /relay-team — spawn a coordinated team
-│   ├── relay-fanout.md      # /relay-fanout — fan-out pattern
-│   └── relay-pipeline.md    # /relay-pipeline — sequential pipeline
-├── agents/
-│   └── relay-worker/        # Pre-built worker agent definition
-│       ├── agent.md         # Worker persona with ACK/DONE protocol
-│       └── config.json      # MCP + hook config for workers
-└── README.md
-```
-
-### plugin.json
-
-```json
-{
-  "name": "agent-relay",
-  "description": "Multi-agent communication via Agent Relay",
-  "mcp": {
-    "relaycast": {
-      "command": "npx",
-      "args": ["@relaycast/mcp"],
-      "env": {
-        "RELAY_WORKSPACE": "${RELAY_WORKSPACE}"
-      }
-    }
-  },
-  "hooks": {
-    "Stop": [{
-      "hooks": [{
-        "type": "command",
-        "command": "node ./hooks/stop-inbox.js"
-      }]
-    }],
-    "PostToolUse": [{
-      "hooks": [{
-        "type": "command",
-        "command": "./hooks/post-tool-inbox.sh"
-      }]
-    }],
-    "SubagentStart": [{
-      "hooks": [{
-        "type": "command",
-        "command": "./hooks/subagent-bootstrap.sh"
-      }]
-    }],
-    "PreCompact": [{
-      "hooks": [{
-        "type": "command",
-        "command": "./hooks/pre-compact.sh"
-      }]
-    }]
-  }
-}
-```
-
-### What the Plugin Provides
-
-| Capability | Without Plugin | With Plugin |
-|---|---|---|
-| **Tools (send/inbox/spawn)** | Manual MCP server config in settings.json | Auto-configured via plugin.json |
-| **Inbound message injection** | No automatic injection | Stop + PostToolUse hooks surface messages automatically |
-| **Worker spawning** | Works but workers need manual relay instructions | SubagentStart hook auto-bootstraps workers onto relay |
-| **Context preservation** | Relay state lost on compaction | PreCompact hook preserves agent identity + worker list |
-| **Common patterns** | Write orchestration prompts from scratch | `/relay-team`, `/relay-fanout`, `/relay-pipeline` skills |
-| **Worker personas** | Ad-hoc task prompts | Pre-built agent definition with ACK/DONE protocol |
-
-### Capability Comparison: OpenCode vs Claude Code
-
-| Capability | OpenCode Plugin | Claude Code Plugin |
-|---|---|---|
-| Tool definition | Native `tool()` API | MCP server (Relaycast) |
-| Spawn processes | Native `spawn()` | MCP `agent_add` tool |
-| Stop-time injection | `session.idle` hook (provisional) | `Stop` hook with `decision: "block"` |
-| Mid-work injection | Depends on OpenCode hook API | `PostToolUse` hook with `additionalContext` |
-| Worker bootstrap | System prompt + env vars | `SubagentStart` hook injects config |
-| Context preservation | `session.compacting` (provisional) | `PreCompact` hook |
-| Cleanup on exit | `session.end` (provisional) | `SessionEnd` hook |
-| Loop guard | TBD (depends on hook API) | `stop_hook_active` boolean |
-
-Both plugins target the same end-user experience: connect, send, receive, spawn, dismiss — all without a broker. Claude Code's hook system is more mature and documented. OpenCode's hook API needs verification; if mid-work injection hooks are unavailable, agents will only see messages when they explicitly call `relay_inbox` or at idle time.
-
-### Implementation Phases
-
-**Phase 5: Claude Code Core Plugin**
-- Package MCP config + Stop hook (with `stop_hook_active` guard) + PostToolUse hook
-- One-command install
-- Test: install → connect → send/receive round-trip
-
-**Phase 6: Claude Code Skills & Agents**
-- `/relay-team`, `/relay-fanout`, `/relay-pipeline` skills
-- Worker agent definition with ACK/DONE protocol
-- SubagentStart bootstrap hook
-- PreCompact state preservation + SessionEnd cleanup
-
-### Recommendation
-
-Build the Claude Code plugin in **parallel with Phase 3-4** of the OpenCode plugin, not after. The core hooks (Stop + PostToolUse inbox injection) already exist in this repo — the plugin is largely packaging and distribution. The skills and agent definitions are net-new but straightforward.
-
----
-
-## 14. Gemini CLI Extension
-
-### Architecture: Same Pattern, Richer Hooks
-
-> **Source**: Hook capabilities documented at [geminicli.com/docs/hooks/reference](https://geminicli.com/docs/hooks/reference/). Extension structure from [geminicli.com/docs/extensions/writing-extensions](https://geminicli.com/docs/extensions/writing-extensions). Gemini CLI's extension system is newer than Claude Code's — verify capabilities against latest docs before implementation.
-
-Gemini CLI extensions bundle MCP servers, hooks, commands, sub-agents, skills, and themes into a single installable package via `gemini extensions install <github-url>`. Like Claude Code, tools come via MCP servers — so Relaycast provides full send/inbox/spawn/dismiss without a broker.
-
-What makes Gemini interesting is the **hook system offers more injection points than Claude Code**:
-
-| Hook Event | When | Injection Mechanism | Relay Use Case |
-|------------|------|---------------------|----------------|
-| `AfterTool` | After every tool call | `additionalContext` appended to results | Frequent inbox polling (like Claude's PostToolUse) |
-| `AfterAgent` | After agent responds | `reason` forces retry with new instructions | Block stop + inject unread messages |
-| `BeforeAgent` | Before planning begins | `additionalContext` extends prompt | Inject relay context at turn start |
-| `BeforeModel` | Before LLM request | Modify `llm_request.messages` directly | Prepend inbox messages to next model call |
-| `BeforeToolSelection` | Before tool routing | Filter/whitelist tools | Context-aware tool availability |
-| `SessionStart` | Session begins | `additionalContext` loads initial context | Auto-connect to relay workspace |
-| `SessionEnd` | Session ends | Cleanup | Disconnect + terminate workers |
-| `PreCompress` | Before context compression | — | Preserve relay state |
-| `Notification` | System alerts | Logging | Log relay events |
-
-#### Key Advantages Over Claude Code
-
-1. **`BeforeModel` hook** — Can directly modify the LLM request messages array. This means we can prepend inbox messages to the very next model call, not just append context after a tool. This is the most reliable injection point possible.
-
-2. **`AfterAgent` with retry** — When the agent finishes its response, this hook can force a full retry with a new `reason`. This is equivalent to Claude's `Stop` hook but more explicit — the agent gets a clean retry with the inbox messages as its prompt.
-
-3. **`BeforeToolSelection` filtering** — Can dynamically whitelist/blacklist tools based on relay state. For example, hide `relay_spawn` if the agent has already hit a worker limit.
-
-4. **Matcher patterns** — Hooks can target specific MCP tools via regex: `"matcher": "mcp_relaycast_.*"` to only fire on relay tool calls.
-
-5. **Sub-agents as `.md` files** — Worker definitions live in `agents/` as markdown files, loaded natively by Gemini CLI. No MCP wrapper needed for agent personas.
-
-6. **Custom commands via TOML** — `/relay:status`, `/relay:team` as lightweight command shortcuts without needing full skill definitions.
-
-### Extension Structure
-
-```
-gemini-relay-extension/
-├── gemini-extension.json     # Manifest: MCP + hooks + settings
-├── relay-server.js           # MCP server (Relaycast proxy or direct)
-├── hooks/
-│   ├── hooks.json            # Hook registration
-│   ├── after-tool-inbox.sh   # AfterTool: poll inbox after each tool
-│   ├── after-agent-inbox.sh  # AfterAgent: block stop if unread messages
-│   ├── before-model-inject.sh # BeforeModel: prepend inbox to next call
-│   ├── session-start.sh      # SessionStart: auto-connect
-│   └── session-end.sh        # SessionEnd: cleanup
-├── commands/
-│   ├── status/
-│   │   └── status.toml       # /relay:status — show connection + workers
-│   ├── team/
-│   │   └── team.toml         # /relay:team — spawn a coordinated team
-│   └── fanout/
-│       └── fanout.toml       # /relay:fanout — fan-out pattern
-├── skills/
-│   ├── relay-orchestration/
-│   │   └── SKILL.md          # Multi-agent orchestration patterns
-│   └── relay-protocol/
-│       └── SKILL.md          # ACK/DONE communication protocol
-├── agents/
-│   ├── relay-worker.md       # Generic worker sub-agent
-│   ├── relay-researcher.md   # Research-focused worker
-│   └── relay-reviewer.md     # Code review worker
-├── GEMINI.md                 # Context: relay instructions for the LLM
-├── package.json
-└── README.md
-```
-
-### gemini-extension.json
-
-```json
-{
-  "name": "agent-relay",
-  "version": "0.1.0",
-  "description": "Multi-agent communication via Agent Relay",
-  "contextFileName": "GEMINI.md",
-  "settings": [
-    {
-      "name": "Workspace Key",
-      "description": "Your Relay workspace key (rk_live_...)",
-      "envVar": "RELAY_WORKSPACE",
-      "sensitive": true
-    }
-  ],
-  "mcpServers": {
-    "relaycast": {
-      "command": "node",
-      "args": ["${extensionPath}${/}relay-server.js"],
-      "cwd": "${extensionPath}"
-    }
-  }
-}
-```
-
-### hooks/hooks.json
-
-```json
-{
-  "AfterTool": [
-    {
-      "matcher": ".*",
-      "hooks": [{
-        "type": "command",
-        "command": "sh ${extensionPath}/hooks/after-tool-inbox.sh",
-        "name": "Relay Inbox Poll",
-        "timeout": 5000
-      }]
-    }
-  ],
-  "AfterAgent": [
-    {
-      "hooks": [{
-        "type": "command",
-        "command": "sh ${extensionPath}/hooks/after-agent-inbox.sh",
-        "name": "Relay Stop Guard"
-      }]
-    }
-  ],
-  "BeforeModel": [
-    {
-      "hooks": [{
-        "type": "command",
-        "command": "sh ${extensionPath}/hooks/before-model-inject.sh",
-        "name": "Relay Message Injection"
-      }]
-    }
-  ],
-  "SessionStart": [
-    {
-      "hooks": [{
-        "type": "command",
-        "command": "sh ${extensionPath}/hooks/session-start.sh",
-        "name": "Relay Auto-Connect"
-      }]
-    }
-  ],
-  "SessionEnd": [
-    {
-      "hooks": [{
-        "type": "command",
-        "command": "sh ${extensionPath}/hooks/session-end.sh",
-        "name": "Relay Cleanup"
-      }]
-    }
-  ]
-}
-```
-
-### Hook Implementations
-
-#### AfterTool — Frequent Inbox Polling
-
-```bash
-#!/bin/bash
-# Fires after every tool call — check for new messages
-TOKEN=$(cat ~/.relay/token 2>/dev/null) || exit 0
-MESSAGES=$(curl -s -H "Authorization: Bearer $TOKEN" \
-  https://www.relaycast.dev/api/v1/inbox/check)
-
-COUNT=$(echo "$MESSAGES" | jq -r '.messages | length')
-if [ "$COUNT" -gt 0 ]; then
-  FORMATTED=$(echo "$MESSAGES" | jq -r '.messages[] | "Relay message from \(.from): \(.text)"' | head -20)
-  # Inject as additionalContext
-  cat <<EOF
-{
-  "hookSpecificOutput": {
-    "hookEventName": "AfterTool",
-    "additionalContext": "You have $COUNT new relay message(s):\n$FORMATTED\nPlease read and respond to these messages."
-  }
-}
-EOF
-else
-  echo '{}'
-fi
-```
-
-#### AfterAgent — Stop Guard (Equivalent to Claude's Stop Hook)
-
-```bash
-#!/bin/bash
-# Fires after agent responds — if unread messages exist, force retry
-# Loop guard: file-based counter prevents infinite retries
-TOKEN=$(cat ~/.relay/token 2>/dev/null) || exit 0
-GUARD_FILE="/tmp/relay-afteragent-guard-$$"
-RETRY_COUNT=0
-
-if [ -f "$GUARD_FILE" ]; then
-  RETRY_COUNT=$(cat "$GUARD_FILE")
-fi
-
-# Max 3 consecutive retries, then let the agent stop
-if [ "$RETRY_COUNT" -ge 3 ]; then
-  rm -f "$GUARD_FILE"
-  echo '{"decision": "allow"}'
-  exit 0
-fi
-
-MESSAGES=$(curl -s -H "Authorization: Bearer $TOKEN" \
-  https://www.relaycast.dev/api/v1/inbox/check)
-
-COUNT=$(echo "$MESSAGES" | jq -r '.messages | length')
-if [ "$COUNT" -gt 0 ]; then
-  FORMATTED=$(echo "$MESSAGES" | jq -r '.messages[] | "Relay message from \(.from): \(.text)"' | head -20)
-  # Increment retry counter
-  echo $((RETRY_COUNT + 1)) > "$GUARD_FILE"
-  # Block the stop and retry with messages as the reason
-  cat <<EOF
-{
-  "decision": "block",
-  "reason": "You have $COUNT unread relay message(s). Please process them before stopping:\n$FORMATTED"
-}
-EOF
-else
-  # No messages — reset counter and allow stop
-  rm -f "$GUARD_FILE"
-  echo '{"decision": "allow"}'
-fi
-```
-
-#### BeforeModel — Direct Message Injection
-
-```bash
-#!/bin/bash
-# Fires before every LLM call — prepend buffered messages to the request
-# This is the most reliable injection point: messages go directly into the model context
-# Rate-limited: checks inbox at most once every 5 seconds to avoid latency on rapid model calls
-TOKEN=$(cat ~/.relay/token 2>/dev/null) || exit 0
-RATE_FILE="/tmp/relay-beforemodel-last-check"
-NOW=$(date +%s)
-
-if [ -f "$RATE_FILE" ]; then
-  LAST_CHECK=$(cat "$RATE_FILE")
-  ELAPSED=$((NOW - LAST_CHECK))
-  if [ "$ELAPSED" -lt 5 ]; then
-    echo '{}'  # Skip — checked too recently
-    exit 0
-  fi
-fi
-echo "$NOW" > "$RATE_FILE"
-
-# Read the current llm_request from stdin
-INPUT=$(cat)
-MESSAGES=$(curl -s -H "Authorization: Bearer $TOKEN" \
-  https://www.relaycast.dev/api/v1/inbox/check)
-
-COUNT=$(echo "$MESSAGES" | jq -r '.messages | length')
-if [ "$COUNT" -gt 0 ]; then
-  FORMATTED=$(echo "$MESSAGES" | jq -r '.messages[] | "Relay message from \(.from): \(.text)"')
-
-  # Modify llm_request to prepend relay messages as a system message
-  LLM_REQUEST=$(echo "$INPUT" | jq -r '.llm_request')
-  MODIFIED=$(echo "$LLM_REQUEST" | jq --arg msgs "$FORMATTED" \
-    '.messages = [{"role": "system", "content": ("New relay messages:\n" + $msgs)}] + .messages')
-
-  cat <<EOF
-{
-  "hookSpecificOutput": {
-    "hookEventName": "BeforeModel",
-    "llm_request": $MODIFIED
-  }
-}
-EOF
-else
-  echo '{}'
-fi
-```
-
-### Custom Commands
-
-#### commands/status/status.toml
-
-```toml
-prompt = """Check the relay status:
-1. Call mcp_relaycast_agent_list to see who's online
-2. Call mcp_relaycast_inbox_check to see unread messages
-3. Report a summary of agents and any pending messages"""
-```
-
-#### commands/team/team.toml
-
-```toml
-prompt = """Spawn a coordinated team of relay agents for: {{args}}
-
-Follow this protocol:
-1. Analyze the task and determine how many workers are needed (max 5)
-2. For each worker, call mcp_relaycast_agent_add with a clear name and task
-3. Monitor their ACK messages via mcp_relaycast_inbox_check
-4. Coordinate their work by relaying information between them
-5. Collect DONE messages and synthesize the final result"""
-```
-
-### Sub-Agent Definitions
-
-#### agents/relay-worker.md
-
-```markdown
----
-name: relay-worker
-description: A worker agent that communicates via Agent Relay
-model: gemini-2.5-flash
----
-
-You are a Relay worker agent. When you start:
-
-1. Check your inbox with mcp_relaycast_inbox_check for your task assignment
-2. Send an ACK to your lead: mcp_relaycast_dm_send(to: "<lead>", text: "ACK: <your understanding>")
-3. Complete the assigned task
-4. Report back: mcp_relaycast_dm_send(to: "<lead>", text: "DONE: <summary of what you accomplished>")
-
-Always check your inbox periodically during long tasks in case your lead has updates.
-```
-
-### Capability Comparison: All Platforms
-
-| Capability | OpenCode | Claude Code | Gemini CLI | Codex (deferred) |
-|---|---|---|---|---|
-| Tool definition | Native `tool()` | MCP server | MCP server | MCP server |
-| Spawn processes | Native `spawn()` | MCP `agent_add` | MCP `agent_add` | MCP `agent_add` |
-| Stop-time injection | `session.idle` (provisional) | `Stop` hook + block | `AfterAgent` hook + block | `Stop` hook + block |
-| Mid-work injection | TBD | `PostToolUse` + context | `AfterTool` + context | **Not available** |
-| Pre-model injection | — | — | **`BeforeModel` + modify request** (rate-limited) | — |
-| Worker bootstrap | System prompt + env vars | `SubagentStart` hook | Sub-agent `.md` files | — |
-| Context preservation | `session.compacting` (provisional) | `PreCompact` | `PreCompress` | — |
-| Cleanup | `session.end` (provisional) | `SessionEnd` | `SessionEnd` | — |
-| Custom commands | — | Skills (slash commands) | **TOML commands + Skills** | — |
-| Loop guard | TBD | `stop_hook_active` | File-based counter (max 3 retries) | `stop_hook_active` |
-| Install method | `opencode plugin add` | Plugin install | `gemini extensions install` | `npx agent-relay setup codex` |
-| **Real-time ready?** | TBD | **Yes** | **Yes** | **No** — Stop hook only |
-
-**Summary**: Claude Code and Gemini CLI are implementation-ready with mid-work injection hooks. OpenCode needs hook API verification. Codex is deferred — its `Stop` hook only fires at task completion, making real-time communication impractical. Codex becomes viable when `AfterToolUse` hooks are exposed or via app-server `turn/steer` integration (separate relay core spec).
-
-### Implementation Phases
-
-**Phase 7: Gemini Core Extension**
-- gemini-extension.json with Relaycast MCP server
-- AfterTool inbox polling hook + AfterAgent stop guard (with file-based loop guard)
-- SessionStart/SessionEnd hooks
-- Test: `gemini extensions install` → connect → send/receive
-
-**Phase 8: Gemini BeforeModel + Commands**
-- BeforeModel hook for direct message injection (with rate limiting)
-- BeforeToolSelection for context-aware tool filtering
-- `/relay:status`, `/relay:team`, `/relay:fanout` commands
-- Orchestration + protocol skills
-- Worker/researcher/reviewer sub-agent definitions
-- GEMINI.md context file
-
-### Recommendation
-
-Build the Gemini extension **alongside the Claude Code plugin** (Phase 5-6). The structure is nearly identical — same MCP server, similar hooks, same ACK/DONE protocol. The main unique work is the `BeforeModel` hook (which is worth prioritizing as it's the best injection mechanism available on any platform) and the TOML command definitions.
-
----
-
-## 15. Codex — Future (Not Implementing Now)
-
-Codex is **deferred** from this spec's implementation scope. While Codex has a hooks engine ([PR #13276](https://github.com/openai/codex/pull/13276)) with `SessionStart` and `Stop` hooks, it lacks the mid-work injection needed for real-time communication.
-
-### Why Not Now
-
-The `Stop` hook only fires **when the agent finishes its entire response** — after all tool calls are done. There is no `PostToolUse` / `AfterTool` equivalent. If an agent is working for 5 minutes making 20 tool calls, it won't check its inbox until the very end.
-
-| CLI | When inbox is checked | Worst-case latency |
-|---|---|---|
-| Claude Code | After every tool call (`PostToolUse`) | Seconds |
-| Gemini CLI | After every tool call (`AfterTool`) | Seconds |
-| **Codex (hooks only)** | **When agent finishes entire task (`Stop`)** | **Minutes** |
-
-The `AfterToolUse` infrastructure exists in Codex's Rust codebase (`codex-rs/core/src/tools/registry.rs` dispatches after every tool call) but is not wired to the config system — the hooks vector is initialized empty and the `HookEventName` enum only has `SessionStart` and `Stop`.
-
-### When to Revisit
-
-Codex becomes viable for real-time relay communication when **either**:
-
-1. **`AfterToolUse` hooks are exposed** — OpenAI adds `AfterToolUse` to the `HookEventName` enum and wires it through the config/discovery system. The dispatch infrastructure is already production-ready.
-2. **App-server integration via relay core** — The Codex app-server (JSON-RPC 2.0) exposes `turn/steer` for mid-execution message injection. This is the more powerful path but belongs in a separate relay core injection spec — similar to how relay achieves OpenCode injection today.
-
-### What a Future Plugin Would Look Like
-
-When ready, the Codex plugin would follow the same MCP + hooks pattern as Claude Code and Gemini:
-
-```bash
-npx agent-relay setup codex  # writes MCP + hooks to Codex config
-codex                         # just works
-```
-
-The Stop hook implementation can be shared directly with Claude Code (same block/allow/inject pattern, same `stop_hook_active` loop guard). Adding `AfterToolUse` would bring it to full parity.
-
----
-
-## 16. Open Questions
-
-1. **OpenCode plugin API stability** — The plugin system is evolving. Pin to specific API version? Hook event names (`session.idle`, `session.compacting`, `session.end`) are unverified.
-2. **Worker bootstrap** — Should workers install the relay plugin themselves, or should the system prompt include raw tool definitions as a fallback?
-3. **Multi-workspace** — Should a single session support connecting to multiple workspaces simultaneously?
-4. **Auth flow** — Should `relay_connect` support OAuth in addition to workspace keys?
-5. **Process isolation** — Should spawned workers use Bun's shell API instead of Node's `child_process` for better integration with OpenCode's runtime?
-6. **Shared MCP server** — Claude Code and Gemini both use MCP servers natively, so they share the Relaycast MCP server. OpenCode uses native `tool()` with direct HTTP calls. If OpenCode adds MCP support, it could share the same server. For now, OpenCode is the only one with a separate transport layer.
-7. **Migration from file-based inbox** — The existing hooks (`packages/hooks/src/inbox-check/`) read from `/tmp/agent-relay/{agent}/inbox.md` (file-based, broker-backed). The plugins described here use Relaycast HTTP API (brokerless). This is a different architecture. The existing hooks should be updated to support both backends, or the plugin should fully replace them.
-8. **Relaycast API endpoint verification** — The HTTP endpoints listed in Section 12 (e.g., `/api/v1/register`) need verification against the actual Relaycast API. The MCP server may use different internal endpoints.
-9. **Model passthrough on spawn** — The OpenCode plugin passes `--model` to spawned workers. The Claude Code and Gemini plugins spawn via MCP `agent_add` — does this tool support model selection? (Ref: recent fix in commit `4b0eb5a6`)
-10. **Relay core injection spec** — The Codex app-server's `turn/steer` capability (and similar patterns for OpenCode injection) should be specced separately as a relay core injection layer. This would generalize mid-execution message delivery across CLIs that expose programmatic control APIs. This is also what unblocks Codex for real-time communication.
-
-### Resolved
-
-- ~~**BeforeModel injection frequency**~~ — Resolved: rate-limited to once per 5 seconds via timestamp file (see BeforeModel hook implementation).
-- ~~**Gemini AfterAgent retry loop**~~ — Resolved: file-based counter with max 3 retries (see AfterAgent hook implementation).
-
----
-
-## 17. Success Criteria
-
-### OpenCode Plugin
-- [ ] `opencode plugin add agent-relay` installs and registers tools
-- [ ] Single OpenCode instance can connect, send, and receive messages
-- [ ] Can spawn 3+ workers that all communicate independently
-- [ ] Workers survive parent context compaction
-- [ ] All tests pass (unit + integration)
-- [ ] < 50ms overhead per tool call (excluding network)
-- [ ] Clean shutdown — no orphaned processes
-
-### Claude Code Plugin
-- [ ] One-command plugin install configures MCP + hooks
-- [ ] Stop hook blocks when unread messages exist
-- [ ] PostToolUse hook surfaces messages during active work
-- [ ] SubagentStart hook auto-bootstraps spawned workers
-- [ ] PreCompact hook preserves relay state
-- [ ] Skills (`/relay-team`, `/relay-fanout`, `/relay-pipeline`) work correctly
-- [ ] Worker agent definition follows ACK/DONE protocol
-
-### Gemini CLI Extension
-- [ ] `gemini extensions install` registers MCP + hooks + commands
-- [ ] AfterTool hook polls inbox after each tool call
-- [ ] AfterAgent hook blocks stop when unread messages exist
-- [ ] BeforeModel hook injects messages directly into LLM request
-- [ ] Custom commands (`/relay:status`, `/relay:team`) work
-- [ ] Sub-agent definitions spawn correctly
-- [ ] SessionStart auto-connects, SessionEnd cleans up
-
-### Cross-Platform
-- [ ] OpenCode ↔ Claude Code messaging works via shared workspace
-- [ ] OpenCode ↔ Gemini CLI messaging works via shared workspace
-- [ ] Claude Code ↔ Gemini CLI messaging works via shared workspace
-- [ ] Mixed team (all three platforms) can coordinate on a single task
-
-### Codex (deferred — revisit when `AfterToolUse` hooks are available)
-- [ ] Monitor Codex hooks API for `AfterToolUse` support
-- [ ] Spec relay core injection layer for app-server `turn/steer` integration
diff --git a/specs/connect-sdk.md b/specs/connect-sdk.md
deleted file mode 100644
index c92452a5a..000000000
--- a/specs/connect-sdk.md
+++ /dev/null
@@ -1,1153 +0,0 @@
-# Agent Relay SDK — `on_relay` Implementation Spec
-
-**Status**: Draft
-**Date**: 2026-03-13
-**Author**: Design session (human + Claude)
-
----
-
-## 1. Vision
-
-The Agent Relay SDK has two modes:
-
-### Orchestrate Mode (existing)
-
-For **CLI-based agent harnesses** — Claude Code, Codex, Gemini CLI, Aider, Goose, etc. Relay is the runtime: it spawns processes, manages lifecycles, and runs workflows.
-
-```python
-from agent_relay import AgentRelay
-
-relay = AgentRelay()
-await relay.claude.spawn(name="Worker", task="Fix the auth bug")
-await relay.codex.spawn(name="Reviewer", task="Review Worker's changes")
-```
-
-**Use when:** You're orchestrating CLI tools that run as subprocesses. Requires the `agent-relay-broker` binary. This is the existing SDK.
-
-### Communicate Mode (new — this spec)
-
-For **SDK-based agent frameworks** — OpenAI Agents, Claude Agent SDK, Google ADK, CrewAI, Swarms, Agno, Pi. Your framework is the runtime. Relay just adds the wire between agents.
-
-```python
-from agent_relay import on_relay
-
-agent = on_relay(Agent(name="Researcher", ...))
-# Your agent is now "on the relay" — it can talk to any other agent
-```
-
-**Use when:** You already have agents built in a framework and want them to communicate with each other — across frameworks, across processes, across machines. No broker binary needed.
-
-### One SDK, Two Entry Points
-
-Both modes live in the same package:
-
-| Registry | Package |
-|----------|---------|
-| PyPI | `agent-relay-sdk` |
-| npm | `@agent-relay/sdk` |
-
-```python
-# Orchestrate: spawn and manage CLI agents
-from agent_relay import AgentRelay
-
-# Communicate: put existing framework agents on the relay
-from agent_relay import on_relay
-```
-
-The `on_relay()` function and its supporting code (`Relay` core, transport, adapters) are new additions to the existing SDK package. No new package to install.
-
----
-
-## 2. Supported Frameworks (Communicate Mode)
-
-| Framework | Language | Send Mechanism | Receive Mechanism | Push Tier |
-|-----------|----------|---------------|-------------------|-----------|
-| Claude Agent SDK | TS/Python | MCP server | Hook `systemMessage` (PostToolUse, Stop) | 1 — per tool |
-| Pi | TypeScript | AgentTool | `steer()` / `followUp()` | 1 — instant |
-| Google ADK | Python | Function tool | `before_model_callback` mutates LLM request | 1 — per LLM call |
-| OpenAI Agents | Python | `@function_tool` | Dynamic `instructions` callable | 2 — per turn |
-| Agno | Python | Function / MCP | Dynamic `instructions` callable + pre-hook | 2 — per run |
-| Swarms | Python | Plain callable | `receive_message()` triggers new run | 2 — per run |
-| CrewAI | Python | `@tool` | Flow `@listen` + state | 2 — per task |
-
-**Tier 1**: Messages injected mid-execution (during tool calls or before LLM calls).
-**Tier 2**: Messages injected at natural boundaries (between turns, runs, or tasks). Still works — messages arrive via WebSocket in real-time and are buffered locally, so there's no network round-trip delay at injection time.
-
----
-
-## 3. Architecture
-
-```
-Relaycast Cloud (or self-hosted)
-       │
-       │ WebSocket (always-on, lazy-connect)
-       ▼
-┌──────────────┐
-│  Relay Core  │  from agent_relay.communicate import Relay
-│              │  - send(to, text)
-│              │  - post(channel, text)
-│              │  - inbox() → Message[]
-│              │  - agents() → str[]
-│              │  - on_message(callback)
-└──────┬───────┘
-       │
-       │  on_relay(agent) — per-framework adapter
-       │  ~25-30 lines each
-       │
-  ┌────┼────────┬──────────┬────────────┬───────────┬──────────┬──────────┐
-  ▼    ▼        ▼          ▼            ▼           ▼          ▼          ▼
- Pi  Claude   Google     OpenAI      Agno       Swarms     CrewAI     Custom
-     SDK      ADK        Agents
-```
-
-### 3.1 Orchestrate vs Communicate
-
-```
-┌─────────────────────────────────────────────────────────────────────┐
-│                       agent-relay-sdk                               │
-│                                                                     │
-│  ┌─────────────────────────┐   ┌────────────────────────────────┐  │
-│  │    Orchestrate Mode     │   │      Communicate Mode          │  │
-│  │                         │   │                                │  │
-│  │  AgentRelay()           │   │  on_relay(agent)               │  │
-│  │  AgentRelayClient       │   │  Relay class                   │  │
-│  │  WorkflowBuilder        │   │  Framework adapters            │  │
-│  │                         │   │                                │  │
-│  │  Needs: broker binary   │   │  Needs: nothing (brokerless)   │  │
-│  │  Agents: CLI processes  │   │  Agents: your framework's      │  │
-│  │  Comms: MCP tools       │   │  Comms: on_relay() injects     │  │
-│  └─────────────────────────┘   └────────────────────────────────┘  │
-│                                                                     │
-│  Shared: Relaycast messaging infrastructure                         │
-└─────────────────────────────────────────────────────────────────────┘
-```
-
-An Orchestrate-mode CLI agent and a Communicate-mode SDK agent can talk to each other — they both use the same Relaycast messaging network. A Claude Code agent spawned via `relay.claude.spawn()` can DM a Swarms agent that was put `on_relay()`.
-
-### 3.2 Transport
-
-The `Relay` core always opens a WebSocket to Relaycast for real-time message delivery. Messages arriving via WebSocket are:
-1. Delivered to any registered `on_message` callback (for Tier 1 push frameworks)
-2. Buffered in `_pending` list (for Tier 2 poll frameworks, drained via `inbox()`)
-
-### 3.3 Brokerless Mode
-
-Communicate mode talks directly to the Relaycast HTTP/WS API. No Rust broker binary needed. This enables:
-- Serverless / cloud deployment
-- Lightweight integrations
-- Cross-machine agent coordination
-
-If a broker IS running (detected via env var or explicit config), the core routes through the broker instead for unified agent management.
-
----
-
-## 4. Package Structure
-
-The communicate mode code lives inside the existing SDK packages — no new packages to publish.
-
-### 4.1 Python — new files in `packages/sdk-py/`
-
-```
-packages/sdk-py/src/agent_relay/
-├── __init__.py              # ADD: export on_relay
-├── relay.py                 # existing orchestrate mode
-├── client.py                # existing orchestrate mode
-├── ...                      # existing files unchanged
-│
-├── communicate/             # NEW — all communicate mode code
-│   ├── __init__.py          # re-exports Relay, Message, on_relay
-│   ├── core.py              # Relay class (~200 lines)
-│   ├── types.py             # Message, RelayConfig dataclasses
-│   ├── transport.py         # WebSocket + HTTP client for Relaycast
-│   ├── _utils.py            # Shared helpers (format messages, etc.)
-│   └── adapters/
-│       ├── __init__.py
-│       ├── openai_agents.py # on_relay() for OpenAI Agents SDK
-│       ├── claude_sdk.py    # on_relay() for Claude Agent SDK (Python)
-│       ├── google_adk.py    # on_relay() for Google ADK
-│       ├── agno.py          # on_relay() for Agno
-│       ├── swarms.py        # on_relay() for Swarms
-│       └── crewai.py        # on_relay() for CrewAI
-
-packages/sdk-py/tests/
-├── ...                      # existing tests unchanged
-└── communicate/             # NEW
-    ├── conftest.py          # Shared fixtures, mock Relaycast server
-    ├── test_core.py         # Relay class unit tests
-    ├── test_transport.py    # WebSocket/HTTP transport tests
-    ├── test_types.py        # Type validation tests
-    ├── adapters/
-    │   ├── test_openai_agents.py
-    │   ├── test_claude_sdk.py
-    │   ├── test_google_adk.py
-    │   ├── test_agno.py
-    │   ├── test_swarms.py
-    │   └── test_crewai.py
-    └── integration/
-        ├── test_cross_framework.py  # Agent A talks to Agent B across frameworks
-        └── test_end_to_end.py       # Full round-trip with real Relaycast
-```
-
-### 4.2 TypeScript — new files in `packages/sdk/src/`
-
-```
-packages/sdk/src/
-├── index.ts                 # ADD: export communicate module
-├── relay.ts                 # existing orchestrate mode
-├── client.ts                # existing orchestrate mode
-├── ...                      # existing files unchanged
-│
-├── communicate/             # NEW — all communicate mode code
-│   ├── index.ts             # re-exports
-│   ├── core.ts              # Relay class (~200 lines)
-│   ├── types.ts             # Message, RelayConfig interfaces
-│   ├── transport.ts         # WebSocket + HTTP client
-│   ├── utils.ts             # Shared helpers
-│   └── adapters/
-│       ├── claude-sdk.ts    # onRelay() for Claude Agent SDK
-│       └── pi.ts            # onRelay() for Pi
-
-packages/sdk/src/__tests__/
-├── ...                      # existing tests unchanged
-└── communicate/             # NEW
-    ├── core.test.ts
-    ├── transport.test.ts
-    ├── adapters/
-    │   ├── claude-sdk.test.ts
-    │   └── pi.test.ts
-    └── integration/
-        └── cross-framework.test.ts
-```
-
-### 4.3 Import Paths
-
-```python
-# Python — top-level convenience import
-from agent_relay import on_relay
-
-# Python — explicit communicate module
-from agent_relay.communicate import Relay, Message, RelayConfig
-from agent_relay.communicate.adapters.openai_agents import on_relay
-from agent_relay.communicate.adapters.google_adk import on_relay
-
-# The top-level on_relay auto-detects framework (see Section 6.9)
-```
-
-```typescript
-// TypeScript — subpath export
-import { onRelay } from "@agent-relay/sdk/communicate";
-import { Relay } from "@agent-relay/sdk/communicate";
-
-// Framework-specific
-import { onRelay } from "@agent-relay/sdk/communicate/adapters/claude-sdk";
-import { onRelay } from "@agent-relay/sdk/communicate/adapters/pi";
-```
-
-### 4.4 New SDK Exports
-
-Add subpath export to `packages/sdk/package.json`:
-
-```json
-{
-  "exports": {
-    "./communicate": {
-      "types": "./dist/communicate/index.d.ts",
-      "import": "./dist/communicate/index.js"
-    }
-  }
-}
-```
-
-### 4.5 Dependencies
-
-Framework SDKs are **optional** — adapters use lazy imports and raise clear errors if the framework isn't installed. The communicate module adds minimal new dependencies:
-
-**Python** — add to `pyproject.toml`:
-
-```toml
-[project.optional-dependencies]
-communicate = [
-    "aiohttp>=3.9",          # HTTP + WebSocket client (brokerless transport)
-]
-openai-agents = ["openai-agents>=0.1"]
-claude-sdk = ["claude-agent-sdk>=0.1"]
-google-adk = ["google-adk>=0.1"]
-agno = ["agno>=0.1"]
-swarms = ["swarms>=0.1"]
-crewai = ["crewai>=0.1"]
-```
-
-Install: `pip install agent-relay-sdk[communicate]`
-
-**TypeScript** — add `ws` as optional dependency (only needed for brokerless mode; existing SDK may already have WebSocket support via the broker).
-
----
-
-## 5. Core API Specification
-
-### 5.1 Types
-
-#### Python (`types.py`)
-
-```python
-from dataclasses import dataclass, field
-from typing import Callable, Optional, Awaitable
-from enum import Enum
-
-@dataclass(frozen=True)
-class Message:
-    """An inbound message from another agent."""
-    sender: str
-    text: str
-    channel: Optional[str] = None     # None = DM, otherwise channel name
-    thread_id: Optional[str] = None
-    timestamp: Optional[float] = None
-    message_id: Optional[str] = None
-
-@dataclass
-class RelayConfig:
-    """Configuration for a Relay connection. Everything optional with env var defaults."""
-    workspace: Optional[str] = None       # default: RELAY_WORKSPACE env var
-    api_key: Optional[str] = None         # default: RELAY_API_KEY env var
-    base_url: Optional[str] = None        # default: RELAY_BASE_URL or Relaycast cloud
-    channels: list[str] = field(default_factory=lambda: ["general"])
-    poll_interval_ms: int = 1000          # fallback polling if WS fails
-    auto_cleanup: bool = True             # atexit cleanup
-
-MessageCallback = Callable[[Message], None] | Callable[[Message], Awaitable[None]]
-```
-
-#### TypeScript (`types.ts`)
-
-```typescript
-export interface Message {
-  readonly sender: string;
-  readonly text: string;
-  readonly channel?: string;
-  readonly threadId?: string;
-  readonly timestamp?: number;
-  readonly messageId?: string;
-}
-
-export interface RelayConfig {
-  workspace?: string;
-  apiKey?: string;
-  baseUrl?: string;
-  channels?: string[];
-  pollIntervalMs?: number;
-  autoCleanup?: boolean;
-}
-
-export type MessageCallback = (message: Message) => void | Promise<void>;
-```
-
-### 5.2 Relay Class (Core)
-
-#### Python (`core.py`)
-
-```python
-class Relay:
-    """Lightweight connection to the Agent Relay network.
-
-    Usage:
-        relay = Relay("MyAgent")
-        await relay.send("Bob", "Hello!")
-        messages = await relay.inbox()
-        await relay.close()
-    """
-
-    def __init__(self, name: str, config: RelayConfig | None = None):
-        """Register this agent with the Relay network.
-
-        Args:
-            name: Agent name visible to other agents.
-            config: Optional configuration. Defaults from env vars.
-        """
-
-    # ── Sending ─────────────────────────────────────────
-
-    async def send(self, to: str, text: str) -> None:
-        """Send a DM to another agent."""
-
-    async def post(self, channel: str, text: str) -> None:
-        """Post a message to a channel."""
-
-    async def reply(self, message_id: str, text: str) -> None:
-        """Reply to a specific message in its thread."""
-
-    # ── Receiving ───────────────────────────────────────
-
-    async def inbox(self) -> list[Message]:
-        """Drain and return all buffered messages since last call.
-
-        Messages arrive via WebSocket in real-time and are buffered.
-        This method returns the buffer and clears it.
-        Returns an empty list if no new messages.
-        """
-
-    def on_message(self, callback: MessageCallback) -> Callable[[], None]:
-        """Register a callback for real-time message delivery.
-
-        Returns an unsubscribe function.
-        The callback fires immediately when a message arrives via WebSocket.
-        Messages delivered to callbacks are NOT buffered for inbox().
-        """
-
-    # ── Discovery ───────────────────────────────────────
-
-    async def agents(self) -> list[str]:
-        """List currently online agent names."""
-
-    # ── Lifecycle ───────────────────────────────────────
-
-    async def close(self) -> None:
-        """Unregister from the network and close connections."""
-
-    # ── Sync wrappers (for frameworks that need sync) ───
-
-    def send_sync(self, to: str, text: str) -> None: ...
-    def post_sync(self, channel: str, text: str) -> None: ...
-    def inbox_sync(self) -> list[Message]: ...
-    def agents_sync(self) -> list[str]: ...
-    def close_sync(self) -> None: ...
-```
-
-#### TypeScript (`core.ts`)
-
-Same interface but async-only (no sync wrappers needed in TS).
-
-### 5.3 Internal Behavior
-
-#### Lazy Connection
-- WebSocket connects on first `send()`, `post()`, `inbox()`, or `on_message()` call
-- Agent registers with Relaycast on connect
-- No connection attempt at `__init__` time
-
-#### Message Routing
-- If `on_message` callback(s) registered: messages go to callbacks only (not buffered)
-- If no callbacks: messages buffered in `_pending` list, drained by `inbox()`
-- If both: messages go to callbacks AND are buffered (adapter decides which path to use)
-
-#### Auto-Cleanup
-- If `config.auto_cleanup` is True (default), register `atexit` handler to call `close_sync()`
-- Also support async context manager: `async with Relay("name") as relay: ...`
-
-#### Error Handling
-- WebSocket disconnect: auto-reconnect with exponential backoff (1s, 2s, 4s, max 30s)
-- HTTP errors: raise `RelayConnectionError` with status code and message
-- Missing env vars: raise `RelayConfigError` with clear message about which var is needed
-
----
-
-## 6. Adapter Specifications
-
-Each adapter implements `on_relay()` that wraps a framework-native agent object. The function:
-1. Extracts the agent name from the framework's convention
-2. Creates a `Relay` instance (or accepts one as optional parameter)
-3. Appends sending tools in the framework's native tool format
-4. Wraps the instruction/callback mechanism for receiving
-5. Returns the modified agent (same object, mutated in place)
-
-### 6.1 OpenAI Agents Python
-
-**File**: `communicate/adapters/openai_agents.py`
-**Framework**: `openai-agents` (pip install openai-agents)
-**Import**: `from agents import Agent, Runner, function_tool`
-
-```python
-def on_relay(agent: Agent, relay: Relay | None = None) -> Agent:
-    """Put an OpenAI Agents SDK agent on the relay.
-
-    Sending: Injects relay_send, relay_inbox, relay_post, relay_agents as @function_tool
-    Receiving: Wraps agent.instructions as callable that prepends inbox contents
-    """
-```
-
-**Sending tools injected:**
-- `relay_send(to: str, text: str) -> str`
-- `relay_inbox() -> str`
-- `relay_post(channel: str, text: str) -> str`
-- `relay_agents() -> str`
-
-**Receiving mechanism:**
-- Wrap `agent.instructions` (str or callable) as an async callable
-- On each turn, drain `relay.inbox()` and prepend to instructions if non-empty
-- Format: `\n\nNew messages from other agents:\n  {sender}: {text}\n  ...`
-
-**Edge cases:**
-- If `agent.instructions` is already a callable, wrap and chain
-- If `agent.instructions` is a string, convert to callable that returns that string + inbox
-- If `agent.instructions` is None, create callable that returns only inbox (or empty string)
-
-### 6.2 Claude Agent SDK (TypeScript)
-
-**File**: `communicate/adapters/claude-sdk.ts`
-**Framework**: `@anthropic-ai/claude-agent-sdk`
-**Import**: `import { query } from "@anthropic-ai/claude-agent-sdk"`
-
-```typescript
-function onRelay(name: string, opts: QueryOptions): QueryOptions
-```
-
-Note: Claude Agent SDK doesn't have an Agent class — it uses `query(prompt, options)`. So `onRelay` wraps the options object, not an agent.
-
-**Sending:** Inject Relaycast MCP server into `options.mcpServers`
-**Receiving:** Add hooks:
-- `PostToolUse`: drain inbox, return as `systemMessage` if non-empty
-- `Stop`: drain inbox, if non-empty return `systemMessage` + `continue: true` to keep agent alive
-
-### 6.3 Claude Agent SDK (Python)
-
-**File**: `communicate/adapters/claude_sdk.py`
-**Framework**: `claude-agent-sdk`
-**Import**: `from claude_agent_sdk import query, ClaudeAgentOptions`
-
-Same pattern as TypeScript but in Python. Uses Python hooks API.
-
-### 6.4 Pi (TypeScript)
-
-**File**: `communicate/adapters/pi.ts`
-**Framework**: `@mariozechner/pi-coding-agent`
-
-```typescript
-function onRelay(name: string, config: AgentSessionConfig): AgentSessionConfig
-```
-
-**Sending:** Append Relay tools as `AgentTool` objects with TypeBox schemas
-**Receiving:** Register `relay.onMessage()` callback that calls:
-- `session.steer()` if agent is streaming (interrupts)
-- `session.followUp()` if agent is idle (queues)
-
-Requires capturing the session reference. The adapter adds an `onSessionCreated` hook to the config.
-
-### 6.5 Google ADK
-
-**File**: `communicate/adapters/google_adk.py`
-**Framework**: `google-adk`
-**Import**: `from google.adk.agents import Agent`
-
-```python
-def on_relay(agent: Agent, relay: Relay | None = None) -> Agent:
-```
-
-**Sending:** Append plain Python functions to `agent.tools`
-**Receiving:** Set/chain `agent.before_model_callback` to:
-- Drain `relay.inbox()`
-- Append messages to `llm_request.contents` as user Content parts
-- Chain to original callback if one existed
-
-### 6.6 Agno
-
-**File**: `communicate/adapters/agno.py`
-**Framework**: `agno`
-**Import**: `from agno.agent import Agent`
-
-```python
-def on_relay(agent: Agent, relay: Relay | None = None) -> Agent:
-```
-
-**Sending:** Append plain Python functions to `agent.tools`
-**Receiving:** Wrap `agent.instructions` as callable that drains inbox (same pattern as OpenAI Agents)
-
-### 6.7 Swarms
-
-**File**: `communicate/adapters/swarms.py`
-**Framework**: `swarms`
-**Import**: `from swarms import Agent`
-
-```python
-def on_relay(agent: Agent, relay: Relay | None = None) -> Agent:
-```
-
-**Sending:** Append plain callables to `agent.tools`
-**Receiving:** Register `relay.on_message()` callback that calls `agent.receive_message(sender, text)`, which triggers a new run
-
-### 6.8 CrewAI
-
-**File**: `communicate/adapters/crewai.py`
-**Framework**: `crewai`
-**Import**: `from crewai import Agent`
-
-```python
-def on_relay(agent: Agent, relay: Relay | None = None) -> Agent:
-```
-
-**Sending:** Append `@tool` decorated functions to `agent.tools`
-**Receiving:** Wrap `agent.backstory` or `agent.goal` to include inbox contents. CrewAI is the most limited — no dynamic instructions, no mid-task hooks. The adapter documents this limitation.
-
-Alternative for CrewAI: provide a `RelayTool` that the agent can call to check inbox, and recommend using Flows for richer integration.
-
-### 6.9 Auto-Detect `on_relay()` (Top-Level)
-
-The top-level `from agent_relay import on_relay` auto-detects the framework by inspecting the agent object's type:
-
-```python
-# agent_relay/__init__.py
-
-def on_relay(agent, relay=None):
-    """Put any agent on the relay. Auto-detects the framework."""
-    cls_module = type(agent).__module__
-
-    if cls_module.startswith("agents"):
-        from agent_relay.communicate.adapters.openai_agents import on_relay as _adapt
-    elif cls_module.startswith("google.adk"):
-        from agent_relay.communicate.adapters.google_adk import on_relay as _adapt
-    elif cls_module.startswith("agno"):
-        from agent_relay.communicate.adapters.agno import on_relay as _adapt
-    elif cls_module.startswith("swarms"):
-        from agent_relay.communicate.adapters.swarms import on_relay as _adapt
-    elif cls_module.startswith("crewai"):
-        from agent_relay.communicate.adapters.crewai import on_relay as _adapt
-    else:
-        raise TypeError(
-            f"on_relay() doesn't recognize {type(agent).__name__} from {cls_module}. "
-            f"Use a framework-specific adapter: "
-            f"from agent_relay.communicate.adapters.openai_agents import on_relay"
-        )
-
-    return _adapt(agent, relay=relay)
-```
-
-For TypeScript and Claude Agent SDK / Pi (which don't pass an agent object), users import the framework-specific adapter directly:
-
-```typescript
-import { onRelay } from "@agent-relay/sdk/communicate/adapters/claude-sdk";
-import { onRelay } from "@agent-relay/sdk/communicate/adapters/pi";
-```
-
----
-
-## 7. Implementation Phases
-
-### Phase 1: Core + Tests (Foundation)
-
-**Goal**: Relay class works end-to-end. All core primitives tested.
-
-#### Wave 1.1: Types & Core Shell
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 1.1.1 | Worker A | Create `packages/sdk-py/src/agent_relay/communicate/` directory structure with `__init__.py` files |
-| 1.1.2 | Worker A | Implement `communicate/types.py`: `Message`, `RelayConfig`, `MessageCallback`, `RelayConnectionError`, `RelayConfigError` |
-| 1.1.3 | Worker B | Write `tests/communicate/test_types.py`: message creation, frozen immutability, default config values, env var resolution |
-
-**Review Gate 1.1**: Reviewer verifies types are correct, tests pass, no unnecessary complexity.
-
-#### Wave 1.2: Transport Layer
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 1.2.1 | Worker A | Implement `communicate/transport.py`: `RelayTransport` class with `connect()`, `disconnect()`, `send_http()`, `on_ws_message()` |
-| 1.2.2 | Worker A | HTTP methods: `register_agent()`, `unregister_agent()`, `send_dm()`, `post_message()`, `list_agents()`, `check_inbox()` |
-| 1.2.3 | Worker B | Write `tests/communicate/test_transport.py` with mock HTTP/WS server: connection lifecycle, reconnect on disconnect, message buffering, error handling |
-| 1.2.4 | Worker B | Write `tests/communicate/conftest.py`: `MockRelayServer` fixture using `aiohttp` or `pytest-httpserver` that simulates Relaycast API endpoints |
-
-**Review Gate 1.2**: Reviewer verifies transport handles all error cases, reconnect logic is correct, mock server is realistic.
-
-#### Wave 1.3: Relay Core
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 1.3.1 | Worker A | Implement `communicate/core.py`: `Relay` class using `RelayTransport`. Lazy connection, message buffering, callback routing, sync wrappers, atexit cleanup, context manager |
-| 1.3.2 | Worker B | Write `tests/communicate/test_core.py`: lazy connect on first use, send/receive round-trip, inbox drain clears buffer, on_message callback fires, sync wrappers work, close unregisters, context manager cleanup, concurrent access safety |
-
-**Review Gate 1.3**: Full core review. Verify: thread safety, no resource leaks, clean error messages, all tests pass.
-
----
-
-### Phase 2: Tier 1 Adapters (Push-Based)
-
-**Goal**: The three frameworks with real-time push work end-to-end.
-
-#### Wave 2.1: Claude Agent SDK Adapter
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 2.1.1 | Worker A | Implement `communicate/adapters/claude_sdk.py`: `on_relay()` wrapping query options, PostToolUse hook injection, Stop hook with continue, MCP server config |
-| 2.1.2 | Worker B | Write `tests/communicate/adapters/test_claude_sdk.py`: verify hooks are added to options, verify systemMessage returned when inbox non-empty, verify empty inbox returns no systemMessage, verify Stop hook continues when messages pending, verify MCP server injected, verify chaining with existing hooks |
-
-**Review Gate 2.1**: Reviewer verifies hook behavior, edge cases (existing hooks preserved), message formatting.
-
-#### Wave 2.2: Google ADK Adapter
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 2.2.1 | Worker A | Implement `communicate/adapters/google_adk.py`: `on_relay()`, function tools for send/inbox/post/agents, `before_model_callback` injection |
-| 2.2.2 | Worker B | Write `tests/communicate/adapters/test_google_adk.py`: verify tools appended, verify callback injected, verify callback drains inbox into llm_request.contents, verify empty inbox doesn't modify request, verify chaining with existing before_model_callback |
-
-**Review Gate 2.2**: Reviewer verifies ADK Content format is correct, callback chains properly.
-
-#### Wave 2.3: Pi Adapter (TypeScript)
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 2.3.1 | Worker A | Create `packages/sdk/src/communicate/` directory structure |
-| 2.3.2 | Worker A | Implement TypeScript `communicate/core.ts`, `communicate/types.ts`, `communicate/transport.ts` (port from Python) |
-| 2.3.3 | Worker A | Implement `communicate/adapters/pi.ts`: `onRelay()`, TypeBox tool schemas, steer/followUp routing |
-| 2.3.4 | Worker B | Write `tests/communicate/core.test.ts`, `tests/communicate/adapters/pi.test.ts`: same test coverage as Python core + Pi-specific steer vs followUp behavior |
-
-**Review Gate 2.3**: Reviewer verifies TypeScript core has parity with Python, Pi adapter correctly distinguishes steer vs followUp.
-
-#### Wave 2.4: Claude Agent SDK TypeScript Adapter
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 2.4.1 | Worker A | Implement `communicate/adapters/claude-sdk.ts`: `onRelay()` wrapping QueryOptions |
-| 2.4.2 | Worker B | Write `tests/communicate/adapters/claude-sdk.test.ts` |
-
-**Review Gate 2.4**: Reviewer verifies TS adapter matches Python Claude SDK adapter behavior.
-
----
-
-### Phase 3: Tier 2 Adapters (Poll-Based)
-
-**Goal**: All four poll-based frameworks work.
-
-#### Wave 3.1: OpenAI Agents Adapter
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 3.1.1 | Worker A | Implement `communicate/adapters/openai_agents.py`: `on_relay()`, function_tool creation, instructions wrapping (handle str, callable, None) |
-| 3.1.2 | Worker B | Write `tests/communicate/adapters/test_openai_agents.py`: verify tools added, verify instructions wrapped for each input type (str, callable, None), verify inbox injected into instructions, verify empty inbox returns base instructions unchanged |
-
-**Review Gate 3.1**: Reviewer verifies all three instructions input types handled correctly.
-
-#### Wave 3.2: Agno Adapter
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 3.2.1 | Worker A | Implement `communicate/adapters/agno.py`: `on_relay()`, function tools, instructions wrapping |
-| 3.2.2 | Worker B | Write `tests/communicate/adapters/test_agno.py` |
-
-#### Wave 3.3: Swarms Adapter
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 3.3.1 | Worker A | Implement `communicate/adapters/swarms.py`: `on_relay()`, callable tools, on_message → receive_message bridge |
-| 3.3.2 | Worker B | Write `tests/communicate/adapters/test_swarms.py`: verify on_message callback registered, verify receive_message called with correct args |
-
-#### Wave 3.4: CrewAI Adapter
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 3.4.1 | Worker A | Implement `communicate/adapters/crewai.py`: `on_relay()`, @tool decorated functions, document limitations |
-| 3.4.2 | Worker B | Write `tests/communicate/adapters/test_crewai.py` |
-
-**Review Gate 3.x**: Reviewer verifies all four adapters, consistent patterns, no framework SDK imported at module level (lazy imports only).
-
----
-
-### Phase 4: Integration Tests & Cross-Framework
-
-**Goal**: Prove agents in different frameworks can talk to each other.
-
-#### Wave 4.1: Cross-Framework Tests
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 4.1.1 | Worker A | Write `tests/communicate/integration/test_cross_framework.py`: OpenAI Agent sends message → Google ADK agent receives via before_model_callback |
-| 4.1.2 | Worker A | Test: Swarms agent sends → Claude SDK agent receives via hook systemMessage |
-| 4.1.3 | Worker A | Test: Multiple agents in different frameworks all posting to same channel |
-| 4.1.4 | Worker B | Write `tests/communicate/integration/test_end_to_end.py`: real Relaycast server (CI-only, behind env flag), full round-trip send/receive |
-
-**Review Gate 4.1**: Reviewer verifies cross-framework tests use mock server (not real Relaycast) for CI speed, end-to-end tests are clearly marked as integration-only.
-
-#### Wave 4.2: TypeScript Integration Tests
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 4.2.1 | Worker A | Write `tests/communicate/integration/cross-framework.test.ts`: Pi agent ↔ Claude SDK agent communication |
-
-**Review Gate 4.2**: Reviewer verifies TS integration tests.
-
----
-
-### Phase 5: Documentation & Examples
-
-**Goal**: Users can get started in 60 seconds.
-
-#### Wave 5.1: Documentation
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 5.1.1 | Worker A | Update SDK README files to document both Orchestrate and Communicate modes |
-| 5.1.2 | Worker B | Write example scripts in `packages/sdk-py/examples/communicate/`: one per framework, each under 20 lines |
-| 5.1.3 | Worker B | Write example scripts in `packages/sdk/examples/communicate/`: Pi + Claude SDK examples |
-
-#### Wave 5.2: Docs Site Pages
-
-| Task | Agent | Description |
-|------|-------|-------------|
-| 5.2.1 | Worker A | Write `docs/communicate.mdx` + `docs/markdown/communicate.md`: overview page — "Put your agents on the relay" |
-| 5.2.2 | Worker A | Write per-framework pages: `docs/communicate/openai-agents.mdx`, `docs/communicate/claude-sdk.mdx`, `docs/communicate/google-adk.mdx`, `docs/communicate/agno.mdx`, `docs/communicate/swarms.mdx`, `docs/communicate/crewai.mdx`, `docs/communicate/pi.mdx` |
-| 5.2.3 | Worker A | Update `docs/introduction.mdx` to explain the two SDK modes (Orchestrate for CLI harnesses, Communicate for SDK frameworks) |
-
-**Review Gate 5.x**: Reviewer verifies docs are accurate, examples run, both .mdx and .md files in sync per docs-sync rule.
-
----
-
-## 8. Test Strategy (TDD)
-
-### 8.1 Test-First Rule
-
-**Every implementation task MUST have its corresponding test task completed FIRST or IN PARALLEL.** The test file defines the contract. The implementation satisfies it.
-
-### 8.2 Test Categories
-
-| Category | Location | Runs In CI | Description |
-|----------|----------|-----------|-------------|
-| Unit | `tests/test_*.py` | Always | Core class behavior, no network |
-| Adapter | `tests/adapters/test_*.py` | Always | Adapter wrapping logic, mocked framework objects |
-| Integration | `tests/integration/` | Always (mock server) | Cross-framework messaging via mock Relaycast |
-| End-to-End | `tests/integration/test_end_to_end.py` | CI with `RELAY_E2E=1` | Real Relaycast, real WebSocket |
-
-### 8.3 Mock Relaycast Server
-
-A shared test fixture (`conftest.py`) provides `MockRelayServer` that:
-- Runs an HTTP server on a random port
-- Accepts agent registration (`POST /agents`)
-- Accepts message send (`POST /messages`)
-- Returns inbox messages (`GET /inbox/{agent}`)
-- Supports WebSocket upgrade at `/ws/{agent}` for push delivery
-- Tracks all messages for assertion
-
-```python
-@pytest.fixture
-async def relay_server():
-    server = MockRelayServer()
-    await server.start()
-    yield server
-    await server.stop()
-
-@pytest.fixture
-def relay(relay_server) -> Relay:
-    return Relay("TestAgent", RelayConfig(
-        base_url=relay_server.url,
-        api_key="test-key",
-    ))
-```
-
-### 8.4 Adapter Test Pattern
-
-Each adapter test mocks the framework's agent class minimally — just enough to verify `on_relay()` wired things correctly:
-
-```python
-# Example: test_openai_agents.py
-
-class MockAgent:
-    """Minimal mock of openai agents.Agent"""
-    def __init__(self, name, instructions=None, tools=None):
-        self.name = name
-        self.instructions = instructions
-        self.tools = tools or []
-
-async def test_on_relay_adds_tools(relay_server):
-    agent = MockAgent(name="Test")
-    agent = on_relay(agent, relay=Relay("Test", RelayConfig(base_url=relay_server.url)))
-
-    tool_names = [t.name for t in agent.tools]
-    assert "relay_send" in tool_names
-    assert "relay_inbox" in tool_names
-    assert "relay_post" in tool_names
-    assert "relay_agents" in tool_names
-
-async def test_on_relay_wraps_string_instructions(relay_server):
-    relay = Relay("Test", RelayConfig(base_url=relay_server.url))
-    # Pre-buffer a message
-    relay._pending.append(Message(sender="Alice", text="Hello"))
-
-    agent = MockAgent(name="Test", instructions="Be helpful.")
-    agent = on_relay(agent, relay=relay)
-
-    # Instructions should now be callable
-    result = await agent.instructions(None, agent)
-    assert "Be helpful." in result
-    assert "Alice: Hello" in result
-
-async def test_on_relay_wraps_callable_instructions(relay_server):
-    relay = Relay("Test", RelayConfig(base_url=relay_server.url))
-
-    original_called = False
-    async def original(ctx, ag):
-        nonlocal original_called
-        original_called = True
-        return "Original instructions"
-
-    agent = MockAgent(name="Test", instructions=original)
-    agent = on_relay(agent, relay=relay)
-
-    result = await agent.instructions(None, agent)
-    assert original_called
-    assert "Original instructions" in result
-
-async def test_on_relay_empty_inbox_no_modification(relay_server):
-    relay = Relay("Test", RelayConfig(base_url=relay_server.url))
-
-    agent = MockAgent(name="Test", instructions="Be helpful.")
-    agent = on_relay(agent, relay=relay)
-
-    result = await agent.instructions(None, agent)
-    assert result == "Be helpful."
-    assert "messages" not in result.lower()
-```
-
-### 8.5 Coverage Requirements
-
-- Core (`core.py`, `transport.py`, `types.py`): **≥90% line coverage**
-- Each adapter: **≥85% line coverage**
-- Integration tests: **≥1 cross-framework test per adapter**
-
----
-
-## 9. Relaycast API Contract
-
-The Connect SDK depends on these Relaycast HTTP/WS endpoints. This section documents the expected contract.
-
-### 9.1 HTTP Endpoints
-
-```
-POST   /v1/agents/register      { name, workspace }         → { agent_id, token }
-DELETE /v1/agents/{agent_id}                                 → 204
-POST   /v1/messages/dm           { to, text, from }         → { message_id }
-POST   /v1/messages/channel      { channel, text, from }    → { message_id }
-POST   /v1/messages/reply        { message_id, text, from } → { message_id }
-GET    /v1/inbox/{agent_id}                                  → { messages: Message[] }
-GET    /v1/agents                                            → { agents: string[] }
-```
-
-All requests require `Authorization: Bearer {api_key}` header.
-
-### 9.2 WebSocket
-
-```
-WS /v1/ws/{agent_id}?token={token}
-
-Server → Client messages (JSON):
-{ "type": "message", "sender": "...", "text": "...", "channel": "...", "message_id": "..." }
-{ "type": "ping" }
-
-Client → Server messages (JSON):
-{ "type": "pong" }
-```
-
-### 9.3 Note
-
-If the actual Relaycast API differs from the above, the `transport.py` layer is the ONLY file that needs to change. All adapters and core depend on `Relay` class, not on HTTP endpoints directly.
-
----
-
-## 10. Error Handling Spec
-
-| Error | Exception | When | Recovery |
-|-------|-----------|------|----------|
-| Missing RELAY_API_KEY | `RelayConfigError` | First connection attempt | User sets env var |
-| Missing RELAY_WORKSPACE | `RelayConfigError` | First connection attempt | User sets env var |
-| HTTP 401 | `RelayAuthError` | Any API call | User checks API key |
-| HTTP 4xx | `RelayConnectionError` | Any API call | Raise with status + body |
-| HTTP 5xx | `RelayConnectionError` | Any API call | Retry with backoff (3 attempts) |
-| WebSocket disconnect | (silent) | During operation | Auto-reconnect with backoff |
-| Framework not installed | `ImportError` with helpful message | Adapter import | User installs framework |
-| Agent name collision | `RelayConnectionError` | Registration | User picks unique name |
-
----
-
-## 11. Performance Constraints
-
-| Metric | Target |
-|--------|--------|
-| `Relay.__init__()` | <1ms (no I/O) |
-| First `send()` (cold start) | <500ms (register + send) |
-| Subsequent `send()` | <100ms |
-| `inbox()` (buffer drain) | <1ms (local memory only) |
-| WebSocket message delivery | <50ms (Relaycast → callback) |
-| Memory per buffered message | ~1KB |
-| Max buffer size | 10,000 messages (then oldest dropped with warning) |
-
----
-
-## 12. Agent Team Structure
-
-### Roles
-
-| Role | Count | Responsibility |
-|------|-------|---------------|
-| Lead | 1 | Coordinates waves, manages dependencies, resolves blockers |
-| Worker | 2-3 | Implement code (Worker A = implementation, Worker B = tests) |
-| Reviewer | 1-2 | Reviews each wave's output at review gates |
-
-### Workflow
-
-```
-1. Lead assigns wave tasks to workers
-2. Worker B writes tests FIRST (TDD)
-3. Worker A implements to pass tests
-4. Both workers self-verify (tests pass, linting clean)
-5. Lead triggers review gate
-6. Reviewer checks:
-   - Tests are meaningful (not trivially passing)
-   - Implementation is minimal (no over-engineering)
-   - Error handling covers documented cases
-   - No framework SDKs imported at module level
-   - Lazy imports used for optional dependencies
-   - Consistent code style across adapters
-7. Reviewer approves or requests changes
-8. Lead moves to next wave
-```
-
-### Review Gate Checklist
-
-Each review gate MUST verify:
-
-- [ ] All tests pass (`pytest` / `vitest`)
-- [ ] No tests are skipped without documented reason
-- [ ] Coverage meets thresholds (90% core, 85% adapters)
-- [ ] No unnecessary dependencies added
-- [ ] Framework SDKs are lazy-imported (not top-level)
-- [ ] Error messages are clear and actionable
-- [ ] `on_relay()` returns the same agent object (mutated, not cloned)
-- [ ] No global state (each `Relay` instance is independent)
-- [ ] Thread safety: `_pending` buffer is safe for concurrent read/write
-- [ ] Adapter is ≤50 lines of code (excluding imports and docstrings)
-- [ ] Type hints on all public functions
-- [ ] Docstrings on all public functions (Google style for Python, JSDoc for TS)
-
----
-
-## 13. Dependencies
-
-Dependencies are added to the EXISTING SDK packages — no new packages.
-
-### Python — additions to `packages/sdk-py/pyproject.toml`
-
-```toml
-# Add to existing [project.optional-dependencies]
-communicate = [
-    "aiohttp>=3.9",          # HTTP + WebSocket client for brokerless transport
-]
-openai-agents = ["openai-agents>=0.1"]
-claude-sdk = ["claude-agent-sdk>=0.1"]
-google-adk = ["google-adk>=0.1"]
-agno = ["agno>=0.1"]
-swarms = ["swarms>=0.1"]
-crewai = ["crewai>=0.1"]
-
-# Add to existing dev dependencies
-# "pytest-cov>=5.0",  (if not already present)
-```
-
-Install: `pip install agent-relay-sdk[communicate]`
-
-### TypeScript — additions to `packages/sdk/package.json`
-
-```json
-{
-  "peerDependencies": {
-    "@anthropic-ai/claude-agent-sdk": ">=0.1.0",
-    "@mariozechner/pi-coding-agent": ">=0.50.0"
-  },
-  "peerDependenciesMeta": {
-    "@anthropic-ai/claude-agent-sdk": { "optional": true },
-    "@mariozechner/pi-coding-agent": { "optional": true }
-  }
-}
-```
-
-Add `@sinclair/typebox` to devDependencies if not already present (needed for Pi adapter).
-
----
-
-## 14. Success Criteria
-
-The project is complete when:
-
-1. **All 7 `on_relay()` adapters pass their tests** (unit + adapter)
-2. **Cross-framework integration test passes**: Agent A (OpenAI Agents) sends message → Agent B (Google ADK) receives via before_model_callback → Agent B replies → Agent A sees reply in next turn's dynamic instructions
-3. **Each adapter is ≤50 lines** (excluding imports/docstrings)
-4. **Core is ≤200 lines** per language
-5. **Zero framework dependencies at install time** (all optional/peer)
-6. **READMEs show working example for each framework** in ≤10 lines
-7. **CI passes**: all tests, coverage thresholds met, no lint errors
-
----
-
-## 15. Open Questions
-
-1. **Relaycast API authentication for brokerless mode**: Does the current Relaycast API support direct agent registration without going through the broker? If not, a thin registration endpoint may be needed.
-
-2. **Message ordering guarantees**: Should `inbox()` guarantee chronological order? Current spec says yes (WebSocket messages arrive in order, buffer is FIFO).
-
-3. **Deduplication**: If the same message arrives via WebSocket AND a subsequent `inbox()` HTTP poll (fallback), should we deduplicate by `message_id`? Current spec says yes, using a bounded set of seen IDs.
-
-4. **Rate limiting**: Should the `Relay` class enforce client-side rate limits on `send()`? Recommendation: no, let Relaycast server enforce limits and surface 429 errors.
-
-5. **Binary/file messages**: Current spec is text-only. File/image support is out of scope for v1 but the `Message` type should be extensible.
-
----
-
-## Appendix A: Adapter Quick Reference
-
-### Pattern Template (Python, Tier 2)
-
-```python
-"""Adapter for {Framework} — puts {Framework} agents on the relay."""
-from __future__ import annotations
-from typing import TYPE_CHECKING
-
-from agent_relay.communicate.core import Relay
-from agent_relay.communicate.types import RelayConfig
-from agent_relay.communicate._utils import format_inbox
-
-if TYPE_CHECKING:
-    pass  # framework type imports for type checking only
-
-def on_relay(agent, relay: Relay | None = None):
-    """Put a {Framework} agent on the relay.
-
-    Args:
-        agent: A {Framework} Agent instance.
-        relay: Optional pre-configured Relay. Created from agent name if omitted.
-
-    Returns:
-        The same agent, with Relay tools and inbox injection added.
-    """
-    try:
-        from {framework} import {needed_imports}
-    except ImportError:
-        raise ImportError(
-            "on_relay() for {Framework} requires the '{package}' package. "
-            "Install it with: pip install {package}"
-        )
-
-    name = _extract_name(agent)
-    relay = relay or Relay(name)
-
-    # SENDING: append tools
-    agent.tools = [*(agent.tools or []), *_make_tools(relay)]
-
-    # RECEIVING: wrap instructions
-    _wrap_instructions(agent, relay)
-
-    return agent
-```
-
-### Pattern Template (TypeScript, Tier 1)
-
-```typescript
-/**
- * Adapter for {Framework} — puts {Framework} agents on the relay.
- */
-import { Relay } from '../core.js';
-import type { RelayConfig } from '../types.js';
-
-export function onRelay(name: string, config: FrameworkConfig, relay?: Relay): FrameworkConfig {
-  relay ??= new Relay(name);
-
-  return {
-    ...config,
-    // SENDING: inject tools or MCP
-    tools: [...(config.tools ?? []), ...makeTools(relay)],
-    // RECEIVING: inject hooks or callbacks
-    hooks: { ...config.hooks, ...makeHooks(relay) },
-  };
-}
-```
diff --git a/specs/reading-worker-dm-replies.md b/specs/reading-worker-dm-replies.md
deleted file mode 100644
index 4ff875186..000000000
--- a/specs/reading-worker-dm-replies.md
+++ /dev/null
@@ -1,319 +0,0 @@
-# Reading Worker DM Replies — Design Spec
-
-**Status**: Draft
-**Date**: 2026-05-15
-**Issue**: [#860 — Headless Orchestrator Friction Report](https://github.com/AgentWorkforce/relay/issues/860)
-**Related**: `src/cli/commands/messaging.ts`, `.claude/skills/running-headless-orchestrator/SKILL.md`
-
----
-
-## 1. Problem
-
-A headless orchestrator can spawn workers and send them tasks, but **cannot easily read what the workers said back**. The core loop works (`spawn → send → file written`); the introspection loop is broken.
-
-Concretely, after `agent-relay send Worker2 "..."`:
-
-| Command tried                              | What it returned                                    | What the user needed                                                 |
-| ------------------------------------------ | --------------------------------------------------- | -------------------------------------------------------------------- |
-| `agent-relay inbox --agent Worker2`        | `relay: 1` (count only)                             | The text of Worker2's reply                                          |
-| `agent-relay history --to Worker2`         | 60-char preview of one message per conversation     | The full reply, multiple messages, with timestamps                   |
-| `agent-relay inbox --agent Worker2 --json` | `from: "relay"`, `last_message: "Create a file..."` | The _worker's_ reply, not the orchestrator's outbound DM echoed back |
-
-Root causes, located in code:
-
-1. **`inbox` text renderer drops content** — `src/cli/commands/messaging.ts:862-868` prints `${dm.from}: ${dm.unreadCount}` and nothing else. The `last_message` field is populated in the JSON path (`:814-820`) but never rendered in human output.
-2. **`history --to <agent>` is conversation-summary mode, not message mode** — `:683-703` lists conversations with a 60-char preview (`:700`). To see messages, the user must also pass `--from <other-side>`, which is undocumented in `--help` and non-obvious.
-3. **The orchestrator's outbound DM appears as `from: "relay"`** — `:436` hard-codes `senderName = options.from?.trim() || 'relay'`. The literal string `relay` is also a brand name and an entity name, so the JSON payload looks like the _broker_ sent the message. The reply (from Worker2) and the outbound DM (from "relay") sit in the same conversation, and the inbox renderer surfaces _the most recent message regardless of direction_ as `last_message`, which is almost always the orchestrator's own message that triggered the worker.
-4. **No single command answers "what did Worker2 say to me?"** — the user must combine `--to Worker2 --from <self>` and know the conventional sender name, which is itself ambiguous.
-
-A secondary friction (worker didn't appear in `who` immediately after spawn) is acknowledged but addressed separately; see §8.
-
-## 2. Desired end state
-
-After this work, an orchestrator running headlessly can answer **"what did Worker2 say?"** with one command and get full, untruncated, sender-attributed message text:
-
-```text
-$ agent-relay replies Worker2
-[2026-05-15T15:31:02Z] Worker2: Done. Created result.json with {"status":"success","worker":"claude"}.
-[2026-05-15T15:30:55Z] Worker2: Working on it now.
-```
-
-And the JSON form returns structured records, never echoing the orchestrator's own outbound message as the headline:
-
-```json
-[
-  { "from": "Worker2", "text": "Done. Created result.json...", "createdAt": "...", "unread": true },
-  { "from": "Worker2", "text": "Working on it now.", "createdAt": "...", "unread": false }
-]
-```
-
-The existing `inbox`, `history`, and `send` commands keep working but become consistent and self-explanatory:
-
-- `agent-relay inbox --agent Worker2` shows **message content**, not counts.
-- `agent-relay history --to Worker2` shows **messages**, not a conversation summary, with no truncation by default.
-- The orchestrator's outbound DMs are tagged with a clear sender name (`orchestrator` by default, configurable) and a clear `direction` field, so callers can filter trivially.
-- The running-headless-orchestrator skill documents one canonical recipe per question ("How do I read replies?", "How do I detect completion?").
-
-## 3. Scope
-
-### In scope
-
-- A new `agent-relay replies <agent>` command (single-purpose: read inbound messages addressed to the orchestrator from a given worker).
-- Behavior changes to `inbox` and `history` for messages and content rendering.
-- A sender-name change for `agent-relay send` (default `orchestrator`, not `relay`), with backwards-compatible env var to opt out.
-- A `direction` field on returned DM records (`inbound` | `outbound`) relative to the registered agent of the call.
-- Skill updates so the canonical recipe is one command, not three.
-
-### Out of scope
-
-- Bidirectional streaming / `tail -f` semantics for DMs. (Polling + `replies --since` is sufficient and matches existing patterns.)
-- Reworking the broker's DM storage layer.
-- Fixing the `who` race after spawn (tracked separately; see §8).
-- Changing the broker's global read-tracking semantics. The existing
-  auto-read-on-inbox behavior is preserved unchanged. (The `replies
---mark-read` flag is in scope, but it is an explicit, command-local
-  acknowledgement only — it does not alter the default semantics other
-  commands rely on.)
-
-## 4. CLI surface
-
-### 4.1 New: `agent-relay replies <agent>`
-
-```text
-agent-relay replies <agent> [options]
-
-Show messages received from <agent> in the DM conversation between the
-orchestrator and that agent. Returns inbound messages only — never echoes
-the orchestrator's own outbound DMs.
-
-Options:
-  -n, --limit <count>      Number of messages to show (default: 50)
-  --since <time>           Only messages after time (e.g. "5m", "1h", ISO-8601)
-  --unread                 Only unread messages (does NOT mark them read)
-  --mark-read              After printing, mark the printed messages as read
-  --as <name>              Read as this orchestrator identity (default:
-                           $AGENT_RELAY_ORCHESTRATOR_NAME or "orchestrator")
-  --json                   Output as JSON
-  --full                   Disable any truncation (default: full text is shown
-                           in both text and JSON; this flag is a no-op kept for
-                           forward compatibility)
-```
-
-Behavior:
-
-- Resolves the DM conversation between `--as` and `<agent>` (creating none — if no conversation exists, prints `No DM conversation with <agent>.` and exits 0).
-- Lists messages where `sender == <agent>`, in chronological order (oldest first), newest at the bottom — matches how a terminal user reads a transcript.
-- **No text truncation.** Multi-line messages are printed verbatim, indented two spaces under a header line: `[<iso-ts>] <agent>:`.
-- Exit code: 0 if any messages printed or none found; 1 only on connection / auth failure.
-
-### 4.2 Changed: `agent-relay inbox --agent <name>`
-
-The text renderer for `Unread DMs` changes from a count line to a content block. The JSON shape is unchanged (already includes `last_message`); only the text path is updated.
-
-Before:
-
-```text
-Unread DMs:
-  relay: 1
-```
-
-After:
-
-```text
-Unread DMs:
-  Worker2 → orchestrator (3 unread):
-    [2026-05-15T15:31:02Z] Worker2: Done. Created result.json...
-    [2026-05-15T15:30:55Z] Worker2: Working on it now.
-    [2026-05-15T15:30:40Z] Worker2: Got it.
-```
-
-Rules:
-
-- Show **up to 3 most recent unread messages per conversation**, full text (no `...`).
-- If the conversation has more than 3 unread messages, append `… (N more — run \`agent-relay replies <agent> --unread\` to see all)` as the last line of that block.
-- The header line uses `<sender> → <reader>` so the user always knows who said what. The sender is the actual message sender, never a synthesized "relay" string.
-
-### 4.3 Changed: `agent-relay history --to <agent>` (when `<agent>` is not a channel)
-
-Today this command has two behaviors split by whether `--from` is also passed (`:648-703`). After this work it has **one** behavior: print messages in the conversation, newest at the bottom, no truncation.
-
-```text
-$ agent-relay history --to Worker2
-[2026-05-15T15:29:10Z] orchestrator: Create a file called result.json...
-[2026-05-15T15:30:40Z] Worker2: Got it.
-[2026-05-15T15:30:55Z] Worker2: Working on it now.
-[2026-05-15T15:31:02Z] Worker2: Done. Created result.json...
-```
-
-Rules:
-
-- Default `--limit` stays at 50.
-- `--from <agent>` continues to filter by sender (so `history --to Worker2 --from Worker2` is equivalent to `replies Worker2` for the no-`--unread` case).
-- `--json` output gains a `direction` field per message: `"inbound"` if `sender == <agent>`, `"outbound"` otherwise (where "otherwise" means the orchestrator's own sends echoed into the conversation). Existing fields are preserved.
-- The conversation-summary mode is removed. To list all conversations for an agent, use `agent-relay dms list --as <agent>` (existing — see `mcp__relaycast__message_dm_list` and its CLI mirror).
-
-### 4.4 Changed: `agent-relay send` default sender
-
-`:436` changes from:
-
-```ts
-const senderName = options.from?.trim() || 'relay';
-```
-
-to:
-
-```ts
-const senderName =
-  options.from?.trim() || process.env.AGENT_RELAY_ORCHESTRATOR_NAME?.trim() || 'orchestrator';
-```
-
-The `--from` flag's help text is updated:
-
-```text
---from <name>   Sender name (registered identity in relaycast).
-                Default: $AGENT_RELAY_ORCHESTRATOR_NAME or "orchestrator".
-                Used so workers' replies are addressed to a stable name
-                you can read with `agent-relay replies <worker>`.
-```
-
-This is a **user-visible default change**. Existing scripts that filter on the literal string `"relay"` will break. That is desired — `"relay"` was a footgun. Release notes must call this out. No silent migration; users who want the old behavior set `--from relay` or export `AGENT_RELAY_ORCHESTRATOR_NAME=relay`.
-
-### 4.5 New JSON field: `direction`
-
-On every DM message record returned by `replies`, `history --to <agent>`, and `inbox --json`'s `unread_dms[].last_message`, add:
-
-```jsonc
-{
-  "direction": "inbound" | "outbound",
-  // existing fields unchanged
-}
-```
-
-`inbound`/`outbound` is computed relative to the _reader identity_ of the call (the `--as` agent or, for `inbox`, the `--agent` agent). This makes filtering trivial and unambiguous, regardless of what name the orchestrator chose for `--from`.
-
-## 5. Skill updates
-
-`.claude/skills/running-headless-orchestrator/SKILL.md` must change such that the canonical answer to "How do I read worker replies?" is:
-
-```text
-agent-relay replies <worker>
-```
-
-Required edits:
-
-- Replace the lookup-table rows for "Read worker's unread DM replies" and "Read full DM conversation history" with a single row pointing at `agent-relay replies <agent>`.
-- The "Channel vs DM" section keeps its explanation but its examples switch to `replies`.
-- The "Critical: `history` only shows channel messages" caveat is removed (no longer true after §4.3).
-- Add a "Detecting task completion" subsection with a worked example using `agent-relay replies <worker> --since 30s` in a polling loop, terminating when a worker message matches a configurable pattern (default: case-insensitive `done|completed|finished|failed`). Provide the loop as a copy-pasteable bash snippet; do not introduce a new CLI subcommand for this.
-- Update the MCP examples to use `mcp__relaycast__message_dm_list` with `as: "<worker>"` as the equivalent path, and call out that the MCP tool returns full content.
-
-## 6. Tests
-
-### 6.1 Unit & integration tests
-
-Located in `src/cli/commands/messaging.test.ts` (existing) and a new `replies.test.ts`:
-
-- `replies` returns only inbound messages, verified against a seeded conversation with mixed-direction messages.
-- `replies --unread` filters by unread flag and does **not** flip read state.
-- `replies --mark-read` flips read state for printed messages and the next `replies --unread` is empty.
-- `replies` exits 0 with `No DM conversation with <agent>.` when no conversation exists.
-- `replies --since 1h` filters by parsed duration; reuses `parseSince` already in the file.
-- `inbox` text output renders up to 3 unread messages per conversation full-text, and the truncation footer appears when N > 3.
-- `inbox --json` changes are strictly additive: every pre-existing key keeps its
-  prior value and position, and the only difference is a new
-  `unread_dms[].last_message.direction` field. Callers that ignore unknown keys
-  are unaffected; no field is renamed, removed, or retyped.
-- `history --to <agent>` returns messages, not conversation summary, when `<agent>` is non-channel.
-- `history --to '#<channel>'` renders full channel message text, preserves multi-line payloads, and keeps the newest `--limit` messages in chronological order.
-- `history --from <agent>` combines DM and channel messages, applies `--limit` after combining/filtering, and returns partial results with a warning when only one source fails.
-- `who --json` uses broker metrics for `status`, `pid`, `uptimeSecs`, and `memoryBytes`; when metrics are unavailable, the fields fall back to list-only values/nulls without fabricating "last seen" timestamps.
-- `who` human output includes real `PID` and `UPTIME` columns and omits the old placeholder `LAST SEEN` column.
-- `send` without `--from` sends as `orchestrator` (verified by reading back via `replies`).
-- `send` honors `AGENT_RELAY_ORCHESTRATOR_NAME` env var when `--from` is omitted.
-
-The friction transcript from issue #860 is captured as an integration test fixture (`tests/fixtures/issue-860-transcript.test.ts`) that replays the exact command sequence the reporter ran and asserts the new outputs are useful.
-
-### 6.2 End-to-end CLI validation (required before merge)
-
-Automated tests alone are not sufficient. The implementer **must** run the locally-built CLI against a live broker and reproduce the issue #860 scenario end-to-end. This catches packaging, binary-resolution, and integration regressions that unit tests miss.
-
-Required steps, run from a clean working tree on the feature branch:
-
-1. **Build the local CLI from source.** Use `pnpm build` (or the equivalent monorepo build) so `agent-relay` resolves to the branch's compiled output, not a globally installed version. Confirm with `which agent-relay` and `agent-relay --version` — the version must match the bumped value from §7.
-2. **Start a fresh broker.** `agent-relay up` in a scratch project directory; verify `agent-relay status` reports healthy. Do not reuse a long-running broker — state from prior runs masks bugs.
-3. **Replay the issue #860 transcript verbatim.** Spawn two workers (one `codex`, one `claude`), send each a DM that asks them to write a file, then exercise every command in §4:
-   - `agent-relay replies Worker2` — prints full inbound text, no truncation, sender is `Worker2`.
-   - `agent-relay replies Worker2 --unread` — prints only unread, does not mark read.
-   - `agent-relay replies Worker2 --since 30s --json` — JSON includes the new `direction` field with value `inbound` for worker messages.
-   - `agent-relay inbox --agent Worker2` — renders message content (up to 3 per conversation), not counts. Run this **before** `--mark-read` so unread rendering is exercised against still-unread messages.
-   - `agent-relay inbox --agent Worker2 --json` — `unread_dms[].last_message` carries the worker's text; `from` is the worker's name, not `"relay"`.
-   - `agent-relay replies Worker2 --mark-read` — run **after** the inbox checks above; prints + marks read; a follow-up `--unread` call returns empty.
-   - `agent-relay history --to Worker2` — chronological messages, no 60-char preview, outbound sender is `orchestrator`.
-   - `agent-relay history --to '#general' --json` — channel posts are chronological, untruncated, and parseable; multi-line payloads remain intact.
-   - `agent-relay who --json` — Worker2 has real lifecycle fields (`status`, `pid`, `uptimeSecs`, `memoryBytes`) from broker metrics when available, or explicit nulls when unavailable.
-   - `agent-relay send Worker2 "ping"` with no `--from` — Worker2's subsequent `replies` view shows the outbound was attributed to `orchestrator`.
-   - `AGENT_RELAY_ORCHESTRATOR_NAME=ops agent-relay send Worker2 "ping"` — outbound attributed to `ops`.
-4. **Run the polling-loop snippet from the updated skill.** Confirm it terminates correctly when a worker emits `done`/`completed` and that it does not false-positive on the orchestrator's own outbound DMs (this is the failure mode the `direction` field is designed to prevent).
-5. **Tear down and rerun on a second harness.** Validate at least one of: a `claude`-spawned worker _and_ a `codex`-spawned worker, since the reporter hit asymmetric behavior between them.
-6. **Capture evidence.** Paste the live terminal transcript (commands + outputs) into the PR description under a `## E2E validation` heading. A green CI run is not a substitute — the PR must show the actual command outputs from a local broker. If any output diverges from §2 ("Desired end state") or §4 ("CLI surface"), the work is not done.
-
-The PR description must explicitly answer: "Did you run the local CLI end-to-end against a live broker?" with a transcript. Reviewers should reject PRs that skip this section.
-
-## 7. Migration & release
-
-- Bump minor version (`6.1.0`) — default sender name change is user-visible.
-- CHANGELOG entry under "Breaking" calls out:
-  - Default `send --from` is now `orchestrator`, not `relay`. Set `AGENT_RELAY_ORCHESTRATOR_NAME=relay` to restore old behavior.
-  - `history --to <agent>` no longer shows conversation summaries; use `agent-relay dms list --as <agent>` instead.
-- CHANGELOG entry under "Added": `agent-relay replies`, `direction` field on DM JSON.
-- CHANGELOG entry under "Changed": `inbox` text renderer shows DM content.
-- Docs sync: any docs that mention `relay` as the default sender or describe `inbox` count-only output must be updated in both `web/content/docs/*.mdx` and `docs/*.md` (per `.claude/rules/docs-sync.md`).
-
-## 8. Out-of-band: `who` race after spawn
-
-The reporter noted Worker1 (codex) did not appear in `who` immediately after spawn. This is a separate defect — likely a registration race between the codex injector and the broker's agent table — and is **not** addressed by this spec. File as a follow-up issue and link from #860; do not bundle the fix here.
-
-## 9. Acceptance
-
-This spec is complete when an orchestrator can run the exact command sequence from issue #860, and:
-
-- `agent-relay replies Worker2` prints Worker2's full reply text with sender attribution.
-- `agent-relay inbox --agent Worker2` prints content, not counts.
-- `agent-relay history --to Worker2` prints messages, not a conversation summary, and the orchestrator's outbound DMs are clearly attributed to `orchestrator` (or the configured name), not `relay`.
-- `agent-relay history --to '#general'` prints channel messages in chronological order without truncating substantive payloads.
-- `agent-relay who --json` exposes broker-derived lifecycle data suitable for polling instead of fabricated placeholders.
-- The running-headless-orchestrator skill's "read worker replies" guidance is one command, not three.
-- All tests in §6.1 pass.
-- §6.2 end-to-end validation has been performed against a locally-built CLI and a live broker, and the transcript is pasted into the PR description.
-- CHANGELOG and docs are updated, and the `.mdx`/`.md` mirror invariant from `.claude/rules/docs-sync.md` holds.
-
-## 10. Addendum: channel history & structured `who` (issue #860 follow-on)
-
-The original spec scoped the no-truncation fix to **DM** history only (§4.3,
-"when `<agent>` is not a channel"). Field use surfaced that the same friction
-applies to channel reads and to agent health, so this work additionally:
-
-- **Channel history is no longer truncated.** `agent-relay history --to '#<channel>'`
-  prints full message text (multi-line messages render under an indented
-  header), matching the DM transcript behavior. Substantive payloads (literal
-  diffs, grep counts, GO/NO-GO reasoning) are readable in full instead of cut
-  at ~200 chars.
-- **Channel history is chronological.** Messages are sorted oldest→newest and
-  the most recent `--limit` are kept, so a reader reconstructs the
-  conversation top-to-bottom without cross-referencing. The relaycast feed
-  order is no longer trusted; an explicit sort prevents interleaving. The
-  `--from` cross-context history view is de-truncated the same way.
-- **`agent-relay who` reports real lifecycle, not placeholders.** The previous
-  output fabricated `status: "ONLINE"` and `lastSeen: <now>`. `who` now joins
-  the broker `/api/metrics` data so `who --json` emits structured, pollable
-  records: `{ name, cli, status, pid, uptimeSecs, memoryBytes }`. This gives a
-  headless orchestrator a machine-readable health signal instead of scraping
-  the worker TTY. (Idle/exited/restart event-state and an in-TUI context-budget
-  figure remain out of scope — they require a follow-up broker change; `who`
-  does not synthesize values it cannot observe.)
-- **Skill guidance.** `running-headless-orchestrator` (canonical copy plus the
-  `.claude`/`.agents` mirrors) now states that the spawning orchestrator is not
-  a registered relaycast agent — `mcp__relaycast__message_*` tools fail with
-  `Not registered. Call agent.register first.` — so the CLI is the supported
-  path, and `--json` is the recommended way to read full, untruncated,
-  parseable output (`replies`, `history`, `inbox`, `who`).
diff --git a/specs/slack-primitive-impl.md b/specs/slack-primitive-impl.md
deleted file mode 100644
index a07cb4b17..000000000
--- a/specs/slack-primitive-impl.md
+++ /dev/null
@@ -1,67 +0,0 @@
-# Slack Primitive — Implementation Workflow
-
-**Status**: Ready
-**Date**: 2026-05-08
-**Design spec**: [`specs/slack-primitive.md`](./slack-primitive.md)
-**Runtime**: local
-
-This is the implementation prompt for ricky. The full design lives in `specs/slack-primitive.md`. This file exists so ricky has an unambiguous, local-only generation target without having to disambiguate the design doc's runtime-selection discussion.
-
-## Goal
-
-Implement the `packages/slack-primitive` package as described in the design spec. Mirror the layout of `packages/github-primitive` 1:1.
-
-## Files to create
-
-Target files (bare paths so the spec parser picks them up as `targetFiles`):
-
-- packages/slack-primitive/package.json
-- packages/slack-primitive/tsconfig.json
-- packages/slack-primitive/src/index.ts
-- packages/slack-primitive/src/types.ts
-- packages/slack-primitive/src/client.ts
-- packages/slack-primitive/src/workflow-step.ts
-- packages/slack-primitive/src/local-runtime.ts
-- packages/slack-primitive/src/adapter.ts
-- packages/slack-primitive/src/actions/post-message.ts
-- packages/slack-primitive/src/actions/resolve-user.ts
-- packages/slack-primitive/src/actions/resolve-channel.ts
-- packages/slack-primitive/src/__tests__/post-message.test.ts
-- packages/slack-primitive/examples/notify-on-pr.ts
-- packages/slack-primitive/examples/README.md
-
-## Scope (Phase A of the design spec)
-
-Phase A only — postMessage + resolveUser + resolveChannel, with the local Web API runtime. Do not implement askQuestion, the Nango proxy transport, or interactive Block Kit forms in this pass.
-
-Concretely:
-
-1. Create `packages/slack-primitive/` with `src/index.ts`, `src/types.ts`, `src/client.ts`, `src/workflow-step.ts`, `src/local-runtime.ts`, `src/adapter.ts`, and `src/actions/{post-message,resolve-user,resolve-channel}.ts`.
-2. Wire `SLACK_BOT_TOKEN` env-var auth in `local-runtime.ts`. Throw `SlackPostBackError('auth_token_missing')` if absent.
-3. Implement `createSlackStep` with `action: 'postMessage'`, supporting `channel`, `text`, `threadTs`, `mentions`, `unfurl`, and `{{steps.X.output.path}}` templating.
-4. Mention resolution: `@email@example.com` → `users.lookupByEmail`; bare handle `@khaliq` → user-cache lookup; raw user IDs pass through. Unresolved mentions are a soft error (logged on step output, message still posts).
-5. Channel resolution: `#name` → `conversations.list` + match; channel IDs pass through.
-6. Add an example workflow at `packages/slack-primitive/examples/notify-on-pr.ts` that posts a one-line PR-opened announcement (paired with `github-primitive`'s `createPR` step).
-7. Add unit tests in `packages/slack-primitive/src/__tests__/` covering: token-missing error, channel name resolution, mention resolution success and soft-fail, `{{steps.X.output}}` templating substitution.
-
-## Constraints
-
-- Runtime: local only. Do not generate the alternate-runtime adapter, the Nango proxy code, or the fallback-transport code in this pass — those land in later phases described in the design spec.
-- Use `@slack/web-api` as the underlying SDK.
-- TypeScript ES modules, follow the conventions in `.claude/rules/typescript.md`.
-- Match the public-API shape of `packages/github-primitive` so a developer who learned one can read the other in five minutes.
-- Do not modify `packages/github-primitive`. Do not modify the design spec.
-
-## Acceptance gates
-
-1. `pnpm -F slack-primitive build` passes.
-2. `pnpm -F slack-primitive test` passes with the unit tests above green.
-3. `examples/notify-on-pr.ts` type-checks against the rest of the SDK.
-4. A workflow that imports `createSlackStep` and posts to a real channel succeeds when `SLACK_BOT_TOKEN` is set and the bot is invited to the channel. (Manual smoke test — document the steps in `examples/README.md`.)
-
-## Out of scope
-
-- askQuestion (Phase B in the design spec).
-- The alternate-runtime adapter and its transports (Phase A's second half + Phase C in the design spec).
-- Interactive Block Kit, addReaction, updateMessage, replyToThread (Phase C).
-- Workflow runner schema changes for askQuestion audit trail (tracked in issue #825).
diff --git a/specs/slack-primitive.md b/specs/slack-primitive.md
deleted file mode 100644
index 732a3f0d8..000000000
--- a/specs/slack-primitive.md
+++ /dev/null
@@ -1,297 +0,0 @@
-# Slack Primitive — Design Spec
-
-**Status**: Draft
-**Date**: 2026-05-05
-**Author**: design session (human + Claude)
-**Related**: `packages/github-primitive` (precedent), `skills/writing-agent-relay-workflows` (recipe #4 — Escalation)
-
----
-
-## 1. Why this primitive
-
-Workflows already produce _code_ — Phase C push-back lands the diff, the github-primitive opens the PR. What workflows can't yet do well is **talk to a human in the loop**:
-
-- Tell the human something happened ("PR #451 opened, here's the diff").
-- Ask the human a question and **wait for the answer** ("Is this the right account ID? I see two candidates.").
-- Surface a blocker ("Auth failed, I need someone to re-auth this connection.").
-
-Today the answer is "post in a Slack-bridged relay channel and hope the bridge is up." That works in a sandbox where someone is watching. It does not work for cloud runs that the operator has walked away from. **The Slack primitive turns Slack into a first-class transport for workflow ↔ human communication**, with the same local/cloud adapter shape as the github-primitive, so the same workflow file works on a laptop and in `agent-relay cloud run`.
-
-This spec defines the API, runtime selection, and the two flagship verbs:
-
-1. **`postMessage`** — fire-and-forget human notification.
-2. **`askQuestion`** — block the workflow on a human reply.
-
-Plus the cultural change it's meant to enable: **agents should ask for clarification when blocked rather than hallucinate a fix**.
-
-## 2. What we're not building (yet)
-
-- A general-purpose Slack bot. The primitive is **outbound from the workflow**: it posts and it waits-for-reply. Inbound message classification, slash commands, app home views, etc., are out of scope.
-- Channel/user provisioning. The workflow assumes the channel and the bot user already exist.
-- Threaded conversations beyond a single round-trip. `askQuestion` reads exactly one reply (configurable: first reply, first reply by a specific user, first reply matching a regex). Multi-turn dialogue with the same agent goes through the existing relay channel primitive.
-
-## 3. Runtime selection (mirrors github-primitive)
-
-```ts
-type SlackRuntimePreference = 'local' | 'cloud' | 'auto';
-```
-
-| Runtime            | Transport                   | Auth source                      | When chosen                                            |
-| ------------------ | --------------------------- | -------------------------------- | ------------------------------------------------------ |
-| `local`            | Slack Web API directly      | `SLACK_BOT_TOKEN` env or config  | Operator running `agent-relay run` from a laptop       |
-| `cloud`            | Nango → Slack workspace App | Nango connection (per-workspace) | `agent-relay cloud run`, workspace has Slack connected |
-| `cloud` (fallback) | Relay-cloud Slack proxy     | Workspace bearer token           | `agent-relay cloud run`, no Nango Slack connection     |
-| `auto`             | Detects the above in order  | —                                | Default                                                |
-
-Same as github-primitive: **the workflow author writes one file**. `runtime: 'auto'` does the right thing on a laptop and in cloud.
-
-### Auth resolution (cloud path)
-
-Cloud's lambda already wires `Resource.NangoSecretKey.value` and resolves `(workspaceId, provider) → connectionId`. The Slack primitive's `cloud-runtime` reuses that resolver — no new resource binding.
-
-For Slack, we expect the connection to be a **bot user OAuth token** (`xoxb-*`), not user-token (`xoxp-*`). Posting and reading replies both work with `chat:write`, `channels:history`, and `groups:history` scopes. The primitive validates scopes on first call and throws a typed error early if they're missing.
-
-## 4. Public API
-
-The shape is the same as github-primitive: a `SlackClient` for direct calls, a `SlackStepExecutor` + `createSlackStep` for declarative use inside `workflow(...)`. Most workflows use the step form.
-
-### 4.1 Action enum
-
-```ts
-export enum SlackAction {
-  PostMessage = 'postMessage',
-  AskQuestion = 'askQuestion',
-  UpdateMessage = 'updateMessage',
-  AddReaction = 'addReaction',
-  ReplyToThread = 'replyToThread',
-  ResolveUser = 'resolveUser', // email/handle -> user id
-  ResolveChannel = 'resolveChannel', // name -> channel id
-}
-```
-
-The first two are the load-bearing ones. The rest exist to make the first two pleasant (e.g. `ResolveUser` so you can write `@khaliq` instead of `U02ABC123` in workflow source).
-
-### 4.2 `postMessage`
-
-```ts
-createSlackStep({
-  name: 'announce-pr',
-  action: 'postMessage',
-  params: {
-    channel?: '#wf-feature',          // or channel id; optional — see "Default channel resolution" below
-    text: 'PR opened: {{steps.open-pr.output.html_url}}',
-    threadTs?: string,                 // reply into a thread
-    mentions?: string[],               // ['@khaliq', 'U02ABC123', 'khaliq@agent-relay.com']
-    blocks?: SlackBlock[],             // optional rich blocks
-    unfurl?: boolean,                  // default true
-  },
-  output: { mode: 'data', path: 'ts' }, // message timestamp for follow-ups
-})
-```
-
-Notes:
-
-- **Mentions are resolved before send.** `@khaliq` is looked up via `users.lookupByEmail` or the user-cache; if not found, the message still posts but a typed `SlackPostBackError(unknown_mention)` is logged on the step output. This is the same "fail soft on cosmetic errors, fail hard on real errors" pattern as github-primitive.
-- **Templating uses the existing `{{steps.X.output.path}}` chain.** No special Slack-specific templating syntax.
-- **Channel may be a name (`#wf-feature`) or ID.** Names are resolved at step time.
-- **Channel is optional for `postMessage`.** If omitted, cloud-runtime calls sage's existing notify-channel resolver (`/api/internal/proactive/notify-channel`), which falls back: configured workspace default → `#general` → first joined channel (alphabetical). Local-runtime falls back to the `SLACK_DEFAULT_CHANNEL` env var; if that's also unset, validation fails. Reusing sage's resolver keeps "where do agent messages go" configured in one place per workspace. (Follow-up: factor the resolver into a shared cloud package once slack-primitive is the second consumer — tracked separately.)
-
-### 4.3 `askQuestion` — the load-bearing verb
-
-```ts
-createSlackStep({
-  name: 'confirm-account',
-  action: 'askQuestion',
-  params: {
-    channel: '#wf-feature',           // required for askQuestion (no default fallback — be deliberate about where you block on humans)
-    text: 'I found two AWS accounts that match `prod-*`. Which one should I deploy to?\n  • acct-1234 (us-east-1, last modified 2 weeks ago)\n  • acct-5678 (us-west-2, last modified yesterday)\nReply with `1` or `2`.',
-
-    // How long to wait before failing the step
-    timeoutSeconds: 1800,           // required: caller must set explicitly (1800 = 30 min)
-
-    // Who is allowed to answer. Default: anyone in the channel.
-    allowedReplyFrom?: string[],    // ['@khaliq']
-
-    // What constitutes a valid reply. Default: any non-empty text.
-    replyMatch?:
-      | { type: 'regex'; pattern: string }
-      | { type: 'choice'; choices: string[] }      // exact match against one of these
-      | { type: 'any' },
-
-    // Optional: a structured form via Slack Block Kit. When set, the
-    // primitive renders a button group / select / etc. and the
-    // step output is the chosen value, not the raw reply text.
-    interactive?: SlackInteractiveSpec,
-  },
-  output: {
-    mode: 'data',
-    // The step output is the parsed answer:
-    //   { reply: string, replierUserId: string, replyTs: string,
-    //     matchedChoice?: string, matchedGroups?: string[] }
-  },
-})
-```
-
-Semantics:
-
-1. The primitive posts the question, with the workflow run id appended in small text so a human can find the source run.
-2. It begins polling `conversations.history` (cloud) or subscribing via Slack Events API webhook (when configured) for replies in the channel after the question's `ts`. **No global event listener** — each `askQuestion` step polls its own scope, then unsubscribes. This is important: workflows must not interfere with each other.
-3. On a reply that matches `replyMatch` from a user in `allowedReplyFrom`:
-   - Reaction `:eyes:` added to the question (so the human sees their answer was registered).
-   - Step succeeds with the parsed reply as output.
-4. On timeout: step fails with a typed `SlackPostBackError(human_no_response, timeoutSeconds)` so the workflow's `onError` handler can decide whether to retry, escalate again, or hard-fail.
-5. The primitive **never** falls back to a default answer. Silence is failure.
-6. The primitive emits the full question/answer pair on the step's output record. **Durable persistence (for post-mortems) is the workflow runner's responsibility**, not the primitive's — see issue #825.
-
-#### Why `askQuestion` is the hard part
-
-Posting is trivial. Waiting on a human is the load-bearing piece. It introduces three constraints the rest of the SDK doesn't have:
-
-- **Workflows must be allowed to block on external input.** The runner already supports long-running steps (verification gates, sandbox bootstraps), so this is reusing existing plumbing — not inventing new lifecycle.
-- **The step must be resumable, and idempotent across retries.** If the workflow crashes between posting the question and receiving the answer — or if the step retries via `retries: N` — the resumed/retried attempt must find the existing question and rejoin the poll, not re-ask. Implementation: stash `(questionTs, runId, stepName)` in the workflow run record before the polling loop starts, and embed `(runId, stepName)` in the posted message's metadata. The dedup key is `(runId, stepName)` — retries within the same run reuse the same question; different runs are independent even if they share a step name; attempt number is **not** part of the key.
-- **The channel's history must include the question.** This means cloud-runtime cannot use private DMs (the bot can't read DM history without `im:history` scope and that scope is rarely granted). `askQuestion` against a DM throws at validation time.
-
-> **v1 limitation:** `askQuestion` only supports public/private channels, not DMs. To ask a single person privately, create a private channel containing just that person and the bot. DM support may land in v2 if real demand appears.
-
-### 4.4 `replyToThread`, `updateMessage`, `addReaction`
-
-These are utility verbs that exist so post/ask flows can be cleaned up:
-
-- `replyToThread` — post into the thread of a prior message (e.g. announce intermediate progress on a long workflow).
-- `updateMessage` — edit a posted message (e.g. update a "running…" message to "done ✅" with the PR link).
-- `addReaction` — `:white_check_mark:` on the question once the workflow's downstream succeeded; `:x:` on failure.
-
-## 5. Two recipes the skill should encourage
-
-These go into `skills/writing-agent-relay-workflows/SKILL.md` as new chat-native coordination recipes the moment the primitive ships.
-
-### 5.1 Announce + Done (post-result notification)
-
-```ts
-.step(createSlackStep({
-  name: 'notify-pr',
-  dependsOn: ['open-pr'],
-  action: 'postMessage',
-  params: {
-    channel: '#eng-cloud',
-    text: 'Workflow `{{workflow.name}}` opened {{steps.open-pr.output.html_url}}.',
-    mentions: ['@khaliq'],
-  },
-}), { executor: slack })
-```
-
-Pair with the github-primitive's `createPR` step. Whenever a workflow ships a PR, post a one-liner in a channel humans actually watch. This is what closes the loop — without it, PRs created by cloud workflows live in a tab no one opens.
-
-### 5.2 Ask Before You Guess (clarification)
-
-```ts
-.step('plan', {
-  agent: 'lead',
-  task: `... investigate the schema ...
-
-If the migration is ambiguous in any of these ways, do NOT guess and do NOT
-pick one heuristically:
-  - the column to drop has data in production
-  - two tables both look like candidates for the FK target
-  - the index name conflicts with an existing one in a sibling repo
-
-Use the slack primitive to ask the human:
-
-  await slack.askQuestion({
-    channel: '#wf-migration',
-    text: 'I see two candidates for the FK target. Which one?',
-    timeoutSeconds: 1800,
-    replyMatch: { type: 'choice', choices: ['users', 'accounts'] },
-  });
-
-Resume only after you get an answer. Do not exit. Do not pick a default.
-`,
-})
-```
-
-The cultural rule the skill should make explicit: **guessing is worse than asking.** Agents should be told, in their task strings, to escalate via Slack when they hit ambiguity in:
-
-- account/credential choice
-- destructive operations (drops, deletes, force-pushes)
-- scope conflicts ("the spec says X but the existing code does Y")
-- upstream dependencies that look stale or broken
-
-The agent posts the question, waits, and resumes from the answer. The workflow remains deterministic from the runner's point of view — only the _content_ of one step's output is human-supplied.
-
-## 6. Failure modes & error codes
-
-```ts
-type SlackPostBackErrorCode =
-  | 'auth_token_missing' // local: no SLACK_BOT_TOKEN; cloud: no Nango connection
-  | 'auth_token_invalid' // 401 from Slack — token revoked or wrong env
-  | 'missing_scope' // bot lacks chat:write / channels:history / etc.
-  | 'channel_not_found' // name didn't resolve, or bot not invited
-  | 'unknown_mention' // @-mention couldn't be resolved (soft error, logged)
-  | 'human_no_response' // askQuestion hit timeoutSeconds
-  | 'reply_did_not_match' // got a reply but replyMatch rejected it
-  | 'reply_from_unauthorized_user'
-  | 'rate_limited' // 429, with retry-after honored automatically
-  | 'slack_api_error'; // catch-all, includes upstream message
-```
-
-These match the github-primitive's error-code shape so workflow `onError` handlers can discriminate on `err.code` consistently across primitives.
-
-## 7. Implementation outline
-
-```text
-packages/slack-primitive/
-  src/
-    index.ts            // public exports
-    types.ts            // SlackAction, SlackRuntimeConfig, SlackPostBackError
-    client.ts           // SlackClient — direct API
-    workflow-step.ts    // SlackStepExecutor + createSlackStep
-    local-runtime.ts    // Web API via @slack/web-api
-    cloud-runtime.ts    // Nango proxy + relay-cloud fallback
-    adapter.ts          // runtime detection + selection
-    actions/
-      post-message.ts
-      ask-question.ts
-      reply-to-thread.ts
-      update-message.ts
-      resolve-user.ts
-      resolve-channel.ts
-    __tests__/
-  examples/
-    end-to-end-ask-question.ts
-    notify-on-pr.ts
-```
-
-Keep it 1:1 with `packages/github-primitive` so anyone who learned one can read the other in five minutes.
-
-### Cloud-runtime token sourcing
-
-The cloud runtime calls Nango via `nango.proxy({ providerConfigKey, connectionId, method: 'POST', endpoint: '/chat.postMessage', data: {...} })`. Slack accepts both bot-token (xoxb-\*) and user-token; the connection must be configured for bot-token in the Nango Slack integration. Unlike github-app, there's no "give me a token to use directly" semantic — Slack tokens don't rotate per-call — so the proxy form is the right shape here. (This avoids the `nango.getToken(..., true)` confusion the github-primitive had to work through.)
-
-### Local-runtime token sourcing
-
-```ts
-const token = config.token ?? process.env.SLACK_BOT_TOKEN;
-```
-
-If neither is set and we're in `auto` mode, `local` is _not_ selected; `auto` falls through to `cloud`. The detection chain is the same as github-primitive's.
-
-## 8. Acceptance criteria for v1
-
-The primitive ships when:
-
-1. The same workflow file runs unmodified in `agent-relay run` (local) and `agent-relay cloud run` (cloud), posting a Slack message to the configured channel in both.
-2. `askQuestion` blocks the workflow for at least 30 minutes, surfaces a reply matching the configured rule, and the parsed reply is available as `{{steps.X.output.reply}}` to downstream steps.
-3. Workflow resume after a sandbox restart picks up an in-flight `askQuestion` from the message metadata rather than re-asking.
-4. Mismatched scopes throw `missing_scope` at first call with a hint listing the missing scopes.
-5. Cloud-runtime auth uses the workspace's existing Slack Nango connection — no new SST resource bindings, no new env vars beyond what github-primitive already added.
-6. The `writing-agent-relay-workflows` skill has two new recipes: **Announce + Done** and **Ask Before You Guess**.
-
-## 9. Phasing
-
-| Phase | Scope                                                                                                                                                                                   |
-| ----- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| **A** | `postMessage` + `resolveUser` + `resolveChannel`; local + cloud-Nango runtimes; example workflow that posts a PR-opened notification.                                                   |
-| **B** | `askQuestion` with `replyMatch: { type: 'any' \| 'choice' }`; resumability via run-record metadata; example workflow that asks "deploy to prod?" and gates a deploy step on the answer. |
-| **C** | `interactive` Block Kit forms; `addReaction`, `updateMessage`, `replyToThread`; relay-cloud fallback transport; skill-doc update with the two recipes.                                  |
-
-A and B together are the v1 shipped surface — they're what unblocks the "agent should ask rather than guess" cultural change. C is polish that makes the primitive pleasant to use in production workflows.
diff --git a/workflows/PLAN-agent-relay-cli-tests.md b/workflows/PLAN-agent-relay-cli-tests.md
deleted file mode 100644
index d213686c9..000000000
--- a/workflows/PLAN-agent-relay-cli-tests.md
+++ /dev/null
@@ -1,12 +0,0 @@
-# PLAN — Agent Relay CLI Commands TDD Tests
-
-## Goal
-Test all agent-relay CLI commands in headless mode (spawn, who, agents:logs, release, set-model, send, history, inbox) to verify they work and fix failures.
-
-## Commands to Test
-- spawn, who, agents:logs, release, set-model, send, history, inbox
-
-## Known Issues
-- history requires RELAY_API_KEY (broken in local-only mode)
-- send has sender identity issues
-- inbox fails without RELAY_API_KEY
\ No newline at end of file
diff --git a/workflows/ci/add-swift-sdk.ts b/workflows/ci/add-swift-sdk.ts
deleted file mode 100644
index c11eb6232..000000000
--- a/workflows/ci/add-swift-sdk.ts
+++ /dev/null
@@ -1,543 +0,0 @@
-/**
- * add-swift-sdk.ts
- *
- * Creates a native Swift SDK (Swift Package Manager) for the Agent Relay broker.
- *
- * The SDK gives Swift/macOS/iOS apps a first-class client without needing a
- * TypeScript/Node bridge process. It mirrors the TypeScript and Python SDK
- * surface and ships as an SPM package at packages/sdk-swift/.
- *
- * Public API shape (produced by this workflow):
- *
- *   let relay = RelayCast(apiKey: "rk_live_...")
- *   let channel = relay.channel("wf-my-workflow")
- *   channel.subscribe()
- *   channel.post("Hello from Swift")
- *   for await event in channel.events { ... }
- *
- *   let agent  = try await relay.registerOrRotate(name: "my-agent")
- *   try await agent.post(to: "general", message: "Hi")
- *   try await agent.dm(to: "other-agent", message: "...")
- *
- * Phases:
- *   1. Context: read protocol, TS relay client, Python SDK, MSD reference impl
- *   2. Plan: lead designs the full SDK API and file breakdown
- *   3. Scaffold: create dir structure + Package.swift (deterministic)
- *   4. Implement: 3 parallel workers — types, transport, API
- *   5. Verify: file existence check + swift build
- *   6. Review: lead fixes build errors and commits
- *
- * Run with:
- *   agent-relay run workflows/add-swift-sdk.ts
- */
-
-import { workflow, createWorkflowRenderer } from '@agent-relay/sdk/workflows';
-
-const renderer = createWorkflowRenderer();
-
-const cwd = process.cwd(); // run from the relay repo root
-
-const [result] = await Promise.all([
-  workflow('add-swift-sdk')
-    .description(
-      'Create a native Swift SDK (SPM) for the Agent Relay broker — ' +
-        'WebSocket transport, typed events, channel pub/sub, and agent registration. ' +
-        'Mirrors the TypeScript and Python SDK surface.'
-    )
-    .pattern('dag')
-    .channel('wf-add-swift-sdk')
-    .maxConcurrency(5)
-    .timeout(3600000)
-
-    // ── Agents ──────────────────────────────────────────────────────────────
-
-    .agent('lead', {
-      cli: 'claude',
-      role:
-        'Swift SDK architect. Reads context, produces the API design plan, ' +
-        'assigns files to workers, reviews the build output, fixes errors, and ' +
-        'commits the finished package.',
-    })
-    .agent('types-worker', {
-      cli: 'claude',
-      preset: 'worker',
-      role:
-        'Writes Sources/AgentRelaySDK/RelayTypes.swift — ' +
-        'all Codable event structs and enums matching the broker wire protocol.',
-    })
-    .agent('transport-worker', {
-      cli: 'claude',
-      preset: 'worker',
-      role:
-        'Writes Sources/AgentRelaySDK/RelayTransport.swift — ' +
-        'URLSessionWebSocketTask connection with exponential-backoff reconnect and ping/pong.',
-    })
-    .agent('api-worker', {
-      cli: 'claude',
-      preset: 'worker',
-      role:
-        'Writes Sources/AgentRelaySDK/RelayCast.swift — ' +
-        'the public RelayCast, Channel, and AgentClient types that apps consume.',
-    })
-
-    // ── Phase 1: Context gathering (parallel deterministic) ─────────────────
-
-    .step('create-branch', {
-      type: 'deterministic',
-      command: 'git checkout -b feature/swift-sdk 2>&1 || git checkout feature/swift-sdk 2>&1',
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('read-protocol', {
-      type: 'deterministic',
-      dependsOn: ['create-branch'],
-      command: 'cat packages/sdk/src/protocol.ts',
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('read-ts-relay', {
-      type: 'deterministic',
-      dependsOn: ['create-branch'],
-      // First 400 lines covers the WebSocket setup, event loop, and public API
-      command: 'head -400 packages/sdk/src/relay.ts',
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('read-python-sdk', {
-      type: 'deterministic',
-      dependsOn: ['create-branch'],
-      command:
-        'find packages/sdk-py -name "*.py" -not -path "*/__pycache__/*" ' +
-        '| sort | xargs head -n 60 2>/dev/null | head -500',
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('read-msd-reference', {
-      type: 'deterministic',
-      dependsOn: ['create-branch'],
-      // Optional local reference path for a real Swift broker client.
-      // Set SWIFT_RELAY_REFERENCE_PATH to enable this extra context.
-      command:
-        'if [ -n "$SWIFT_RELAY_REFERENCE_PATH" ] && [ -f "$SWIFT_RELAY_REFERENCE_PATH" ]; then ' +
-        'cat "$SWIFT_RELAY_REFERENCE_PATH"; ' +
-        'else echo "No local Swift relay reference configured"; fi',
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    // ── Phase 2: Architecture plan ──────────────────────────────────────────
-
-    .step('plan', {
-      agent: 'lead',
-      dependsOn: ['read-protocol', 'read-ts-relay', 'read-python-sdk', 'read-msd-reference'],
-      task: `You are designing a native Swift SDK for the Agent Relay broker.
-
-## Context
-
-Broker wire protocol (TypeScript):
-{{steps.read-protocol.output}}
-
-TypeScript relay client (first 400 lines):
-{{steps.read-ts-relay.output}}
-
-Python SDK reference:
-{{steps.read-python-sdk.output}}
-
-Existing Swift WebSocket client (MSD project — real production code for this same broker):
-{{steps.read-msd-reference.output}}
-
-## Your task
-
-Produce a detailed design document covering:
-
-1. **Package structure** — files to create under packages/sdk-swift/Sources/AgentRelaySDK/
-2. **RelayTypes.swift** — every Codable struct/enum needed to decode broker events
-   (hello_ack, event, worker_stream, worker_exited, pong, error, deliver_relay, ok)
-   and encode client messages (hello, send_message, spawn_agent, release_agent, ping)
-3. **RelayTransport.swift** — URLSessionWebSocketTask connection class:
-   - connect() / disconnect()
-   - Exponential backoff reconnect (max 30s)
-   - Ping every 20s, disconnect if pong not received in 10s
-   - Inbound message stream via AsyncStream
-4. **RelayCast.swift** — public API:
-   - RelayCast(apiKey:baseURL:) — manages a single WebSocket connection
-   - channel(_ name: String) -> Channel
-   - registerOrRotate(name:) async throws -> AgentRegistration
-   - Channel: subscribe(), post(_ text:), events: AsyncStream<InboundEvent>
-   - AgentClient (returned by as(_ token:)): post(to:message:), dm(to:message:)
-5. **Concurrency model** — Swift structured concurrency (async/await, Actor isolation)
-6. **Platform targets** — macOS 13+, iOS 16+, no third-party dependencies
-
-End your plan with the exact file list workers must create. Use the marker:
-PLAN_COMPLETE`,
-      verification: { type: 'output_contains', value: 'PLAN_COMPLETE' },
-    })
-
-    // ── Phase 3: Scaffold (deterministic) ───────────────────────────────────
-
-    .step('scaffold-dirs', {
-      type: 'deterministic',
-      dependsOn: ['plan'],
-      command:
-        'mkdir -p packages/sdk-swift/Sources/AgentRelaySDK ' + 'packages/sdk-swift/Tests/AgentRelaySDKTests',
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('write-package-swift', {
-      type: 'deterministic',
-      dependsOn: ['scaffold-dirs'],
-      command: `cat > packages/sdk-swift/Package.swift << 'SWIFTEOF'
-// swift-tools-version: 5.9
-// AgentRelaySDK — Swift Package Manager manifest
-//
-// Native Swift client for the Agent Relay broker.
-// No third-party dependencies — uses URLSession WebSocket and Swift Concurrency.
-
-import PackageDescription
-
-let package = Package(
-    name: "AgentRelaySDK",
-    platforms: [
-        .macOS(.v13),
-        .iOS(.v16),
-        .watchOS(.v9),
-        .tvOS(.v16),
-    ],
-    products: [
-        .library(
-            name: "AgentRelaySDK",
-            targets: ["AgentRelaySDK"]
-        ),
-    ],
-    targets: [
-        .target(
-            name: "AgentRelaySDK",
-            path: "Sources/AgentRelaySDK"
-        ),
-        .testTarget(
-            name: "AgentRelaySDKTests",
-            dependencies: ["AgentRelaySDK"],
-            path: "Tests/AgentRelaySDKTests"
-        ),
-    ]
-)
-SWIFTEOF`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('write-test-stub', {
-      type: 'deterministic',
-      dependsOn: ['scaffold-dirs'],
-      command: `cat > packages/sdk-swift/Tests/AgentRelaySDKTests/AgentRelaySDKTests.swift << 'SWIFTEOF'
-import XCTest
-@testable import AgentRelaySDK
-
-final class AgentRelaySDKTests: XCTestCase {
-
-    func testRelayCastInit() {
-        let relay = RelayCast(apiKey: "rk_test_key")
-        XCTAssertNotNil(relay)
-    }
-
-    func testChannelCreation() {
-        let relay = RelayCast(apiKey: "rk_test_key")
-        let channel = relay.channel("test-channel")
-        XCTAssertEqual(channel.name, "test-channel")
-    }
-}
-SWIFTEOF`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 4: Parallel implementation ────────────────────────────────────
-
-    .step('implement-types', {
-      agent: 'types-worker',
-      dependsOn: ['scaffold-dirs', 'plan'],
-      task: `Write the file packages/sdk-swift/Sources/AgentRelaySDK/RelayTypes.swift.
-
-Architecture plan from the lead:
-{{steps.plan.output}}
-
-Broker wire protocol reference:
-{{steps.read-protocol.output}}
-
-MSD production reference (existing working types):
-{{steps.read-msd-reference.output}}
-
-## Requirements
-
-Write a complete Swift file containing:
-
-1. **Inbound message types** (broker → client, Decodable):
-   - InboundMessage enum with associated values for each event type
-   - BrokerEvent struct (wraps event kind + payload)
-   - EventKind for relay_inbound, agent_spawned, agent_released, worker_stream, worker_exited
-   - HelloAck, BrokerError structs
-
-2. **Outbound message types** (client → broker, Encodable):
-   - OutboundMessage enum: hello, send_message, release_agent, ping, list_agents
-   - SendMessagePayload, SpawnAgentPayload, HelloPayload structs
-
-3. Use Swift Codable (Encodable + Decodable), snake_case CodingKeys to match broker JSON.
-4. Add // MARK: - section headers for clarity.
-
-IMPORTANT: Write the complete file to disk at
-packages/sdk-swift/Sources/AgentRelaySDK/RelayTypes.swift
-Do NOT output to stdout — the file must exist on disk when you finish.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('implement-transport', {
-      agent: 'transport-worker',
-      dependsOn: ['scaffold-dirs', 'plan'],
-      task: `Write the file packages/sdk-swift/Sources/AgentRelaySDK/RelayTransport.swift.
-
-Architecture plan from the lead:
-{{steps.plan.output}}
-
-MSD production WebSocket reference (existing working transport for this same broker):
-{{steps.read-msd-reference.output}}
-
-## Requirements
-
-Write a complete Swift file containing the RelayTransport actor:
-
-\`\`\`swift
-actor RelayTransport {
-    init(url: URL)
-    func connect() async throws
-    func disconnect()
-    func send(_ message: Data) async throws
-    var inbound: AsyncStream<Data> { get }
-}
-\`\`\`
-
-Implementation details:
-1. Use URLSessionWebSocketTask — no third-party dependencies
-2. Reconnect with exponential backoff: 0.5s, 1s, 2s, 4s, 8s, 16s, 30s (cap)
-3. Send a ping frame every 20s; treat no pong within 10s as a disconnect
-4. Expose inbound messages via AsyncStream<Data> (raw frames before JSON decode)
-5. Target macOS 13+, iOS 16+ — use structured concurrency (async/await, Task, actor)
-6. Include a ConnectionState enum: disconnected, connecting, connected, reconnecting
-
-IMPORTANT: Write the complete file to disk at
-packages/sdk-swift/Sources/AgentRelaySDK/RelayTransport.swift
-Do NOT output to stdout — the file must exist on disk when you finish.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('implement-api', {
-      agent: 'api-worker',
-      dependsOn: ['scaffold-dirs', 'plan'],
-      task: `Write the file packages/sdk-swift/Sources/AgentRelaySDK/RelayCast.swift.
-
-Architecture plan from the lead:
-{{steps.plan.output}}
-
-TypeScript SDK reference (API shape to mirror):
-{{steps.read-ts-relay.output}}
-
-## Requirements
-
-Write a complete Swift file with the public API:
-
-\`\`\`swift
-// Entry point
-public final class RelayCast {
-    public init(apiKey: String, baseURL: URL? = nil)
-    public func channel(_ name: String) -> Channel
-    public func registerOrRotate(name: String) async throws -> AgentRegistration
-    public func \`as\`(_ agentToken: String) -> AgentClient
-}
-
-// Channel pub/sub
-public final class Channel {
-    public let name: String
-    public func subscribe() async throws
-    public func post(_ text: String) async throws
-    public var events: AsyncStream<RelayChannelEvent> { get }
-}
-
-// Agent posting
-public final class AgentClient {
-    public func post(to channel: String, message: String) async throws
-    public func dm(to agentName: String, message: String) async throws
-}
-
-// Returned by registerOrRotate
-public struct AgentRegistration {
-    public let agentName: String
-    public let token: String
-    public func asClient() -> AgentClient
-}
-
-// Events surfaced to callers
-public struct RelayChannelEvent {
-    public let from: String
-    public let body: String
-    public let threadId: String?
-    public let timestamp: Date
-}
-\`\`\`
-
-Implementation notes:
-- RelayCast owns the RelayTransport, shared by all Channel and AgentClient instances
-- Use the broker's WebSocket at ws://{host}/ws (default: ws://localhost:3889/ws)
-- Authenticate with apiKey in the hello handshake (payload.client_name + apiKey header or token)
-- Channel.subscribe() sends a channel subscription message to the broker
-- AgentClient.post / dm send send_message frames with from set to the agent name
-- All async methods throw RelayError (define a public enum for connection/protocol errors)
-
-IMPORTANT: Write the complete file to disk at
-packages/sdk-swift/Sources/AgentRelaySDK/RelayCast.swift
-Do NOT output to stdout — the file must exist on disk when you finish.`,
-      verification: { type: 'exit_code' },
-    })
-
-    // ── Phase 5: Verify files + build ───────────────────────────────────────
-
-    .step('verify-files', {
-      type: 'deterministic',
-      dependsOn: ['implement-types', 'implement-transport', 'implement-api', 'write-package-swift'],
-      command: `missing=0
-for f in \
-  packages/sdk-swift/Package.swift \
-  packages/sdk-swift/Sources/AgentRelaySDK/RelayTypes.swift \
-  packages/sdk-swift/Sources/AgentRelaySDK/RelayTransport.swift \
-  packages/sdk-swift/Sources/AgentRelaySDK/RelayCast.swift; do
-  if [ ! -f "$f" ]; then
-    echo "MISSING: $f"
-    missing=$((missing + 1))
-  else
-    echo "OK: $f ($(wc -l < "$f") lines)"
-  fi
-done
-if [ $missing -gt 0 ]; then
-  echo "$missing file(s) missing — workers did not write to disk"
-  exit 1
-fi
-echo "All files present"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('swift-build', {
-      type: 'deterministic',
-      dependsOn: ['verify-files'],
-      command: 'cd packages/sdk-swift && swift build 2>&1 | tail -50',
-      captureOutput: true,
-      failOnError: false, // lead will diagnose and fix errors
-    })
-
-    // ── Phase 6: Review, fix, commit ────────────────────────────────────────
-
-    .step('review-and-commit', {
-      agent: 'lead',
-      dependsOn: ['swift-build'],
-      task: `Review the Swift SDK build result and leave durable repo output on this branch.
-
-Files produced:
-{{steps.verify-files.output}}
-
-Swift build output:
-{{steps.swift-build.output}}
-
-## Non-negotiable contract
-
-This workflow is only successful if the repository itself contains the finished SDK files.
-A status update to WorkflowRunner is NOT enough.
-Do not stop after sending a message. Do not remove yourself until the repo state is durable.
-
-## Your tasks
-
-1. **If the build failed:** read each source file, diagnose the errors, and fix them
-   directly using your file-editing tools. Common issues to check:
-   - Missing CodingKeys for snake_case fields
-   - Actor isolation violations (mark mutating state with nonisolated(unsafe) or move to actor)
-   - Missing 'public' access modifiers on exported types
-   - AsyncStream continuation retention
-
-2. **If the build still cannot be made green because the host Swift toolchain is broken:**
-   - keep the generated SDK files on disk
-   - write packages/sdk-swift/README.md explaining the current status
-   - commit the package anyway with a message that clearly notes validation was blocked by the local environment
-   - explicitly say in your final summary whether validation was blocked by environment vs source errors
-
-3. **Write a README** at packages/sdk-swift/README.md with:
-   - Installation (SPM dependency snippet)
-   - Quick-start example (connect, subscribe to a channel, post a message)
-   - Current validation status / known limitations
-
-4. **Commit** all files under packages/sdk-swift/.
-   Required commands:
-   \`\`\`
-   git add packages/sdk-swift/
-   git commit -m "feat(sdk-swift): add native Swift SDK for Agent Relay broker"
-   \`\`\`
-
-5. In your final response include all of the following markers on separate lines:
-   - REVIEW_COMPLETE
-   - README_WRITTEN
-   - COMMIT_STATUS: <committed|blocked>
-   - COMMIT_SHA: <sha or none>
-   - VALIDATION_STATUS: <passed|env-blocked|source-blocked>
-
-If you cannot commit, explain exactly why and output COMMIT_STATUS: blocked.`,
-      verification: { type: 'output_contains', value: 'REVIEW_COMPLETE' },
-    })
-
-    .step('verify-readme', {
-      type: 'deterministic',
-      dependsOn: ['review-and-commit'],
-      command: 'test -f packages/sdk-swift/README.md && echo README_OK',
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('verify-commit', {
-      type: 'deterministic',
-      dependsOn: ['review-and-commit'],
-      command: `set -e
-if ! git rev-parse --verify HEAD >/dev/null 2>&1; then
-  echo "No HEAD commit"
-  exit 1
-fi
-if git diff --quiet HEAD -- packages/sdk-swift; then
-  if git diff --cached --quiet -- packages/sdk-swift; then
-    echo "SDK files are committed in HEAD"
-  else
-    echo "SDK changes are only staged, not committed"
-    exit 1
-  fi
-else
-  echo "SDK changes are still uncommitted after review-and-commit"
-  exit 1
-fi`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .onError('retry', { maxRetries: 2, retryDelayMs: 10000 })
-    .run({
-      onEvent: renderer.onEvent,
-      cwd,
-    }),
-
-  renderer.start(),
-]);
-
-renderer.unmount();
-
-console.log(`\nSwift SDK workflow: ${result.status}`);
-if (result.status === 'completed') {
-  console.log('Package location: packages/sdk-swift/');
-  console.log('Run: cd packages/sdk-swift && swift build');
-}
diff --git a/workflows/ci/cli-observability.ts b/workflows/ci/cli-observability.ts
deleted file mode 100644
index 83a10659e..000000000
--- a/workflows/ci/cli-observability.ts
+++ /dev/null
@@ -1,380 +0,0 @@
-import { workflow } from '@agent-relay/sdk/workflows';
-
-async function main() {
-  const sdkRoot = 'packages/sdk/src';
-
-  const wf = workflow('cli-observability')
-    .description('Add step-level cwd, CLI session collectors, and run summary table to the workflow SDK')
-    .pattern('dag')
-    .channel('wf-cli-observability')
-    .maxConcurrency(4)
-    .timeout(1_800_000);
-
-  // ── Agents ──────────────────────────────────────────────────────────────
-
-  wf.agent('architect', {
-    cli: 'claude',
-    role: 'SDK architect — designs interfaces and coordinates implementation',
-    preset: 'lead',
-    retries: 2,
-  });
-
-  wf.agent('sdk-worker', {
-    cli: 'codex',
-    role: 'TypeScript SDK developer',
-    preset: 'worker',
-    retries: 2,
-  });
-
-  wf.agent('test-writer', {
-    cli: 'codex',
-    role: 'Test engineer — writes unit and integration tests',
-    preset: 'worker',
-    retries: 2,
-  });
-
-  // ── Phase 1: Read current source & plan ─────────────────────────────────
-
-  wf.step('read-types', {
-    type: 'deterministic',
-    command: `cat ${sdkRoot}/workflows/types.ts`,
-  });
-
-  wf.step('read-builder', {
-    type: 'deterministic',
-    command: `cat ${sdkRoot}/workflows/builder.ts`,
-  });
-
-  wf.step('read-runner', {
-    type: 'deterministic',
-    command: `cat ${sdkRoot}/workflows/runner.ts`,
-  });
-
-  wf.step('read-spec', {
-    type: 'deterministic',
-    command: 'cat workflows/specs/cli-observability.md',
-  });
-
-  wf.step('plan', {
-    agent: 'architect',
-    task: `
-You are implementing the CLI Observability spec for @agent-relay/sdk.
-
-Read the spec carefully:
-{{steps.read-spec.output}}
-
-Current SDK types:
-{{steps.read-types.output}}
-
-Current builder API:
-{{steps.read-builder.output}}
-
-Current runner API:
-{{steps.read-runner.output}}
-
-Produce an implementation plan that covers:
-1. Exact files to create/modify in packages/sdk/src/workflows/
-2. The CliSessionCollector interface and registry pattern
-3. How step-level cwd integrates into the existing resolveAgentCwd / resolveStepWorkdir chain
-4. The new step:agent-report event wiring
-5. The run summary table formatting logic
-6. Test file locations and fixture strategy
-
-Output the plan as a numbered checklist. Do NOT write any code — just the plan.
-    `.trim(),
-    dependsOn: ['read-types', 'read-builder', 'read-runner', 'read-spec'],
-    verification: { type: 'output_contains', value: 'PLAN_COMPLETE' },
-  });
-
-  // ── Phase 2: Parallel implementation ────────────────────────────────────
-
-  // 2a. Step-level cwd — types + builder + runner
-  wf.step('impl-step-cwd', {
-    agent: 'sdk-worker',
-    task: `
-Implement step-level cwd support in @agent-relay/sdk.
-
-Implementation plan:
-{{steps.plan.output}}
-
-Spec:
-{{steps.read-spec.output}}
-
-Changes needed:
-1. In packages/sdk/src/workflows/types.ts:
-   - Add optional \`cwd?: string\` to WorkflowStep type
-2. In packages/sdk/src/workflows/builder.ts:
-   - Add optional \`cwd?: string\` to AgentStepOptions and DeterministicStepOptions
-   - Pass cwd through when constructing step config in the step() method
-3. In packages/sdk/src/workflows/runner.ts:
-   - In executeAgentStep: resolve effective cwd as step.cwd ?? resolveStepWorkdir(step) ?? resolveAgentCwd(agentDef) ?? this.cwd
-   - In executeDeterministicStep: same resolution chain
-   - In execNonInteractive: pass resolved cwd to spawn
-
-Keep changes minimal. Do not refactor existing code beyond what is needed.
-    `.trim(),
-    dependsOn: ['plan'],
-    verification: { type: 'exit_code' },
-  });
-
-  // 2b. CliSessionCollector interface + registry
-  wf.step('impl-collector-interface', {
-    agent: 'sdk-worker',
-    task: `
-Create the CliSessionCollector interface and collector registry.
-
-Implementation plan:
-{{steps.plan.output}}
-
-Spec:
-{{steps.read-spec.output}}
-
-Create file: packages/sdk/src/workflows/cli-session-collector.ts
-
-This file must export:
-1. CliSessionReport interface (cli, sessionId, model, provider, durationMs, cost, tokens, turns, toolCalls, errors, finalStatus, summary)
-2. CliSessionQuery interface (cli, cwd, startedAt, completedAt)
-3. CliSessionCollector interface (canCollect, collect)
-4. createCollector(cli: AgentCli): CliSessionCollector | null — factory that returns the right collector
-5. collectCliSession(query: CliSessionQuery): Promise<CliSessionReport | null> — convenience wrapper
-
-Also add the export to packages/sdk/src/workflows/index.ts.
-
-Do NOT implement the individual collectors yet — just the interface, factory skeleton (returning null for now), and convenience wrapper.
-    `.trim(),
-    dependsOn: ['plan'],
-    verification: { type: 'file_exists', value: 'packages/sdk/src/workflows/cli-session-collector.ts' },
-  });
-
-  // 2c. OpenCode collector
-  wf.step('impl-opencode-collector', {
-    agent: 'sdk-worker',
-    task: `
-Implement the OpenCode session collector.
-
-Implementation plan:
-{{steps.plan.output}}
-
-Spec (see section 2b):
-{{steps.read-spec.output}}
-
-Collector interface (already created):
-{{steps.impl-collector-interface.output}}
-
-Create file: packages/sdk/src/workflows/collectors/opencode.ts
-
-OpenCode stores data in ~/.local/share/opencode/opencode.db (SQLite).
-
-Schema:
-- session: id, directory, time_created
-- message: id, session_id, time_created, data (JSON with role, modelID, providerID, cost, tokens{total,input,output,reasoning,cache{read,write}}, finish)
-- part: id, message_id, session_id, time_created, data (JSON with type, text, name for tool calls)
-
-Matching: Find session where directory = query.cwd AND time_created BETWEEN startedAt-5000 AND completedAt, ORDER BY time_created DESC LIMIT 1.
-
-Use better-sqlite3 for sync reads (it's already a common transitive dep). If the DB doesn't exist or is locked, canCollect() returns false.
-
-Aggregate tokens by summing across all messages. Extract tool calls from parts where data.type includes 'tool'. Extract errors by scanning text parts for lines matching /^Error|error:|Command failed|FAIL/.
-
-Wire it into the factory in cli-session-collector.ts for cli === 'opencode'.
-    `.trim(),
-    dependsOn: ['impl-collector-interface'],
-    verification: { type: 'file_exists', value: 'packages/sdk/src/workflows/collectors/opencode.ts' },
-  });
-
-  // 2d. Claude Code collector
-  wf.step('impl-claude-collector', {
-    agent: 'sdk-worker',
-    task: `
-Implement the Claude Code session collector.
-
-Implementation plan:
-{{steps.plan.output}}
-
-Spec (see section 2c):
-{{steps.read-spec.output}}
-
-Collector interface:
-{{steps.impl-collector-interface.output}}
-
-Create file: packages/sdk/src/workflows/collectors/claude.ts
-
-Claude Code stores:
-1. ~/.claude/history.jsonl — one JSON per line: { display, timestamp (ms), project (abs path), sessionId }
-2. ~/.claude/projects/{encoded-path}/{sessionId}.jsonl — full session log, one JSON per line
-
-History matching: Read history.jsonl bottom-up, find entry where project matches query.cwd and timestamp is within [startedAt-5000, completedAt].
-
-Session JSONL format: Each line has type (user, assistant, tool_use, tool_result, system, progress) plus content. Assistant messages may include usage metadata with token counts.
-
-Encode project path the same way Claude Code does (replace / with --, strip leading -).
-
-If files don't exist or aren't readable, canCollect() returns false. Use fs.createReadStream with readline for efficient bottom-up reading of large JSONL files.
-
-Wire it into the factory in cli-session-collector.ts for cli === 'claude'.
-    `.trim(),
-    dependsOn: ['impl-collector-interface'],
-    verification: { type: 'file_exists', value: 'packages/sdk/src/workflows/collectors/claude.ts' },
-  });
-
-  // 2e. Codex collector
-  wf.step('impl-codex-collector', {
-    agent: 'sdk-worker',
-    task: `
-Implement the Codex session collector.
-
-Implementation plan:
-{{steps.plan.output}}
-
-Spec (see section 2d):
-{{steps.read-spec.output}}
-
-Collector interface:
-{{steps.impl-collector-interface.output}}
-
-Create file: packages/sdk/src/workflows/collectors/codex.ts
-
-Codex stores:
-1. ~/.codex/history.jsonl — one JSON per line: { session_id, ts (unix seconds, NOT ms), text }
-2. ~/.codex/state_5.sqlite — SQLite with tables:
-   - threads: id, cwd, model_provider, tokens_used, created_at, updated_at
-   - logs: thread_id, ts, level, message
-
-Matching: Query threads table for cwd = query.cwd AND created_at within the step window. If no SQLite match, fall back to history.jsonl.
-
-Token extraction: threads.tokens_used gives the total. For breakdown, check if the schema has per-field columns.
-
-Error extraction: Query logs where level = 'error' AND thread_id = matched thread.
-
-If files don't exist, canCollect() returns false.
-
-Wire it into the factory in cli-session-collector.ts for cli === 'codex'.
-    `.trim(),
-    dependsOn: ['impl-collector-interface'],
-    verification: { type: 'file_exists', value: 'packages/sdk/src/workflows/collectors/codex.ts' },
-  });
-
-  // ── Phase 3: Runner integration ─────────────────────────────────────────
-
-  wf.step('impl-runner-integration', {
-    agent: 'sdk-worker',
-    task: `
-Integrate CLI session collection into the workflow runner.
-
-Implementation plan:
-{{steps.plan.output}}
-
-Spec (sections 3a-3d):
-{{steps.read-spec.output}}
-
-The collectors are implemented:
-- OpenCode: {{steps.impl-opencode-collector.output}}
-- Claude: {{steps.impl-claude-collector.output}}
-- Codex: {{steps.impl-codex-collector.output}}
-
-Changes to packages/sdk/src/workflows/runner.ts:
-
-1. Add 'step:agent-report' to WorkflowEvent union type with fields: runId, stepName, report (CliSessionReport)
-
-2. Import collectCliSession from ./cli-session-collector
-
-3. In executeAgentStep, after spawnAndWait returns and before completion decision:
-   - Record stepStartTime at the top of executeAgentStep
-   - Call collectCliSession({ cli: agentDef.cli, cwd: effectiveCwd, startedAt: stepStartTime, completedAt: Date.now() })
-   - If report is non-null, emit step:agent-report event
-   - Store report in a Map<string, CliSessionReport> keyed by stepName
-
-4. Add persistAgentReport method: write {stepName}.report.json to .agent-relay/step-outputs/{runId}/
-
-5. Enhance logRunSummary to print a table when reports exist:
-   - Columns: Step, Status, Model, Cost, Tokens, Duration, Errors
-   - Footer row with totals for Cost, Tokens, Duration
-   - For failed steps, print first error line indented below
-
-Keep changes surgical — do not refactor existing runner methods.
-    `.trim(),
-    dependsOn: ['impl-step-cwd', 'impl-opencode-collector', 'impl-claude-collector', 'impl-codex-collector'],
-    verification: { type: 'exit_code' },
-  });
-
-  // ── Phase 4: Tests ──────────────────────────────────────────────────────
-
-  wf.step('write-tests', {
-    agent: 'test-writer',
-    task: `
-Write tests for the CLI observability features.
-
-Implementation plan:
-{{steps.plan.output}}
-
-Runner integration output:
-{{steps.impl-runner-integration.output}}
-
-Create the following test files:
-
-1. packages/sdk/src/workflows/__tests__/step-cwd.test.ts
-   - Test that step.cwd takes precedence over agent.cwd and runner cwd
-   - Test that deterministic steps also respect step.cwd
-   - Test fallback chain: step.cwd → step.workdir → agent.cwd → runner.cwd
-
-2. packages/sdk/src/workflows/__tests__/cli-session-collector.test.ts
-   - Test collectCliSession returns null for unknown CLI
-   - Test canCollect returns false when data store doesn't exist
-
-3. packages/sdk/src/workflows/__tests__/collectors/opencode.test.ts
-   - Create a test fixture SQLite DB with known session/message/part rows
-   - Test matching by directory and time window
-   - Test token aggregation
-   - Test error extraction
-   - Test canCollect returns false when DB missing
-
-4. packages/sdk/src/workflows/__tests__/collectors/claude.test.ts
-   - Create fixture history.jsonl and session JSONL files in a temp dir
-   - Test matching by project path and timestamp
-   - Test canCollect returns false when files missing
-
-5. packages/sdk/src/workflows/__tests__/collectors/codex.test.ts
-   - Create fixture SQLite DB with threads and logs tables
-   - Test matching by cwd and time window
-   - Test error extraction from logs table
-
-6. packages/sdk/src/workflows/__tests__/run-summary-table.test.ts
-   - Snapshot test of the formatted summary table given mock CliSessionReport objects
-   - Test with all-passing steps
-   - Test with one failed step (should show error line)
-   - Test with no reports (should fall back to existing summary format)
-
-Use vitest. Mock file system paths to point at temp fixtures. Do NOT hit real user data stores.
-    `.trim(),
-    dependsOn: ['impl-runner-integration'],
-    verification: { type: 'exit_code' },
-  });
-
-  // ── Phase 5: Verify ─────────────────────────────────────────────────────
-
-  wf.step('typecheck', {
-    type: 'deterministic',
-    command: 'cd packages/sdk && npx tsc --noEmit',
-    dependsOn: ['write-tests'],
-  });
-
-  wf.step('run-tests', {
-    type: 'deterministic',
-    command: 'cd packages/sdk && npx vitest run --reporter=verbose',
-    dependsOn: ['typecheck'],
-  });
-
-  // ── Run ─────────────────────────────────────────────────────────────────
-
-  const result = await wf.onError('retry', { maxRetries: 2, retryDelayMs: 10_000 }).run({
-    onEvent: (e) => {
-      if (e.type.startsWith('step:')) {
-        console.log(`[${e.type}] ${e.stepName ?? ''}`);
-      }
-    },
-  });
-
-  console.log(`Done: ${result.status} (${result.id})`);
-}
-
-main().catch(console.error);
diff --git a/workflows/ci/harden-npm-publish.ts b/workflows/ci/harden-npm-publish.ts
deleted file mode 100644
index 785f64236..000000000
--- a/workflows/ci/harden-npm-publish.ts
+++ /dev/null
@@ -1,350 +0,0 @@
-/**
- * Harden the npm publish path after the registry rejected agent-relay with:
- *
- *   E415 Unsupported Media Type - Hard link is not allowed
- *
- * Root cause: the root package publishes `packages/` wholesale. During the
- * publish job, workspace installs can leave nested package node_modules trees
- * under packages/*; esbuild can materialize its bin shim as a hard link, which
- * npm's registry rejects after provenance is already signed.
- *
- * Long-term target:
- *   1. Publish a validated .tgz, not the live working directory.
- *   2. Keep nested workspace node_modules out of the root package.
- *   3. Add a tarball validator that fails on hard links and unexpected files.
- *   4. Run the same validator in PR package validation and release publish.
- *
- * Run from relay repo root:
- *   agent-relay run workflows/ci/harden-npm-publish.ts
- */
-
-import { workflow } from '@agent-relay/sdk/workflows';
-
-const BRANCH = 'fix/npm-publish-hardening';
-const WORKTREE = '.worktrees/npm-publish-hardening';
-
-async function main() {
-  const wf = workflow('harden-npm-publish')
-    .description('Make npm publish use a clean, validated tarball artifact')
-    .pattern('dag')
-    .channel('wf-npm-publish-hardening')
-    .maxConcurrency(4)
-    .timeout(1_800_000)
-    .agent('architect', {
-      cli: 'claude',
-      preset: 'lead',
-      role: 'Design the npm packaging hardening plan',
-      retries: 2,
-    })
-    .agent('package-worker', {
-      cli: 'codex',
-      preset: 'worker',
-      role: 'Implement package manifest and tarball validation changes',
-      retries: 2,
-    })
-    .agent('workflow-worker', {
-      cli: 'codex',
-      preset: 'worker',
-      role: 'Harden GitHub Actions publish and validation jobs',
-      retries: 2,
-    })
-    .agent('reviewer', {
-      cli: 'claude',
-      preset: 'reviewer',
-      role: 'Review release hardening for correctness and low regression risk',
-      retries: 2,
-    });
-
-  wf.step('setup-worktree', {
-    type: 'deterministic',
-    command: `
-set -eu
-repo_root="$(git rev-parse --show-toplevel)"
-target="$repo_root/${WORKTREE}"
-
-if git worktree list --porcelain | grep -Fxq "worktree $target"; then
-  echo "Worktree already ready at ${WORKTREE}"
-elif [ -e "${WORKTREE}" ]; then
-  echo "Path exists but is not a registered git worktree: ${WORKTREE}" >&2
-  exit 1
-elif git show-ref --verify --quiet refs/heads/${BRANCH}; then
-  git worktree add "${WORKTREE}" "${BRANCH}"
-else
-  git worktree add "${WORKTREE}" -b "${BRANCH}" HEAD
-fi
-
-git -C "${WORKTREE}" status --short
-`.trim(),
-    failOnError: true,
-  });
-
-  wf.step('install-worktree-deps', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['setup-worktree'],
-    command: 'npm ci --ignore-scripts',
-    failOnError: true,
-  });
-
-  wf.step('read-package-context', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['setup-worktree'],
-    command: [
-      `echo "=== root package files and bundled deps ==="`,
-      `sed -n '1,120p' package.json`,
-      `sed -n '245,270p' package.json`,
-      `echo "=== root npmignore ==="`,
-      `sed -n '1,120p' .npmignore`,
-      `echo "=== bundled workspace package file allowlists ==="`,
-      `node -e "const fs=require('fs'); const names=new Set((require('./package.json').bundledDependencies||require('./package.json').bundleDependencies||[])); for (const d of fs.readdirSync('packages')) { const f='packages/'+d+'/package.json'; if (!fs.existsSync(f)) continue; const p=require('./'+f); if (names.has(p.name)) console.log(f, JSON.stringify(p.files||[])); }"`,
-      `echo "=== postinstall workspace linking ==="`,
-      `sed -n '559,690p' scripts/postinstall.js`,
-    ].join(' && '),
-    captureOutput: true,
-    failOnError: true,
-  });
-
-  wf.step('read-ci-context', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['setup-worktree'],
-    command: [
-      `echo "=== publish-main job ==="`,
-      `sed -n '873,931p' .github/workflows/publish.yml`,
-      `echo "=== build artifact upload ==="`,
-      `sed -n '405,437p' .github/workflows/publish.yml`,
-      `echo "=== package validation workflow ==="`,
-      `sed -n '1,135p' .github/workflows/package-validation.yml`,
-      `echo "=== existing bundled-deps audit ==="`,
-      `sed -n '1,160p' scripts/audit-bundled-deps.mjs`,
-    ].join(' && '),
-    captureOutput: true,
-    failOnError: true,
-  });
-
-  wf.step('plan', {
-    agent: 'architect',
-    dependsOn: ['read-package-context', 'read-ci-context'],
-    task: `
-Design the durable fix for npm publish hardening.
-
-Known failure: npm registry rejected the package because the tarball contained
-a hard-link entry under packages/openclaw/node_modules/esbuild/bin/esbuild.
-
-Package context:
-{{steps.read-package-context.output}}
-
-CI context:
-{{steps.read-ci-context.output}}
-
-Produce an implementation checklist for:
-1. A reusable tarball validator script.
-2. Package manifest or npmignore changes that exclude nested workspace node_modules.
-3. publish.yml changes that pack once, validate, then publish that exact .tgz.
-4. package-validation.yml changes that run the same gate on PRs.
-5. Smoke checks that prove agent-relay still imports and the CLI entry exists.
-
-Do not write code. End with PLAN_COMPLETE.
-`.trim(),
-    verification: { type: 'output_contains', value: 'PLAN_COMPLETE' },
-  });
-
-  wf.step('implement-tarball-validator', {
-    agent: 'package-worker',
-    cwd: WORKTREE,
-    dependsOn: ['plan'],
-    task: `
-Implement the tarball validation utility from this plan:
-{{steps.plan.output}}
-
-Create or update scripts/validate-npm-tarball.mjs and update package.json scripts.
-
-Requirements:
-1. Use Node.js and the existing tar package; do not add dependencies.
-2. Accept one or more .tgz paths. If no path is provided, create a temporary
-   package with npm pack --ignore-scripts --json and validate it.
-3. Fail if any tar entry type is a hard link.
-4. Fail if any path matches package/packages/*/node_modules/*.
-5. Fail if non-bundled workspace packages appear under package/packages/.
-6. Print a concise summary of entry count, package size, and violations.
-
-Only touch scripts/validate-npm-tarball.mjs and package.json.
-`.trim(),
-    verification: { type: 'exit_code', value: '0' },
-  });
-
-  wf.step('verify-validator-was-added', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['implement-tarball-validator'],
-    command: [
-      `test -f scripts/validate-npm-tarball.mjs`,
-      `node --check scripts/validate-npm-tarball.mjs`,
-      `node -e "const s=require('./package.json').scripts||{}; if (!s['pack:validate']) { console.error('missing pack:validate script'); process.exit(1); } console.log(s['pack:validate']);"`,
-    ].join(' && '),
-    captureOutput: true,
-    failOnError: true,
-  });
-
-  wf.step('harden-package-surface', {
-    agent: 'package-worker',
-    cwd: WORKTREE,
-    dependsOn: ['verify-validator-was-added'],
-    task: `
-Harden package inclusion using this plan:
-{{steps.plan.output}}
-
-Current validation output:
-{{steps.verify-validator-was-added.output}}
-
-Update package.json and/or .npmignore so the root package cannot include:
-1. packages/*/node_modules/**
-2. packages/openclaw/** unless there is a deliberate runtime requirement
-3. workspace test files, .turbo logs, and transient build caches
-
-Prefer replacing the broad "packages" files entry with explicit runtime
-entries for the bundled @agent-relay packages. Preserve files needed by root
-exports, postinstall workspace linking, SDK binaries, README, and licenses.
-
-Only touch package.json and .npmignore.
-`.trim(),
-    verification: { type: 'exit_code', value: '0' },
-  });
-
-  wf.step('verify-package-surface', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['harden-package-surface', 'install-worktree-deps'],
-    command: [
-      `node --check scripts/validate-npm-tarball.mjs`,
-      `node -e "const fs=require('fs'); const p=require('./package.json'); const files=p.files||[]; const npmignore=fs.existsSync('.npmignore')?fs.readFileSync('.npmignore','utf8'):''; const hasBroadPackages=files.includes('packages'); const ignoresNested=/packages\\/\\*\\/node_modules|packages\\/\\*\\*\\/node_modules|\\*\\*\\/node_modules/.test(npmignore); if (hasBroadPackages && !ignoresNested) { console.error('package.json still includes broad packages without an explicit nested node_modules exclusion'); process.exit(1); } console.log('package surface guard ok');"`,
-    ].join(' && '),
-    captureOutput: true,
-    failOnError: true,
-  });
-
-  wf.step('harden-publish-workflow', {
-    agent: 'workflow-worker',
-    cwd: WORKTREE,
-    dependsOn: ['plan'],
-    task: `
-Harden .github/workflows/publish.yml using this plan:
-{{steps.plan.output}}
-
-Required publish-main behavior:
-1. Install dependencies for bundling without dev workspace node_modules.
-2. Remove nested packages/*/node_modules and transient package caches before pack.
-3. Run npm pack --ignore-scripts once into a temporary directory.
-4. Run node scripts/validate-npm-tarball.mjs against that exact tarball.
-5. Publish that exact .tgz with npm publish <tarball> --provenance.
-6. Dry-run mode must dry-run the same validated .tgz.
-
-Keep provenance and tag behavior unchanged. Only touch .github/workflows/publish.yml.
-`.trim(),
-    verification: { type: 'exit_code', value: '0' },
-  });
-
-  wf.step('verify-publish-workflow', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['harden-publish-workflow'],
-    command: [
-      `if git diff --quiet .github/workflows/publish.yml; then echo "publish workflow was not changed"; exit 1; fi`,
-      `grep -q "validate-npm-tarball" .github/workflows/publish.yml`,
-      `grep -q "npm publish .*\\.tgz\\|npm publish.*NPM_TARBALL" .github/workflows/publish.yml`,
-    ].join(' && '),
-    failOnError: true,
-  });
-
-  wf.step('harden-pr-validation', {
-    agent: 'workflow-worker',
-    cwd: WORKTREE,
-    dependsOn: ['verify-package-surface'],
-    task: `
-Update .github/workflows/package-validation.yml for the same artifact gate.
-
-Inputs:
-{{steps.plan.output}}
-{{steps.verify-package-surface.output}}
-
-Add a validation step after build and bundled dependency audit that runs:
-  npm run pack:validate
-
-The PR gate must fail before merge if the root npm tarball contains hard links,
-nested workspace node_modules, or non-bundled workspace packages.
-
-Only touch .github/workflows/package-validation.yml.
-`.trim(),
-    verification: { type: 'exit_code', value: '0' },
-  });
-
-  wf.step('verify-pr-validation', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['harden-pr-validation'],
-    command: [
-      `if git diff --quiet .github/workflows/package-validation.yml; then echo "package validation workflow was not changed"; exit 1; fi`,
-      `grep -q "pack:validate" .github/workflows/package-validation.yml`,
-    ].join(' && '),
-    failOnError: true,
-  });
-
-  wf.step('build-for-pack-validation', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['verify-publish-workflow', 'verify-pr-validation', 'install-worktree-deps'],
-    command: 'npm run build',
-    failOnError: true,
-  });
-
-  wf.step('full-verification', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['build-for-pack-validation'],
-    command: [
-      `npm run pack:validate`,
-      `node -e "import('./dist/src/index.js').then(() => console.log('root import ok'))"`,
-      `test -f dist/src/cli/index.js`,
-      `git diff --stat`,
-    ].join(' && '),
-    captureOutput: true,
-    failOnError: true,
-  });
-
-  wf.step('review', {
-    agent: 'reviewer',
-    cwd: WORKTREE,
-    dependsOn: ['full-verification'],
-    task: `
-Review the npm publish hardening changes.
-
-Verification output:
-{{steps.full-verification.output}}
-
-Check:
-1. The package tarball validator catches hard links and nested workspace node_modules.
-2. Root package contents still include files required by exports and postinstall.
-3. publish.yml publishes the exact validated .tgz, not the mutable directory.
-4. package-validation.yml runs the same gate on PRs.
-5. No unrelated workflow or package metadata churn was introduced.
-
-Fix small issues if needed, then run npm run pack:validate.
-`.trim(),
-    verification: { type: 'exit_code', value: '0' },
-  });
-
-  wf.step('final-check', {
-    type: 'deterministic',
-    cwd: WORKTREE,
-    dependsOn: ['review'],
-    command: [`npm run pack:validate`, `git diff --check`, `git status --short`].join(' && '),
-    captureOutput: true,
-    failOnError: true,
-  });
-
-  const result = await wf.run();
-  console.log(`Done: ${result.status} (${result.id})`);
-}
-
-main().catch(console.error);
diff --git a/workflows/cloud-connect/fix-agent-relay-utils-bundling.ts b/workflows/cloud-connect/fix-agent-relay-utils-bundling.ts
deleted file mode 100644
index ceea1a702..000000000
--- a/workflows/cloud-connect/fix-agent-relay-utils-bundling.ts
+++ /dev/null
@@ -1,597 +0,0 @@
-/**
- * fix-agent-relay-utils-bundling.ts
- *
- * ## Problem
- *
- * `npx agent-relay cloud connect openai` fails with:
- *
- *   Error [ERR_MODULE_NOT_FOUND]: Cannot find package '@agent-relay/utils'
- *   imported from /root/.npm/_npx/.../node_modules/agent-relay/dist/cli/commands/cloud/connect.js
- *
- * But `agent-relay --version` and `agent-relay --help` work fine. The existing
- * post-publish verification suite only exercises `--version` and `--help`, so
- * this regression shipped without being caught.
- *
- * ## Root cause hypothesis
- *
- * `agent-relay/package.json` lists `@agent-relay/utils` in `bundledDependencies`
- * and in `dependencies` (at workspace version). The idea: npm pack should
- * bundle `node_modules/@agent-relay/utils` into the tarball so runtime import
- * resolution finds it.
- *
- * What actually ships (inspected the 3.2.22 tarball installed in cloud repo):
- *
- *   - `packages/utils/dist/` is present (via the `files` array)
- *   - `packages/utils/package.json` is present
- *   - `node_modules/@agent-relay/` directory is NOT present
- *
- * So the `bundledDependencies` mechanism didn't copy anything. The most
- * likely cause: at `npm pack` time, the workspace is symlinked from the root
- * `node_modules/@agent-relay/utils` → `packages/utils` rather than being a
- * real directory. npm's bundledDependencies implementation does not follow
- * symlinks out of the package root by default, so it silently bundles
- * nothing. Code that `import '@agent-relay/utils'` then can't resolve at
- * runtime because there is no `node_modules/@agent-relay/utils` in the
- * installed tarball and the published package.json doesn't declare a
- * `workspaces` config that would make npm install resolve the sibling
- * `packages/utils` directory.
- *
- * (An alternative mechanism would be to use `npm pkg fix` / `exports` + a
- * file: dependency, but we want to keep the bundledDependencies contract
- * rather than re-plumb resolution for all nine workspace packages.)
- *
- * ## Fix strategy
- *
- * The safest fix is to make `prepack` materialize real directories (not
- * symlinks) at `node_modules/@agent-relay/*` before `npm pack` runs, so
- * bundledDependencies copies them into the tarball. Concretely:
- *
- *   1. In `scripts/`, add `prepack-materialize-workspaces.mjs` that:
- *      - For each entry in `package.json#bundledDependencies` starting with
- *        `@agent-relay/`:
- *      - Check `node_modules/<name>` — if it's a symlink, resolve the target,
- *        rm the symlink, and copy the target directory contents (dist, package.json,
- *        README.md if present) into `node_modules/<name>`.
- *      - Exit cleanly if already a real directory.
- *   2. Wire it into `prepack` in `package.json`: run the materialize script
- *      AFTER `npm run build` and BEFORE npm pack completes.
- *   3. Add a `verify-bundled-deps.mjs` script that, post-build, verifies every
- *      entry in bundledDependencies has a real directory at `node_modules/<name>/package.json`.
- *   4. Add a `prepublishOnly` hook that runs `npm pack --dry-run --json`,
- *      parses the file list, and asserts every `node_modules/@agent-relay/<pkg>/package.json`
- *      is present in the pack list. Fail-fast if any are missing.
- *
- * The workflow delegates step 1–4 to agents, then validates with a
- * smoke test that actually runs a cloud-connect-like code path against the
- * just-built tarball.
- *
- * ## Post-publish verification enhancement
- *
- * `scripts/post-publish-verify/verify-install.sh` currently runs only
- * `agent-relay --version`, `--help`, `version`, and SDK require tests. None of
- * these exercise `@agent-relay/utils`. Add Test 6:
- *
- *   - From the installed package directory, run
- *     `node -e "require('agent-relay/dist/cli/commands/cloud/connect.js')"`
- *     (or the equivalent ESM import) and assert it does not throw
- *     ERR_MODULE_NOT_FOUND.
- *   - Additionally, direct-resolve: `require.resolve('@agent-relay/utils', { paths: [packageDir] })`
- *     — this must succeed, otherwise bundledDependencies didn't ship anything.
- *
- * ## Acceptance contract
- *
- *   A1  `node scripts/verify-bundled-deps.mjs` exits 0 and prints OK for every
- *       @agent-relay/* entry in bundledDependencies
- *   A2  `npm pack --dry-run --json` output contains an entry for
- *       `node_modules/@agent-relay/utils/package.json`
- *   A3  After `npm pack && tar -xzf agent-relay-*.tgz -C /tmp/pack-check`,
- *       the extracted `package/node_modules/@agent-relay/utils/dist/index.js`
- *       exists and is non-empty
- *   A4  In a fresh temp dir, `npm install /absolute/path/to/agent-relay-*.tgz`
- *       followed by `node -e "require.resolve('@agent-relay/utils', { paths: ['./node_modules/agent-relay'] })"`
- *       exits 0 with a path printed
- *   A5  `scripts/post-publish-verify/verify-install.sh` contains a new Test 6
- *       block that invokes the cloud-connect dynamic import path and asserts
- *       no ERR_MODULE_NOT_FOUND
- *   A6  `npx tsc --noEmit` is clean
- *   A7  Existing ssh-interactive and cli tests still pass (regression guard)
- *
- * ## Usage
- *
- *   cd /Users/khaliqgant/Projects/AgentWorkforce/relay
- *   agent-relay run workflows/cloud-connect/fix-agent-relay-utils-bundling.ts
- */
-
-import { workflow } from '@agent-relay/sdk/workflows';
-import { CodexModels } from '@agent-relay/config';
-
-const PKG_JSON = 'package.json';
-const PREPACK_SCRIPT = 'scripts/prepack-materialize-workspaces.mjs';
-const VERIFY_SCRIPT = 'scripts/verify-bundled-deps.mjs';
-const POST_PUBLISH = 'scripts/post-publish-verify/verify-install.sh';
-
-async function main() {
-  const result = await workflow('fix-agent-relay-utils-bundling')
-    .description(
-      'Ensure @agent-relay/utils (and sibling workspace packages) are actually bundled into the published tarball. Adds prepack materializer, pack-verifier, and post-publish regression test.'
-    )
-    .pattern('dag')
-    .channel('wf-fix-bundling')
-    .maxConcurrency(3)
-    .timeout(3_600_000)
-
-    .agent('impl', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Writes the materialize + verify scripts and wires prepack/prepublishOnly',
-      retries: 2,
-    })
-    .agent('tester', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Enhances post-publish verification to exercise @agent-relay/utils import path',
-      retries: 2,
-    })
-    .agent('fixer', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Fixes type errors, script failures, and packing issues',
-      retries: 2,
-    })
-
-    // ── Phase 0: Setup branch ────────────────────────────────────────
-    .step('setup-branch', {
-      type: 'deterministic',
-      command: `set -e
-BRANCH="fix/cloud-connect-workflows"
-CURRENT=$(git branch --show-current)
-if [ "$CURRENT" = "$BRANCH" ]; then
-  echo "Already on $BRANCH"
-elif git checkout -b "$BRANCH" 2>/dev/null; then
-  echo "Checked out new $BRANCH"
-elif git checkout "$BRANCH" 2>/dev/null; then
-  echo "Checked out existing $BRANCH"
-else
-  echo "Branch $BRANCH unavailable in this worktree; staying on $CURRENT"
-fi
-echo "BRANCH: $(git branch --show-current)"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 1: Reproduce the bug ───────────────────────────────────
-    .step('reproduce-bug', {
-      type: 'deterministic',
-      dependsOn: ['setup-branch'],
-      command: `set -e
-echo "=== bundledDependencies (expected to list @agent-relay/utils) ==="
-node -e "console.log(JSON.stringify(require('./${PKG_JSON}').bundledDependencies, null, 2))"
-
-echo ""
-echo "=== current node_modules/@agent-relay/ directory (symlink or real?) ==="
-ls -la node_modules/@agent-relay/ 2>&1 | head -30 || echo "(does not exist)"
-
-echo ""
-echo "=== what 'npm pack --dry-run' would ship under node_modules/@agent-relay ==="
-npm pack --dry-run --json 2>/dev/null | node -e "
-const chunks = [];
-process.stdin.on('data', c => chunks.push(c));
-process.stdin.on('end', () => {
-  const data = JSON.parse(Buffer.concat(chunks).toString());
-  const files = (data[0] && data[0].files) || [];
-  const hits = files.filter(f => f.path && f.path.includes('node_modules/@agent-relay/'));
-  if (hits.length === 0) {
-    console.log('REPRO_CONFIRMED: no node_modules/@agent-relay/* entries in pack list');
-  } else {
-    console.log('UNEXPECTED: pack list contains ' + hits.length + ' @agent-relay node_modules entries:');
-    hits.slice(0, 10).forEach(h => console.log('  ' + h.path));
-  }
-});
-"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    // ── Phase 2: Implement the prepack materializer ──────────────────
-    .step('implement-materialize', {
-      agent: 'impl',
-      dependsOn: ['reproduce-bug'],
-      timeoutMs: 900_000,
-      task: `Reproduction output:
-
-{{steps.reproduce-bug.output}}
-
-Write a new script at \`${PREPACK_SCRIPT}\`. It must be a plain Node.js ESM script (\`.mjs\`). Its job is to ensure every \`@agent-relay/*\` package listed in the repo-root \`package.json#bundledDependencies\` exists as a **real directory** at \`node_modules/<pkgname>/\` (not a symlink) before \`npm pack\` runs.
-
-Exact contract:
-
-\`\`\`js
-#!/usr/bin/env node
-// scripts/prepack-materialize-workspaces.mjs
-//
-// npm pack's bundledDependencies mechanism only ships real directories under
-// node_modules/. In a workspace, node_modules/@agent-relay/<pkg> is typically
-// a symlink to packages/<pkg>, and npm pack does not follow symlinks out of
-// the package root. Result: bundledDependencies silently ships nothing.
-//
-// This script runs in prepack, detects any symlinked workspace packages, and
-// replaces them with real directories containing dist/, package.json, and
-// README.md (if present). The replacement is scoped to a .materialized marker
-// so it is idempotent and safe to re-run.
-
-import { readFileSync, readdirSync, statSync, lstatSync, existsSync, mkdirSync, rmSync, cpSync, writeFileSync } from 'node:fs';
-import { resolve, dirname, join } from 'node:path';
-import { fileURLToPath } from 'node:url';
-
-const __dirname = dirname(fileURLToPath(import.meta.url));
-const ROOT = resolve(__dirname, '..');
-const pkg = JSON.parse(readFileSync(join(ROOT, 'package.json'), 'utf8'));
-
-const bundled = pkg.bundledDependencies || pkg.bundleDependencies || [];
-const targets = bundled.filter((n) => n.startsWith('@agent-relay/'));
-
-if (targets.length === 0) {
-  console.log('[prepack-materialize] no @agent-relay/* entries in bundledDependencies — nothing to do');
-  process.exit(0);
-}
-
-let materialized = 0;
-for (const name of targets) {
-  const dst = join(ROOT, 'node_modules', name);
-  if (!existsSync(dst)) {
-    console.error('[prepack-materialize] MISSING: ' + dst + ' — run npm install first');
-    process.exit(1);
-  }
-  const lst = lstatSync(dst);
-  if (lst.isDirectory() && !lst.isSymbolicLink()) {
-    const marker = join(dst, '.materialized');
-    if (existsSync(marker)) {
-      console.log('[prepack-materialize] already materialized: ' + name);
-      continue;
-    }
-    // Non-symlink, but not marked. Leave as-is.
-    console.log('[prepack-materialize] real dir (unmarked): ' + name);
-    continue;
-  }
-  // It's a symlink — resolve, then replace
-  const target = statSync(dst).isDirectory() ? require('node:fs').realpathSync(dst) : null;
-  // Using the sync API available in ESM:
-  const real = (await import('node:fs/promises')).realpath(dst);
-  // (keep both fallbacks working in old node)
-  const realPath = target || await real;
-  console.log('[prepack-materialize] ' + name + ' → symlink → ' + realPath);
-
-  rmSync(dst, { recursive: true, force: true });
-  mkdirSync(dst, { recursive: true });
-
-  // Copy: package.json, dist (if present), README.md (if present)
-  const pkgJsonSrc = join(realPath, 'package.json');
-  if (!existsSync(pkgJsonSrc)) {
-    console.error('[prepack-materialize] ' + name + ' missing package.json at ' + pkgJsonSrc);
-    process.exit(1);
-  }
-  cpSync(pkgJsonSrc, join(dst, 'package.json'));
-  if (existsSync(join(realPath, 'dist'))) {
-    cpSync(join(realPath, 'dist'), join(dst, 'dist'), { recursive: true });
-  }
-  if (existsSync(join(realPath, 'README.md'))) {
-    cpSync(join(realPath, 'README.md'), join(dst, 'README.md'));
-  }
-  writeFileSync(join(dst, '.materialized'), 'materialized-by-prepack\\n');
-  materialized++;
-}
-
-console.log('[prepack-materialize] done — materialized ' + materialized + ' package(s)');
-\`\`\`
-
-Feel free to simplify the realpath dance (pick one approach — the \`realpathSync\` sync call is fine for an mjs file). Do NOT add any unrelated features; the script must be tight and auditable.
-
-Also write a second script at \`${VERIFY_SCRIPT}\`:
-
-\`\`\`js
-#!/usr/bin/env node
-// scripts/verify-bundled-deps.mjs
-//
-// Post-prepack sanity check: every @agent-relay/* entry in bundledDependencies
-// must have a real directory at node_modules/<name>/package.json. Run from
-// prepublishOnly to fail the publish if anything is off.
-
-import { readFileSync, existsSync, lstatSync } from 'node:fs';
-import { resolve, dirname, join } from 'node:path';
-import { fileURLToPath } from 'node:url';
-
-const ROOT = resolve(dirname(fileURLToPath(import.meta.url)), '..');
-const pkg = JSON.parse(readFileSync(join(ROOT, 'package.json'), 'utf8'));
-const bundled = (pkg.bundledDependencies || pkg.bundleDependencies || []).filter((n) => n.startsWith('@agent-relay/'));
-
-let failed = 0;
-for (const name of bundled) {
-  const dir = join(ROOT, 'node_modules', name);
-  const pj = join(dir, 'package.json');
-  if (!existsSync(pj)) {
-    console.error('[verify-bundled] MISSING package.json: ' + pj);
-    failed++;
-    continue;
-  }
-  if (lstatSync(dir).isSymbolicLink()) {
-    console.error('[verify-bundled] STILL A SYMLINK: ' + dir + ' — prepack materializer did not run');
-    failed++;
-    continue;
-  }
-  console.log('[verify-bundled] OK: ' + name);
-}
-
-if (failed > 0) {
-  console.error('[verify-bundled] FAIL — ' + failed + ' package(s) not ready for npm pack');
-  process.exit(1);
-}
-console.log('[verify-bundled] all bundled @agent-relay/* packages ready');
-\`\`\`
-
-Finally, update \`${PKG_JSON}\`:
-
-1. In \`scripts.prepack\`, append \`&& node scripts/prepack-materialize-workspaces.mjs && node scripts/verify-bundled-deps.mjs\`. Preserve the existing conditional build step — the new form should be:
-
-   \`\`\`
-   "prepack": "if [ -d node_modules ]; then npm run build; else echo '⚠ node_modules not found, skipping prepack build'; fi && node scripts/prepack-materialize-workspaces.mjs && node scripts/verify-bundled-deps.mjs"
-   \`\`\`
-
-2. Add a \`prepublishOnly\` script: \`"prepublishOnly": "node scripts/verify-bundled-deps.mjs"\`.
-
-Do NOT modify any other script, dependency, or version field. End your message with IMPL_DONE.`,
-      verification: { type: 'output_contains', value: 'IMPL_DONE' },
-    })
-
-    // ── Phase 3: Verify scripts landed ───────────────────────────────
-    .step('verify-impl', {
-      type: 'deterministic',
-      dependsOn: ['implement-materialize'],
-      command: `set -e
-test -f ${PREPACK_SCRIPT} || (echo "MISSING ${PREPACK_SCRIPT}"; exit 1)
-test -f ${VERIFY_SCRIPT} || (echo "MISSING ${VERIFY_SCRIPT}"; exit 1)
-
-node -e "const s = require('./${PKG_JSON}').scripts; if (!s.prepack.includes('prepack-materialize-workspaces')) { console.error('prepack not wired'); process.exit(1); } if (!s.prepublishOnly || !s.prepublishOnly.includes('verify-bundled-deps')) { console.error('prepublishOnly not wired'); process.exit(1); } console.log('PKG_SCRIPTS_OK');"
-
-echo "VERIFY_IMPL_OK"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 4: Run the verifier (dry — pre-materialize) ────────────
-    .step('run-verify-before', {
-      type: 'deterministic',
-      dependsOn: ['verify-impl'],
-      command: `node ${VERIFY_SCRIPT} 2>&1 || echo "(expected failure pre-materialize)"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    // ── Phase 5: Run the materializer ────────────────────────────────
-    .step('run-materialize', {
-      type: 'deterministic',
-      dependsOn: ['run-verify-before'],
-      command: `node ${PREPACK_SCRIPT} 2>&1`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('fix-materialize', {
-      agent: 'fixer',
-      dependsOn: ['run-materialize'],
-      timeoutMs: 600_000,
-      task: `Materialize script output:
-
-{{steps.run-materialize.output}}
-
-If the script printed "done — materialized" or "already materialized" and exited 0, do nothing and end with MATERIALIZE_OK.
-
-If it crashed (TypeError, ReferenceError, ENOENT, etc.), read ${PREPACK_SCRIPT}, find the bug, fix it, and re-run \`node ${PREPACK_SCRIPT}\`. Iterate until it's green. End with MATERIALIZE_OK.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('run-verify-after', {
-      type: 'deterministic',
-      dependsOn: ['fix-materialize'],
-      command: `node ${VERIFY_SCRIPT} 2>&1`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 6: npm pack smoke — does the tarball actually contain utils? ──
-    .step('pack-smoke', {
-      type: 'deterministic',
-      dependsOn: ['run-verify-after'],
-      command: `set -e
-rm -f agent-relay-*.tgz
-# Skip the build step to keep this fast — we just care about the pack list
-SKIP_BUILD=1 npm pack --dry-run --json 2>/dev/null > /tmp/pack-dry.json || npm pack --dry-run --json > /tmp/pack-dry.json
-node -e "
-const data = JSON.parse(require('fs').readFileSync('/tmp/pack-dry.json', 'utf8'));
-const files = (data[0] && data[0].files) || [];
-const utilsEntries = files.filter(f => f.path && f.path.startsWith('node_modules/@agent-relay/utils/'));
-if (utilsEntries.length === 0) {
-  console.error('PACK_SMOKE_FAIL: no node_modules/@agent-relay/utils entries in pack list');
-  console.error('sample pack entries:');
-  files.slice(0, 20).forEach(f => console.error('  ' + f.path));
-  process.exit(1);
-}
-console.log('PACK_SMOKE_OK — ' + utilsEntries.length + ' utils entries in pack list');
-utilsEntries.slice(0, 5).forEach(e => console.log('  ' + e.path));
-"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 7: Enhance post-publish verification ───────────────────
-    .step('enhance-post-publish', {
-      agent: 'tester',
-      dependsOn: ['pack-smoke'],
-      timeoutMs: 600_000,
-      task: `Edit \`${POST_PUBLISH}\`. Add a new test block (Test 6) near the end of the file, BEFORE the \`# Summary\` section.
-
-The current script only runs \`agent-relay --version\` / \`--help\` / SDK require tests. None of these load any code that imports \`@agent-relay/utils\`, so the ERR_MODULE_NOT_FOUND bug ships unnoticed.
-
-Add this block (bash, using the existing log_header / record_pass / record_fail helpers and the existing \`TEST_PROJECT_DIR\` and installed \`./node_modules/agent-relay\`):
-
-\`\`\`bash
-# ============================================
-# Test 6: @agent-relay/utils resolution (regression guard for bundledDependencies)
-# ============================================
-log_header "Test 6: @agent-relay/utils resolution"
-
-log_info "Verifying @agent-relay/utils resolves from installed agent-relay..."
-UTILS_RESOLUTION=$(node -e "
-try {
-    const path = require('path');
-    const pkgDir = path.dirname(require.resolve('agent-relay/package.json'));
-    const resolved = require.resolve('@agent-relay/utils', { paths: [pkgDir] });
-    console.log('RESOLVED:', resolved);
-    console.log('UTILS_RESOLVE_OK');
-} catch (e) {
-    console.log('UTILS_RESOLVE_FAIL:', e.code || e.message);
-}
-" 2>&1) || true
-
-log_info "Utils resolution output: $UTILS_RESOLUTION"
-if echo "$UTILS_RESOLUTION" | grep -q "UTILS_RESOLVE_OK"; then
-    record_pass "@agent-relay/utils resolves from installed agent-relay"
-else
-    record_fail "@agent-relay/utils is NOT resolvable — bundledDependencies regression"
-fi
-
-log_info "Dynamic-import smoke test for cloud connect code path..."
-CLOUD_CONNECT_SMOKE=$(node --input-type=module -e "
-try {
-    await import('agent-relay/dist/cli/commands/cloud/connect.js');
-    console.log('CLOUD_CONNECT_IMPORT_OK');
-} catch (e) {
-    if (e && e.code === 'ERR_MODULE_NOT_FOUND') {
-        console.log('CLOUD_CONNECT_IMPORT_FAIL:', e.message);
-    } else {
-        // A different error (e.g. expecting argv) is fine — the module loaded
-        console.log('CLOUD_CONNECT_IMPORT_OK_WITH_RUNTIME_ERR');
-    }
-}
-" 2>&1) || true
-
-log_info "Cloud connect import output: $CLOUD_CONNECT_SMOKE"
-if echo "$CLOUD_CONNECT_SMOKE" | grep -q "CLOUD_CONNECT_IMPORT_OK"; then
-    record_pass "cloud connect module imports without ERR_MODULE_NOT_FOUND"
-elif echo "$CLOUD_CONNECT_SMOKE" | grep -q "CLOUD_CONNECT_IMPORT_FAIL"; then
-    record_fail "cloud connect import FAILED with ERR_MODULE_NOT_FOUND: $CLOUD_CONNECT_SMOKE"
-else
-    log_warn "cloud connect import had unknown outcome: $CLOUD_CONNECT_SMOKE"
-    record_fail "cloud connect import indeterminate"
-fi
-\`\`\`
-
-Important notes:
-- The exact path \`agent-relay/dist/cli/commands/cloud/connect.js\` may not exist in the current build output — before committing, \`ls node_modules/agent-relay/dist/cli/commands/cloud/\` in the current repo (from the test project dir used by earlier Tests 3–5) to discover the actual exported entry. If \`connect.js\` isn't there, pick any file that transitively imports \`@agent-relay/utils\` — you can grep \`dist/\` for \`require("@agent-relay/utils")\` or the ESM equivalent to find a known-good target.
-- Preserve all existing test blocks; only add Test 6.
-- Make sure the \`Summary\` section and \`exit $TESTS_FAILED\` logic still sees the new pass/fail counts (using \`record_pass\` / \`record_fail\` handles this automatically).
-
-End your message with POST_PUBLISH_DONE.`,
-      verification: { type: 'output_contains', value: 'POST_PUBLISH_DONE' },
-    })
-
-    .step('verify-post-publish', {
-      type: 'deterministic',
-      dependsOn: ['enhance-post-publish'],
-      command: `set -e
-grep -q "Test 6: @agent-relay/utils resolution" ${POST_PUBLISH} || (echo "MISSING Test 6 block"; exit 1)
-grep -q "UTILS_RESOLVE_OK" ${POST_PUBLISH} || (echo "MISSING resolve assertion"; exit 1)
-grep -Eq "UTILS_IMPORT_OK|CLOUD_CONNECT_IMPORT|CLI_BOOTSTRAP_IMPORT" ${POST_PUBLISH} || (echo "MISSING import assertion marker"; exit 1)
-bash -n ${POST_PUBLISH} || (echo "SHELL SYNTAX ERROR"; exit 1)
-echo "POST_PUBLISH_VERIFY_OK"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 8: Full regression — typecheck and existing tests ───────
-    .step('typecheck', {
-      type: 'deterministic',
-      dependsOn: ['verify-post-publish'],
-      command: `npx tsc --noEmit 2>&1 | tail -40; echo "EXIT: $?"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-typecheck', {
-      agent: 'fixer',
-      dependsOn: ['typecheck'],
-      timeoutMs: 600_000,
-      task: `Typecheck output:
-{{steps.typecheck.output}}
-
-If EXIT: 0, do nothing and end with TYPECHECK_OK.
-Otherwise the script edits introduced a new typecheck error. Fix it in the smallest possible diff. The new .mjs files should not be typechecked by tsc — if they are, either exclude them from tsconfig or add an appropriate jsconfig/JSDoc stub. Re-run \`npx tsc --noEmit\`. End with TYPECHECK_OK.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('typecheck-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-typecheck'],
-      command: `npx tsc --noEmit 2>&1 | tail -40`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('regression-cli-tests', {
-      type: 'deterministic',
-      dependsOn: ['typecheck-final'],
-      command: `npx vitest run src/cli 2>&1 | tail -40`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-regressions', {
-      agent: 'fixer',
-      dependsOn: ['regression-cli-tests'],
-      timeoutMs: 600_000,
-      task: `Vitest output:
-{{steps.regression-cli-tests.output}}
-
-If all green, end with NO_REGRESSIONS.
-If anything broke, the bundling changes should not have touched src/ — investigate and fix the root cause. End with NO_REGRESSIONS.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('regression-cli-tests-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-regressions'],
-      command: `npx vitest run src/cli 2>&1 | tail -30`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 9: Summary ─────────────────────────────────────────────
-    .step('summary', {
-      type: 'deterministic',
-      dependsOn: ['regression-cli-tests-final'],
-      command: `echo "=== Files changed ==="
-git status --short
-echo ""
-echo "=== Diff summary ==="
-git diff --stat
-echo ""
-echo "All green. The tarball will now ship node_modules/@agent-relay/utils"
-echo "and the post-publish verifier will catch this regression class going forward."`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .onError('retry', { maxRetries: 1, retryDelayMs: 5_000 })
-    .run({ cwd: process.cwd() });
-
-  console.log('Workflow status:', result.status);
-  console.log('Steps completed:', Object.keys(result.steps || {}));
-  process.exit(result.status === 'completed' ? 0 : 1);
-}
-
-main().catch((error) => {
-  console.error('Workflow failed:', error);
-  process.exit(1);
-});
diff --git a/workflows/cloud-connect/fix-cloud-connect-claude-hang.ts b/workflows/cloud-connect/fix-cloud-connect-claude-hang.ts
deleted file mode 100644
index 961119cc5..000000000
--- a/workflows/cloud-connect/fix-cloud-connect-claude-hang.ts
+++ /dev/null
@@ -1,532 +0,0 @@
-/**
- * fix-cloud-connect-claude-hang.ts
- *
- * ## Problem
- *
- * `agent-relay cloud connect anthropic` prints "Starting interactive
- * authentication..." and then hangs with zero further output. The SSH channel
- * to the Daytona sandbox is open, but claude's Ink-based TUI never renders
- * anything in the local terminal. The same codepath works for opencode and
- * cursor in some shapes but hangs consistently for claude.
- *
- * This is a separate bug from the openai PATH-propagation hang (fixed
- * server-side in cloud repo — that one was `VAR=val cmd1; cmd2` bash scoping).
- * Claude is preinstalled on the Daytona base image so PATH isn't the issue;
- * this one lives in the relay CLI's SSH shell bridge.
- *
- * ## Hypotheses
- *
- * The root cause is in `src/cli/lib/ssh-interactive.ts` inside the
- * `sshClient.shell({ term, cols, rows }, (err, stream) => { ... })` callback
- * (around line 180). Two concrete things are wrong:
- *
- * **H1 — data-listener race.** `stream.on('data', ...)` is attached AFTER
- * `stream.write(\`${command}; exit $?\\n\`)`. ssh2 ClientChannel is a
- * Readable stream. While no 'data' listener is attached, the stream is
- * paused and bytes buffer internally. When the listener is attached, the
- * stream enters flowing mode and the buffered bytes should flush.
- *
- * In practice, if the shell's early output (PS1, banner, TUI enter-alt-screen
- * sequences) is emitted in the same tick as the stream.write call, ssh2 can
- * dispatch those bytes as a single 'data' event that is dropped because no
- * listener is attached yet. The TUI then hides the cursor and clears the
- * screen — and the local terminal sits black until the 15-minute session
- * timeout.
- *
- * **H2 — `; exit $?` race.** `stream.write(\`${command}; exit $?\\n\`)` sends
- * the command plus a trailing `; exit $?`. When the wrapped CLI is an
- * Ink-based TUI (claude, codex, opencode), the TUI enters alternate-screen
- * buffer and hides the cursor at start. If the user's SSH client exits or
- * the shell process closes before the TUI has flushed its final redraw, the
- * local terminal never renders anything. Using `exec <command>` replaces the
- * shell process with the CLI outright, so there is no "shell exit" after the
- * TUI returns — the PTY closes when the CLI exits, cleanly.
- *
- * ## Fix
- *
- * 1. Move all stream event handlers (`stream.on('data', ...)`, stdin
- *    wiring, resize handler, timeout setup) BEFORE the `stream.write(...)`
- *    call inside the shell() callback. Do not call `stream.write` until
- *    after the 'data' listener is attached.
- *
- * 2. Change `stream.write(\`${command}; exit $?\\n\`)` to
- *    `stream.write(\`exec ${command}\\n\`)`. The shell is replaced with the
- *    target CLI; the PTY closes when the CLI exits and emits its exit code
- *    naturally.
- *
- * 3. Extract `ssh-interactive.ts`'s shell-invocation command transformation
- *    into a pure helper `formatShellInvocation(command: string): string` so
- *    it can be unit-tested without a real SSH server.
- *
- * 4. Add a unit test that mocks `sshClient.shell()` and asserts:
- *    - A 'data' event listener is attached BEFORE any `stream.write` call
- *    - The write payload starts with `exec ` and contains no `; exit $?`
- *    - When the fake stream emits data synchronously upon open, the handler
- *      sees it (regression test for H1)
- *
- * ## Acceptance contract
- *
- *   A1  formatShellInvocation('claude')              === 'exec claude\\n'
- *   A2  formatShellInvocation('codex login --no-browser') === 'exec codex login --no-browser\\n'
- *   A3  formatShellInvocation never contains '; exit $?'
- *   A4  In the shell() callback, on('data') is registered before stream.write
- *   A5  When the mock stream emits 'READY\\n' synchronously after open, the
- *       captured output buffer contains 'READY'
- *   A6  `npx tsc --noEmit` is clean
- *   A7  Existing ssh-interactive tests (if any) still pass
- *   A8  `npm run test:cli -- ssh-interactive` is green
- *
- * ## Usage
- *
- *   cd /Users/khaliqgant/Projects/AgentWorkforce/relay
- *   agent-relay run workflows/fix-cloud-connect-claude-hang.ts
- */
-
-import { workflow } from '@agent-relay/sdk/workflows';
-import { CodexModels } from '@agent-relay/config';
-
-const SSH_INTERACTIVE = 'src/cli/lib/ssh-interactive.ts';
-const NEW_TEST = 'src/cli/lib/ssh-interactive.test.ts';
-
-async function main() {
-  const result = await workflow('fix-cloud-connect-claude-hang')
-    .description(
-      'Fix claude TUI hang in ssh-interactive.ts — data-listener race + shell exit race. Validates with unit tests.'
-    )
-    .pattern('dag')
-    .channel('wf-fix-claude-hang')
-    .maxConcurrency(3)
-    .timeout(3_600_000)
-
-    .agent('impl', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Refactors ssh-interactive.ts shell callback and exports formatShellInvocation',
-      retries: 2,
-    })
-    .agent('tester', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Writes unit tests for formatShellInvocation and the handler-order regression',
-      retries: 2,
-    })
-    .agent('fixer', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Fixes type errors, test failures, and regressions',
-      retries: 2,
-    })
-
-    // ── Phase 0: Setup branch ────────────────────────────────────────
-    .step('setup-branch', {
-      type: 'deterministic',
-      command: `set -e
-BRANCH="fix/cloud-connect-claude-hang"
-CURRENT=$(git branch --show-current)
-if [ "$CURRENT" = "$BRANCH" ]; then
-  echo "Already on $BRANCH"
-elif git checkout -b "$BRANCH" 2>/dev/null; then
-  echo "Checked out new $BRANCH"
-elif git checkout "$BRANCH" 2>/dev/null; then
-  echo "Checked out existing $BRANCH"
-else
-  echo "Branch $BRANCH unavailable in this worktree; staying on $CURRENT"
-fi
-echo "BRANCH: $(git branch --show-current)"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 1: Read source ─────────────────────────────────────────
-    .step('read-ssh-interactive', {
-      type: 'deterministic',
-      dependsOn: ['setup-branch'],
-      command: `cat ${SSH_INTERACTIVE}`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 2: Implement the fix ───────────────────────────────────
-    .step('implement-fix', {
-      agent: 'impl',
-      dependsOn: ['read-ssh-interactive'],
-      timeoutMs: 900_000,
-      task: `Edit \`${SSH_INTERACTIVE}\`. Do not touch other files.
-
-Current file:
-{{steps.read-ssh-interactive.output}}
-
-Make three changes inside the \`sshClient.shell({ term, cols, rows }, (err, stream) => { ... })\` callback (starting around the line \`sshClient.shell({ term, cols, rows }, (err, stream) => {\`):
-
----
-
-**Change 1 — Add an exported pure helper at module top-level (above runInteractiveSession):**
-
-\`\`\`ts
-/**
- * Format a remote command for execution inside an ssh2 shell() PTY.
- *
- * Uses \`exec\` to replace the shell process with the target CLI so there is
- * no shell-teardown race after a TUI (claude, codex, opencode, etc.) returns.
- * The PTY closes when the CLI exits and emits its exit code naturally, with
- * no trailing \`; exit $?\` that can win a race against the TUI's final
- * alternate-screen-buffer flush.
- */
-export function formatShellInvocation(command: string): string {
-  return \`exec \${command}\\n\`;
-}
-\`\`\`
-
----
-
-**Change 2 — Reorder the shell() callback so data handlers are attached BEFORE the write.**
-
-Find the block that currently looks (approximately) like:
-
-\`\`\`ts
-sshClient.shell({ term, cols, rows }, (err, stream) => {
-  if (err) return reject(err);
-
-  // Send the command through the shell, then exit with its status
-  stream.write(\`\${command}; exit $?\\n\`);
-
-  let exitCode: number | null = null;
-  let exitSignal: string | null = null;
-  let authDetected = false;
-  let outputBuffer = '';
-
-  const stdin = process.stdin;
-  const stdout = process.stdout;
-  const stderr = process.stderr;
-
-  // ... raw mode setup, stdin handler, cleanup, resize handler, timer ...
-
-  stream.on('data', (data: Buffer) => { /* ... */ });
-  stream.stderr.on('data', (data: Buffer) => { /* ... */ });
-
-  // ... stream.on('exit'), stream.on('close'), stream.on('error') ...
-});
-\`\`\`
-
-Restructure so the order is:
-
-1. \`if (err) return reject(err);\`
-2. All variable declarations (\`let exitCode\`, etc.)
-3. All handler declarations (\`onStdinData\`, \`cleanup\`, \`closeOnAuthSuccess\`, \`onResize\`)
-4. \`stream.on('data', ...)\`  — MOVED BEFORE the write
-5. \`stream.stderr.on('data', ...)\` — MOVED BEFORE the write
-6. \`stream.on('exit', ...)\`
-7. \`stream.on('close', ...)\`
-8. \`stream.on('error', ...)\`
-9. \`stdout.on('resize', onResize)\`
-10. \`stdin.on('data', onStdinData)\`
-11. \`stdin.setRawMode?.(true); stdin.resume();\`
-12. \`const timer = runtime.setTimeout(...);\`
-13. **Only now:** \`stream.write(formatShellInvocation(command));\`
-
-The key invariant: no \`stream.write\` call may happen before \`stream.on('data', ...)\` is registered.
-
----
-
-**Change 3 — Replace the write payload.**
-
-OLD:
-\`\`\`ts
-stream.write(\`\${command}; exit $?\\n\`);
-\`\`\`
-
-NEW:
-\`\`\`ts
-stream.write(formatShellInvocation(command));
-\`\`\`
-
----
-
-Do NOT change any other function, the system-ssh fallback branch, types, imports (except that you may need to add an export for \`formatShellInvocation\`), or the file's public surface. Keep the diff focused.
-
-When done, end your message with EDIT_DONE.`,
-      verification: { type: 'output_contains', value: 'EDIT_DONE' },
-    })
-
-    // ── Phase 3: Verify edit landed ──────────────────────────────────
-    .step('verify-edit', {
-      type: 'deterministic',
-      dependsOn: ['implement-fix'],
-      command: `set -e
-git diff --quiet ${SSH_INTERACTIVE} && (echo "NOT MODIFIED"; exit 1) || true
-
-grep -q "export function formatShellInvocation" ${SSH_INTERACTIVE} || (echo "MISSING formatShellInvocation export"; exit 1)
-
-grep -q "formatShellInvocation(command)" ${SSH_INTERACTIVE} || (echo "NOT CALLED from shell callback"; exit 1)
-
-# Must NOT contain the old shell-wrapper stream.write call.
-# Comments may still describe the legacy behavior, so match the code pattern.
-if rg -q 'stream\\.write\\(.*exit \\$\\?\\\\n' ${SSH_INTERACTIVE}; then
-  echo "ERROR: still contains legacy shell wrapper write — must be removed"
-  exit 1
-fi
-
-# Basic ordering check: stream.on('data' should appear before stream.write(
-# inside the shell callback region. This uses line numbers.
-DATA_LINE=$(grep -n "stream.on('data'" ${SSH_INTERACTIVE} | head -1 | cut -d: -f1)
-WRITE_LINE=$(grep -n "stream.write(formatShellInvocation" ${SSH_INTERACTIVE} | head -1 | cut -d: -f1)
-
-if [ -z "$DATA_LINE" ] || [ -z "$WRITE_LINE" ]; then
-  echo "MISSING expected markers: data=$DATA_LINE write=$WRITE_LINE"
-  exit 1
-fi
-
-if [ "$DATA_LINE" -gt "$WRITE_LINE" ]; then
-  echo "ERROR: stream.on('data') (line $DATA_LINE) appears AFTER stream.write(formatShellInvocation) (line $WRITE_LINE)"
-  exit 1
-fi
-
-echo "VERIFY_EDIT_OK"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 4: Write unit tests ────────────────────────────────────
-    .step('write-tests', {
-      agent: 'tester',
-      dependsOn: ['verify-edit'],
-      timeoutMs: 600_000,
-      task: `Create \`${NEW_TEST}\`. This is a new test file — there is no existing one to extend.
-
-Use Vitest (the rest of this repo uses \`vitest\`) with \`describe/it/expect\`. Import from the module under test:
-
-\`\`\`ts
-import { describe, it, expect, vi } from 'vitest';
-import { formatShellInvocation, runInteractiveSession } from './ssh-interactive.js';
-\`\`\`
-
-Write these test cases:
-
-**Suite: formatShellInvocation**
-
-1. \`exec\` prefix: \`expect(formatShellInvocation('claude')).toBe('exec claude\\n')\`
-2. passes args through: \`expect(formatShellInvocation('codex login --no-browser')).toBe('exec codex login --no-browser\\n')\`
-3. never includes \`; exit $?\`: \`expect(formatShellInvocation('claude').includes('; exit $?')).toBe(false)\`
-4. ends with a single \`\\n\`: \`expect(formatShellInvocation('claude').endsWith('\\n')).toBe(true)\` and \`.split('\\n').length === 2\`
-
-**Suite: runInteractiveSession — handler-order regression (H1)**
-
-Construct a fake ssh2 client via the \`runtime\` option. Strategy:
-
-\`\`\`ts
-import { EventEmitter } from 'node:events';
-
-function createFakeStream() {
-  const stream: any = new EventEmitter();
-  stream.stderr = new EventEmitter();
-  stream.write = vi.fn();
-  stream.close = vi.fn();
-  stream.setWindow = vi.fn();
-  return stream;
-}
-
-function createFakeClient() {
-  const client: any = new EventEmitter();
-  const stream = createFakeStream();
-  client.stream = stream;
-  client.connect = vi.fn(() => {
-    // Fire 'ready' async
-    setImmediate(() => client.emit('ready'));
-  });
-  client.shell = vi.fn((opts: any, cb: any) => {
-    // Synchronously invoke callback with the stream
-    cb(null, stream);
-  });
-  client.forwardOut = vi.fn((src: any, p1: any, dst: any, p2: any, cb: any) => {
-    cb(null, new EventEmitter());
-  });
-  client.end = vi.fn();
-  return client;
-}
-
-const fakeSSH2 = {
-  Client: class FakeClientWrap {
-    constructor() {
-      return createFakeClient();
-    }
-  },
-};
-\`\`\`
-
-Tests:
-
-5. **stream.on('data') is attached before stream.write**:
-   - Call \`runInteractiveSession\` with a fake \`loadSSH2\` that returns \`fakeSSH2\`.
-   - After the shell() callback fires, inspect the order of events on the fake stream: listenerCount('data') must be \`>= 1\` BEFORE the first \`stream.write.mock.calls\` entry. Track this by spying on \`stream.write\` with vi.fn and recording \`stream.listenerCount('data')\` at the moment write is called.
-   - Assert the recorded listener count at write-time is \`>= 1\`.
-
-6. **write payload starts with 'exec '**:
-   - Assert \`stream.write.mock.calls[0][0].startsWith('exec ')\`.
-   - Assert \`stream.write.mock.calls[0][0].includes('; exit $?')\` is false.
-
-7. **synchronous early data is captured**:
-   - Configure the fake \`client.shell\` to emit \`stream.emit('data', Buffer.from('READY\\n'))\` synchronously inside the shell callback, immediately after passing the stream to \`cb\`.
-   - Configure the successPatterns to include \`/READY/\` so the session resolves with \`authDetected: true\`.
-   - Assert the returned \`InteractiveSessionResult.authDetected === true\`.
-
-Required test options — pass these to \`runInteractiveSession\`:
-- \`ssh: { host: 'test', port: 22, user: 'test', password: 'test' }\`
-- \`remoteCommand: 'claude'\`
-- \`successPatterns: [/READY/]\` (for test 7; use \`[]\` for tests 5 and 6)
-- \`errorPatterns: []\`
-- \`timeoutMs: 5000\`
-- \`io: { log: vi.fn(), error: vi.fn() }\`
-- \`runtime: { loadSSH2: async () => fakeSSH2, createServer: () => ({ listen: (_: any, _h: any, cb: any) => cb(), close: vi.fn(), on: vi.fn() }), setTimeout: (fn: any, ms: any) => setTimeout(fn, ms) }\`
-
-Mock stdin/stdout via vi.spyOn so raw mode + resume don't break the test environment:
-\`\`\`ts
-vi.spyOn(process.stdin, 'setRawMode').mockImplementation(() => process.stdin);
-vi.spyOn(process.stdin, 'resume').mockImplementation(() => process.stdin);
-vi.spyOn(process.stdin, 'pause').mockImplementation(() => process.stdin);
-\`\`\`
-
-You may need to call \`stream.emit('close')\` after setup to resolve the session. Study the current \`runInteractiveSession\` code to understand the resolve path.
-
-When done, end your message with TESTS_WRITTEN.`,
-      verification: { type: 'file_exists', value: NEW_TEST },
-    })
-
-    .step('verify-tests-written', {
-      type: 'deterministic',
-      dependsOn: ['write-tests'],
-      command: `set -e
-test -f ${NEW_TEST} || (echo "MISSING test file"; exit 1)
-grep -q "formatShellInvocation" ${NEW_TEST} || (echo "missing formatShellInvocation import/usage"; exit 1)
-grep -q "exec claude" ${NEW_TEST} || (echo "missing exec claude assertion"; exit 1)
-grep -q "listenerCount" ${NEW_TEST} || (echo "missing listener-count handler-order test"; exit 1)
-echo OK`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 5: Run unit tests (test-fix-rerun) ─────────────────────
-    .step('run-tests', {
-      type: 'deterministic',
-      dependsOn: ['verify-tests-written'],
-      command: `npx vitest run ${NEW_TEST} 2>&1 | tail -80`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-tests', {
-      agent: 'fixer',
-      dependsOn: ['run-tests'],
-      timeoutMs: 900_000,
-      task: `Vitest output:
-
-{{steps.run-tests.output}}
-
-If ALL tests passed, do nothing and end with ALL_GREEN.
-
-If there are failures, decide whether the bug is in:
-  (a) the implementation in ${SSH_INTERACTIVE}, or
-  (b) the test in ${NEW_TEST}, or
-  (c) the fake-ssh2 setup.
-
-The handler-order regression test (test 5) is a hard contract — if it fails because listenerCount('data') is 0 at write-time, the implementation reordering is wrong. Fix ${SSH_INTERACTIVE}, not the test.
-
-Re-run: \`npx vitest run ${NEW_TEST}\`. Iterate until green. End with ALL_GREEN.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('run-tests-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-tests'],
-      command: `npx vitest run ${NEW_TEST} 2>&1 | tail -60`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 6: Typecheck ───────────────────────────────────────────
-    .step('typecheck', {
-      type: 'deterministic',
-      dependsOn: ['run-tests-final'],
-      command: `npx tsc --noEmit 2>&1 | tail -40; echo "EXIT: $?"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-typecheck', {
-      agent: 'fixer',
-      dependsOn: ['typecheck'],
-      timeoutMs: 600_000,
-      task: `Typecheck output:
-{{steps.typecheck.output}}
-
-If EXIT: 0, do nothing and end with TYPECHECK_OK.
-If there are type errors in ${SSH_INTERACTIVE} or ${NEW_TEST}, fix them. Do not touch unrelated files. Re-run \`npx tsc --noEmit\`. End with TYPECHECK_OK.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('typecheck-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-typecheck'],
-      command: `npx tsc --noEmit 2>&1 | tail -40`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 7: Regression — run related CLI tests ──────────────────
-    .step('run-cli-tests', {
-      type: 'deterministic',
-      dependsOn: ['typecheck-final'],
-      command: `npx vitest run src/cli 2>&1 | tail -40`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-regressions', {
-      agent: 'fixer',
-      dependsOn: ['run-cli-tests'],
-      timeoutMs: 600_000,
-      task: `CLI test output:
-{{steps.run-cli-tests.output}}
-
-If all green, end with NO_REGRESSIONS.
-If the refactor broke any existing test under src/cli, fix the root cause in ${SSH_INTERACTIVE} (not the test). End with NO_REGRESSIONS.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('run-cli-tests-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-regressions'],
-      command: `npx vitest run src/cli 2>&1 | tail -30`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 8: Summary ─────────────────────────────────────────────
-    .step('summary', {
-      type: 'deterministic',
-      dependsOn: ['run-cli-tests-final'],
-      command: `echo "=== Files changed ==="
-git status --short ${SSH_INTERACTIVE} ${NEW_TEST}
-echo ""
-echo "=== Diff summary ==="
-git diff --stat ${SSH_INTERACTIVE} ${NEW_TEST}
-echo ""
-echo "All green. Review the diff, commit, and open a PR."`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .onError('retry', { maxRetries: 1, retryDelayMs: 5_000 })
-    .run({ cwd: process.cwd() });
-
-  console.log('Workflow status:', result.status);
-  console.log('Steps completed:', Object.keys(result.steps || {}));
-  process.exit(result.status === 'completed' ? 0 : 1);
-}
-
-main().catch((error) => {
-  console.error('Workflow failed:', error);
-  process.exit(1);
-});
diff --git a/workflows/cloud-connect/validate-cloud-connect-e2e.ts b/workflows/cloud-connect/validate-cloud-connect-e2e.ts
deleted file mode 100644
index 04060e4ce..000000000
--- a/workflows/cloud-connect/validate-cloud-connect-e2e.ts
+++ /dev/null
@@ -1,690 +0,0 @@
-/**
- * validate-cloud-connect-e2e.ts
- *
- * ## Problem
- *
- * `agent-relay cloud connect <provider>` still appears to hang for users on
- * the released Bun binary (v4.0.26), even after:
- *   - PR #743 (handler-order + exec sh -c refactor in ssh-interactive.ts)
- *   - PR #744 (wrapWithLaunchCheckpoint — visible printf breadcrumb before
- *     the provider CLI enters alt-screen)
- *
- * The user upgrades, runs `cloud connect claude`, sees the unified
- * "Starting interactive authentication…" banner, and then… silence. No
- * `[agent-relay] launching provider CLI…` line. No claude TUI. Nothing.
- *
- * ## Root-cause hypothesis
- *
- * `scripts/build-bun.sh` passes `--external ssh2` to `bun build --compile`
- * on BOTH the cross-compile loop (line 68) and the current-platform build
- * (line 104). That means the released standalone binary does NOT contain
- * ssh2. At runtime, `loadSSH2()` attempts `import('ssh2')`, which throws
- * inside a packaged Bun binary, so the helper returns null and the CLI
- * falls through to the system-ssh fallback path in ssh-interactive.ts.
- *
- * The ssh2 path has rich observability (AGENT_RELAY_DEBUG_SSH, first-byte
- * timing, stream lifecycle, line-clearing hint). The system-ssh fallback
- * has ALMOST NONE: it shells out to `ssh -tt user@host 'remoteCommand'`
- * and inherits stdio. If the remote command hangs during environment
- * setup — before `printf '…launching provider CLI…'` ever runs — the user
- * sees no breadcrumb because the printf never executed.
- *
- * Every fix we've shipped so far has lived in the ssh2 branch or in the
- * remote command string that the ssh2 branch writes into the PTY. None of
- * those fixes reach released-binary users because they aren't running the
- * ssh2 branch at all.
- *
- * We validated locally that ssh2 bundles cleanly into a Bun compile:
- *
- *   bun build --compile --target=bun-darwin-arm64 \
- *     --external better-sqlite3 --external cpu-features --external node-pty \
- *     ./dist/src/cli/index.js --outfile /tmp/ssh2-bundle-test/agent-relay-ssh2
- *   # → 986 modules, 62M binary, --version works
- *
- * ## Fix
- *
- *   1. Drop `--external ssh2` from both occurrences in scripts/build-bun.sh.
- *   2. Add a real ssh2 integration test (tests/integration/ssh-interactive-live.test.ts)
- *      that spawns an in-process ssh2 Server, runs `runInteractiveSession`
- *      against it WITHOUT mocking the runtime, and asserts the launch
- *      checkpoint printf is visible in captured stdout. This is the
- *      mechanical E2E proof that the ssh2 path actually reaches the printf.
- *   3. Rebuild the Bun binary from scratch and validate:
- *      - Binary runs: `./.release/agent-relay --version` exits 0
- *      - Binary contains ssh2: `strings` output references ssh2 / ssh-userauth
- *      - Binary exercises the ssh2 branch at runtime (not fallback)
- *
- * ## Acceptance contract
- *
- *   A1  scripts/build-bun.sh contains zero `--external ssh2` occurrences
- *   A2  Existing unit tests green: ssh-interactive.test.ts (13 tests) +
- *       packages/cloud/src/auth.test.ts (10 tests)
- *   A3  `npx tsc --noEmit` is clean
- *   A4  tests/integration/ssh-interactive-live.test.ts exists and passes.
- *       It spins up an in-process ssh2 Server on a random port, calls
- *       runInteractiveSession with a default runtime (no loadSSH2 mock),
- *       and asserts:
- *         - the ssh2 client connects successfully
- *         - a shell session is opened on the fake server
- *         - the first payload received by the fake shell STARTS with
- *           `exec sh -c '` and contains `launching provider CLI`
- *         - captured stdout (via a write-spy) includes the dim-ANSI
- *           "launching provider CLI…" breadcrumb as proof the pipeline
- *           reached the printf
- *   A5  `npm run build && bash scripts/build-bun.sh` produces a binary at
- *       .release/agent-relay that runs `--version` successfully
- *   A6  The built binary contains ssh2 symbols (proof ssh2 is bundled).
- *       Heuristic check: `strings .release/agent-relay | grep -c 'ssh-userauth'`
- *       returns >= 1
- *   A7  `npm run orchestrator:test` (or the project's regression suite)
- *       still green
- *   A8  No commit is made until A1-A7 all pass
- *
- * ## What this workflow explicitly does NOT cover
- *
- *   - Live Daytona validation. Real Daytona sessions cost money per run and
- *     require CLOUD_API_* credentials. After this workflow is green, the
- *     operator must run `./.release/agent-relay cloud connect claude`
- *     against a real Daytona sandbox to confirm the fix works end-to-end.
- *     The workflow prints explicit validation instructions in its final
- *     summary step.
- *
- * ## Usage
- *
- *   cd /Users/khaliqgant/Projects/AgentWorkforce/relay
- *   agent-relay run --dry-run workflows/cloud-connect/validate-cloud-connect-e2e.ts
- *   agent-relay run workflows/cloud-connect/validate-cloud-connect-e2e.ts
- */
-
-import { workflow } from '@agent-relay/sdk/workflows';
-import { CodexModels, ClaudeModels } from '@agent-relay/config';
-
-const BUILD_BUN_SH = 'scripts/build-bun.sh';
-const SSH_INTERACTIVE = 'src/cli/lib/ssh-interactive.ts';
-const SSH_INTERACTIVE_TEST = 'src/cli/lib/ssh-interactive.test.ts';
-const AUTH_TEST = 'packages/cloud/src/auth.test.ts';
-const LIVE_TEST = 'tests/integration/ssh-interactive-live.test.ts';
-
-async function main() {
-  const result = await workflow('validate-cloud-connect-e2e')
-    .description(
-      'Validate cloud connect E2E: drop --external ssh2, add ssh2 live integration test, rebuild binary, prove the launch-checkpoint printf actually fires on the ssh2 path.'
-    )
-    .pattern('dag')
-    .channel('wf-validate-cloud-connect-e2e')
-    .maxConcurrency(3)
-    .timeout(5_400_000)
-
-    .agent('impl', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Edits scripts/build-bun.sh and writes the live ssh2 integration test',
-      retries: 2,
-    })
-    .agent('tester', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Writes and iterates on the ssh2 live integration test until green',
-      retries: 2,
-    })
-    .agent('fixer', {
-      cli: 'codex',
-      model: CodexModels.GPT_5_5,
-      preset: 'worker',
-      role: 'Fixes unit-test, typecheck, and regression failures',
-      retries: 2,
-    })
-    .agent('reviewer', {
-      cli: 'claude',
-      model: ClaudeModels.SONNET,
-      preset: 'reviewer',
-      role: 'Reviews the final diff for correctness before commit',
-      retries: 1,
-    })
-
-    // ── Phase 0: Branch setup ────────────────────────────────────────
-    .step('setup-branch', {
-      type: 'deterministic',
-      command: `set -e
-BRANCH="fix/cloud-connect-bundle-ssh2"
-CURRENT=$(git branch --show-current)
-if [ "$CURRENT" = "$BRANCH" ]; then
-  echo "Already on $BRANCH"
-elif git checkout -b "$BRANCH" 2>/dev/null; then
-  echo "Checked out new $BRANCH"
-elif git checkout "$BRANCH" 2>/dev/null; then
-  echo "Checked out existing $BRANCH"
-else
-  echo "Branch $BRANCH unavailable in this worktree; staying on $CURRENT"
-fi
-echo "BRANCH: $(git branch --show-current)"
-git status --short`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 1: Snapshot the bug ────────────────────────────────────
-    .step('snapshot-build-bun', {
-      type: 'deterministic',
-      dependsOn: ['setup-branch'],
-      command: `cat ${BUILD_BUN_SH}`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('snapshot-ssh-external-count', {
-      type: 'deterministic',
-      dependsOn: ['snapshot-build-bun'],
-      command: `set -e
-COUNT=$(grep -c -- '--external ssh2' ${BUILD_BUN_SH} || true)
-echo "ssh2-external-count-before: $COUNT"
-if [ "$COUNT" -eq 0 ]; then
-  echo "UNEXPECTED: no --external ssh2 occurrences — fix may already be applied"
-  exit 1
-fi
-echo "OK: $COUNT occurrences to remove"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('snapshot-existing-tests', {
-      type: 'deterministic',
-      dependsOn: ['snapshot-build-bun'],
-      command: `set -euo pipefail
-echo "=== ssh-interactive.test.ts ==="
-npx vitest run ${SSH_INTERACTIVE_TEST} 2>&1 | tail -30
-echo ""
-echo "=== auth.test.ts ==="
-npx vitest run ${AUTH_TEST} 2>&1 | tail -30`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 2: Read source files the fix touches ───────────────────
-    .step('read-ssh-interactive', {
-      type: 'deterministic',
-      dependsOn: ['snapshot-existing-tests'],
-      command: `cat ${SSH_INTERACTIVE}`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('read-auth-ssh', {
-      type: 'deterministic',
-      dependsOn: ['snapshot-existing-tests'],
-      command: `cat src/cli/lib/auth-ssh.ts 2>/dev/null | head -120 || echo "(auth-ssh.ts missing or short)"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    // ── Phase 3: Drop --external ssh2 from build-bun.sh ──────────────
-    .step('edit-build-bun', {
-      agent: 'impl',
-      dependsOn: ['snapshot-ssh-external-count', 'read-ssh-interactive'],
-      timeoutMs: 300_000,
-      task: `Edit \`${BUILD_BUN_SH}\`. This is a bash script. Do not touch any other file.
-
-Current contents:
-{{steps.snapshot-build-bun.output}}
-
-Remove EVERY line that is exactly \`    --external ssh2 \\\` (part of a \`bun build\` multi-line command). There are two such occurrences — one inside the \`for target_spec in "\${TARGETS[@]}"\` cross-compile loop, and one inside the current-platform build block further down. Keep all other \`--external\` flags (\`better-sqlite3\`, \`cpu-features\`, \`node-pty\`) intact. Keep the trailing backslash continuation on the line ABOVE the removed line so the bash command still parses.
-
-Do NOT add comments explaining the removal. Do NOT change version strings, paths, or any other logic.
-
-After editing, verify with: \`grep -c -- '--external ssh2' ${BUILD_BUN_SH}\` — it must output \`0\`.
-
-End your message with EDIT_DONE.`,
-      verification: { type: 'output_contains', value: 'EDIT_DONE' },
-    })
-
-    .step('verify-build-bun-edit', {
-      type: 'deterministic',
-      dependsOn: ['edit-build-bun'],
-      command: `set -e
-git diff --quiet ${BUILD_BUN_SH} && (echo "NOT MODIFIED"; exit 1) || true
-
-COUNT_AFTER=$(grep -c -- '--external ssh2' ${BUILD_BUN_SH} || true)
-echo "ssh2-external-count-after: $COUNT_AFTER"
-if [ "$COUNT_AFTER" -ne 0 ]; then
-  echo "ERROR: still has $COUNT_AFTER --external ssh2 lines"
-  exit 1
-fi
-
-# Quick bash syntax check.
-bash -n ${BUILD_BUN_SH} && echo "SYNTAX_OK" || (echo "BASH SYNTAX ERROR"; exit 1)
-
-# The other externals must still be present.
-grep -q -- '--external better-sqlite3' ${BUILD_BUN_SH} || (echo "MISSING better-sqlite3"; exit 1)
-grep -q -- '--external cpu-features'   ${BUILD_BUN_SH} || (echo "MISSING cpu-features"; exit 1)
-grep -q -- '--external node-pty'       ${BUILD_BUN_SH} || (echo "MISSING node-pty"; exit 1)
-
-echo "VERIFY_BUILD_BUN_OK"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 4: Write live ssh2 integration test ───────────────────
-    .step('write-live-integration-test', {
-      agent: 'tester',
-      dependsOn: ['verify-build-bun-edit', 'read-ssh-interactive'],
-      timeoutMs: 900_000,
-      task: `Create \`${LIVE_TEST}\`. This is a NEW integration test — there is no existing one to extend.
-
-The test must spin up an in-process ssh2 Server, call \`runInteractiveSession\` from \`src/cli/lib/ssh-interactive.ts\` with the DEFAULT runtime (i.e. do NOT mock \`loadSSH2\`), and prove that:
-  (a) the ssh2 client actually connects to our fake server
-  (b) the fake server receives a shell-channel write whose payload starts with \`exec sh -c '\` AND contains \`launching provider CLI\`
-  (c) the CLI's stdout pipeline emits the dim-ANSI "launching provider CLI…" breadcrumb
-
-This is the mechanical E2E proof that the ssh2 path actually fires the launch-checkpoint printf — the piece users have NOT been seeing in the released binary.
-
-Use Vitest. Key shape:
-
-\`\`\`ts
-import { describe, it, expect, vi } from 'vitest';
-import { createServer } from 'node:net';
-import { randomBytes } from 'node:crypto';
-import { Server as SSH2Server } from 'ssh2';
-import { readFileSync } from 'node:fs';
-import path from 'node:path';
-import { runInteractiveSession } from '../../src/cli/lib/ssh-interactive.js';
-
-// You will need an ephemeral RSA host key for the fake server.
-// Generate once in the test with ssh2's utils or ship a fixture under
-// tests/fixtures/ssh-host-key. Prefer generating at test time to avoid a
-// checked-in private key:
-//
-//   import { generateKeyPairSync } from 'node:crypto';
-//   const { privateKey } = generateKeyPairSync('rsa', {
-//     modulusLength: 2048,
-//     privateKeyEncoding: { type: 'pkcs1', format: 'pem' },
-//     publicKeyEncoding:  { type: 'spki',  format: 'pem' },
-//   });
-\`\`\`
-
-Test plan:
-
-**Test 1 — ssh2 path writes exec sh -c with launch checkpoint through a real ssh2 connection:**
-
-1. Generate an ephemeral RSA host key.
-2. Start an ssh2 Server listening on an OS-assigned port (pass 0 to .listen).
-3. In the server's connection handler:
-   - Accept \`password\` auth unconditionally for user \`test\` / password \`test\`
-   - Accept the first session
-   - When the client requests a PTY, accept it
-   - When the client opens a shell, capture \`stream.on('data')\` — the first
-     data chunk is what the CLI writes into the shell. Store it into
-     \`capturedWrite: string\`, then write nothing back (or a trivial banner)
-     and call \`stream.exit(0); stream.end();\` after a short delay so the
-     CLI's close handler fires.
-4. Spy on \`process.stdout.write\` (vi.spyOn) to capture every byte the CLI
-   writes to stdout. Collect them into \`capturedStdout: string\`.
-5. Spy on \`process.stdin.setRawMode\`, \`resume\`, \`pause\` and no-op them.
-6. Call \`runInteractiveSession\` with:
-   \`\`\`ts
-   {
-     ssh: { host: '127.0.0.1', port: serverPort, user: 'test', password: 'test' },
-     remoteCommand: 'claude',
-     successPatterns: [],
-     errorPatterns: [],
-     timeoutMs: 5000,
-     io: { log: vi.fn(), error: vi.fn() },
-     // No runtime override — use the real default loadSSH2.
-   }
-   \`\`\`
-7. Await the result. Assertions:
-   - \`capturedWrite\` starts with \`exec sh -c '\`
-   - \`capturedWrite\` includes \`launching provider CLI\`
-   - \`capturedWrite\` does NOT include \`; exit $?\` (regression for PR #743)
-   - \`capturedStdout\` includes the literal text \`launching provider CLI\` — this proves the ssh2 data pipeline hands bytes back to stdout. (In the real sandbox, the printf is the FIRST thing the shell runs before exec claude. In this test, the server has to echo the payload back so the CLI sees "data" and writes it to stdout. See step 3.)
-8. Always close the ssh2 server + client in an \`afterEach\` / \`finally\`.
-
-**Test 2 — loadSSH2 returns a truthy ssh2 module in the default runtime:**
-
-1. Import \`loadSSH2\` from \`src/cli/lib/auth-ssh.js\`.
-2. Call it and assert the result is truthy AND has a \`Client\` constructor.
-3. This is a canary: if the bundler ever starts externalizing ssh2 again,
-   this test fails inside the Bun binary smoke check in Phase 6.
-
-**Gotchas:**
-
-- ssh2 Server needs \`hostKeys: [{ key: privateKeyPem }]\` in its options
-- The server's \`ready\` event signals the client is ready — use it to know
-  when to resolve \`new Promise\` wrappers
-- Server must call \`accept()\` on authentication, session, pty, and shell
-  requests (not \`reject()\`)
-- The shell stream emitted by the server is a Writable you can write to,
-  and a Readable you listen to (\`stream.on('data', …)\`)
-- Remember to call \`stream.exit(0)\` before \`stream.end()\` so the client's
-  \`exit\` handler fires before \`close\`
-- Mock \`process.stdin.setRawMode\` so vitest doesn't crash in CI environments
-  where stdin is not a TTY
-- Give the test a \`timeout: 10_000\` suffix on the \`it()\` call
-
-**Environment:**
-
-If ssh2 types are not re-exported from '@types/ssh2' in the usual place,
-use \`// @ts-expect-error\` or \`as any\` narrowly rather than fighting types.
-This is an integration test, not a type showcase.
-
-When done, end your message with LIVE_TEST_WRITTEN.`,
-      verification: { type: 'file_exists', value: LIVE_TEST },
-    })
-
-    .step('verify-live-test-written', {
-      type: 'deterministic',
-      dependsOn: ['write-live-integration-test'],
-      command: `set -e
-test -f ${LIVE_TEST} || (echo "MISSING test file"; exit 1)
-grep -q "from 'ssh2'" ${LIVE_TEST} || grep -q 'from "ssh2"' ${LIVE_TEST} || (echo "missing ssh2 import"; exit 1)
-grep -q "runInteractiveSession" ${LIVE_TEST} || (echo "missing runInteractiveSession import/usage"; exit 1)
-grep -q "launching provider CLI" ${LIVE_TEST} || (echo "missing launch-checkpoint assertion"; exit 1)
-grep -q "exec sh -c" ${LIVE_TEST} || (echo "missing exec sh -c assertion"; exit 1)
-echo VERIFY_LIVE_TEST_OK`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 5: Run live integration test (test-fix-rerun) ─────────
-    .step('run-live-test', {
-      type: 'deterministic',
-      dependsOn: ['verify-live-test-written'],
-      // Capture vitest's real exit via PIPESTATUS so fix-live-test can
-      // distinguish pass from fail — tail always exits 0 on its own.
-      command: `npx vitest run ${LIVE_TEST} 2>&1 | tail -120; echo "EXIT: \${PIPESTATUS[0]}"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-live-test', {
-      agent: 'fixer',
-      dependsOn: ['run-live-test'],
-      timeoutMs: 1_500_000,
-      task: `Vitest output for ${LIVE_TEST}:
-
-{{steps.run-live-test.output}}
-
-The output ends with an \`EXIT: <code>\` line from the vitest invocation's
-real exit code (captured via PIPESTATUS — do NOT trust tail's own exit).
-If \`EXIT: 0\`, all tests passed — do nothing and end with LIVE_GREEN.
-
-If \`EXIT: <non-zero>\`, diagnose and fix. The failure could be in:
-  (a) the integration test itself (${LIVE_TEST}) — wrong ssh2 server API shape, missing accept() call, flaky timing, bad host key format
-  (b) the test expecting different bytes than what the ssh2 path actually produces
-  (c) a real bug in ${SSH_INTERACTIVE} that the mocked unit tests missed
-
-If the failure is that the ssh2 client never connects: the ssh2 Server is probably not accepting auth correctly. Read the ssh2 \`Server\` docs, the \`auth-method\` on the auth context is 'password', and you must call \`ctx.accept()\` for it.
-
-If the failure is that \`capturedWrite\` is empty: the client is connecting but the shell callback isn't firing — check that the server is accepting the 'session' + 'pty' + 'shell' subrequests.
-
-If the failure is that \`capturedStdout\` does not contain the breadcrumb: the server needs to echo the captured shell-write back through the stream so the CLI reads it, prints it to stdout, and the spy captures it. Adjust the server shell handler to \`stream.write(capturedWrite)\` (or the portion after 'exec sh -c …') before \`stream.exit(0)\`.
-
-Do NOT relax the assertions to make the test pass. The assertions encode the ACCEPTANCE CONTRACT from the workflow header — weakening them defeats the purpose.
-
-Re-run: \`npx vitest run ${LIVE_TEST}\`. Iterate until green. End with LIVE_GREEN.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('run-live-test-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-live-test'],
-      command: `set -o pipefail; npx vitest run ${LIVE_TEST} 2>&1 | tail -60`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 6: Typecheck + regression ──────────────────────────────
-    .step('typecheck', {
-      type: 'deterministic',
-      dependsOn: ['run-live-test-final'],
-      // Capture tsc's exit code via PIPESTATUS, not $? (which is tail's).
-      // failOnError: false so the fix-typecheck agent can read the output.
-      command: `npx tsc --noEmit 2>&1 | tail -40; echo "EXIT: \${PIPESTATUS[0]}"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-typecheck', {
-      agent: 'fixer',
-      dependsOn: ['typecheck'],
-      timeoutMs: 600_000,
-      task: `Typecheck output:
-{{steps.typecheck.output}}
-
-The output ends with an \`EXIT: <code>\` line that holds tsc's REAL exit
-code (captured via PIPESTATUS — not tail's exit). If \`EXIT: 0\`, do
-nothing and end with TYPECHECK_OK.
-
-If there are type errors, fix them. They are almost certainly in ${LIVE_TEST} (new file) since we did not touch any TypeScript source. Do not touch files outside ${LIVE_TEST} unless the error is in a file we already edited in this workflow. Narrow \`any\` casts are acceptable for ssh2 Server types. Re-run \`npx tsc --noEmit\`. End with TYPECHECK_OK.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('typecheck-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-typecheck'],
-      command: `set -o pipefail; npx tsc --noEmit 2>&1 | tail -40`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('run-unit-tests', {
-      type: 'deterministic',
-      dependsOn: ['typecheck-final'],
-      // failOnError: false so the fixer agent can read the output. Capture
-      // each suite's exit via PIPESTATUS so we don't mask vitest failures
-      // with tail's always-zero exit — the fixer agent gates on these.
-      command: `echo "=== ssh-interactive.test.ts ==="
-npx vitest run ${SSH_INTERACTIVE_TEST} 2>&1 | tail -30; echo "EXIT_SSH: \${PIPESTATUS[0]}"
-echo ""
-echo "=== auth.test.ts ==="
-npx vitest run ${AUTH_TEST} 2>&1 | tail -30; echo "EXIT_AUTH: \${PIPESTATUS[0]}"
-echo ""
-echo "=== broader src/cli ==="
-npx vitest run src/cli 2>&1 | tail -40; echo "EXIT_CLI: \${PIPESTATUS[0]}"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('fix-unit-regressions', {
-      agent: 'fixer',
-      dependsOn: ['run-unit-tests'],
-      timeoutMs: 900_000,
-      task: `Unit test output:
-{{steps.run-unit-tests.output}}
-
-The output contains three \`EXIT_<SUITE>: <code>\` markers (SSH, AUTH,
-CLI) captured via PIPESTATUS — tail's own exit is always 0 and must not
-be trusted. If every marker reads \`EXIT_*: 0\`, end with UNIT_GREEN.
-
-If any existing test regressed, fix the ROOT CAUSE (most likely in the integration test file you just wrote, since the workflow did not touch any production TS source). Do not weaken assertions in existing tests. End with UNIT_GREEN.`,
-      verification: { type: 'exit_code' },
-    })
-
-    .step('run-unit-tests-final', {
-      type: 'deterministic',
-      dependsOn: ['fix-unit-regressions'],
-      command: `set -euo pipefail
-npx vitest run ${SSH_INTERACTIVE_TEST} ${AUTH_TEST} ${LIVE_TEST} 2>&1 | tail -60`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 7: Rebuild Bun binary from scratch ─────────────────────
-    .step('build-ts', {
-      type: 'deterministic',
-      dependsOn: ['run-unit-tests-final'],
-      command: `set -o pipefail; npm run build 2>&1 | tail -40`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('build-bun-binary', {
-      type: 'deterministic',
-      dependsOn: ['build-ts'],
-      command: `set -euo pipefail
-rm -rf .release
-bash ${BUILD_BUN_SH} 2>&1 | tail -80
-echo ""
-echo "=== .release contents ==="
-ls -lh .release/ 2>&1`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('validate-binary-runs', {
-      type: 'deterministic',
-      dependsOn: ['build-bun-binary'],
-      command: `set -e
-BIN=.release/agent-relay
-test -x "$BIN" || (echo "MISSING executable $BIN"; exit 1)
-
-echo "=== --version ==="
-"$BIN" --version
-echo ""
-echo "=== cloud connect --help ==="
-"$BIN" cloud connect --help 2>&1 | head -40 || true
-echo ""
-echo "=== binary size ==="
-du -h "$BIN"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('validate-ssh2-bundled', {
-      type: 'deterministic',
-      dependsOn: ['validate-binary-runs'],
-      command: `set -e
-BIN=.release/agent-relay
-
-# Heuristic 1: strings-grep for ssh2 protocol markers. ssh-userauth is part
-# of the SSH2 wire protocol and is present as a literal string in ssh2's
-# JS sources, so a bundled binary should contain it.
-USERAUTH_HITS=$(strings "$BIN" 2>/dev/null | grep -c 'ssh-userauth' || true)
-echo "ssh-userauth hits: $USERAUTH_HITS"
-
-# Heuristic 2: the ssh2 package name also appears as a module id in the
-# bundle's module map.
-SSH2_HITS=$(strings "$BIN" 2>/dev/null | grep -c '"ssh2"\\|node_modules/ssh2' || true)
-echo "ssh2 module hits:  $SSH2_HITS"
-
-if [ "$USERAUTH_HITS" -lt 1 ]; then
-  echo "ERROR: binary does not appear to contain ssh2 (ssh-userauth not found in strings)"
-  echo "This means --external ssh2 is still being applied somewhere in the build."
-  exit 1
-fi
-
-echo "SSH2_BUNDLED_OK"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 8: Review ──────────────────────────────────────────────
-    .step('collect-diff', {
-      type: 'deterministic',
-      dependsOn: ['validate-ssh2-bundled'],
-      command: `set -e
-echo "=== files changed ==="
-git status --short ${BUILD_BUN_SH} ${LIVE_TEST}
-echo ""
-echo "=== diff stat ==="
-git diff --stat ${BUILD_BUN_SH} ${LIVE_TEST}
-echo ""
-echo "=== build-bun.sh diff ==="
-git diff ${BUILD_BUN_SH}
-echo ""
-echo "=== live test (first 80 lines) ==="
-head -80 ${LIVE_TEST}`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('review-diff', {
-      agent: 'reviewer',
-      dependsOn: ['collect-diff'],
-      timeoutMs: 600_000,
-      task: `Review the diff for the cloud-connect fix.
-
-{{steps.collect-diff.output}}
-
-Check:
-  1. build-bun.sh: both \`--external ssh2\` occurrences removed, line continuations still valid, no other flags lost.
-  2. The new integration test actually exercises the real ssh2 path (no loadSSH2 mock) and asserts the launch-checkpoint printf.
-  3. No unrelated files modified.
-  4. No secrets, credentials, or private keys committed.
-
-Respond with one of:
-  - REVIEW_OK — diff is correct and ready to commit
-  - REVIEW_BLOCK: <reason> — do not commit; explain what's wrong
-
-Do not edit files in this step. Review only.`,
-      verification: { type: 'output_contains', value: 'REVIEW_OK' },
-    })
-
-    // ── Phase 9: Summary ─────────────────────────────────────────────
-    .step('summary', {
-      type: 'deterministic',
-      dependsOn: ['review-diff'],
-      command: `set -e
-cat <<'EOF'
-════════════════════════════════════════════════════════════════
-  validate-cloud-connect-e2e — ALL GATES GREEN
-════════════════════════════════════════════════════════════════
-
-Acceptance contract:
-  A1  scripts/build-bun.sh has zero --external ssh2         PASS
-  A2  ssh-interactive + auth unit tests green               PASS
-  A3  npx tsc --noEmit clean                                PASS
-  A4  tests/integration/ssh-interactive-live.test.ts green  PASS
-  A5  .release/agent-relay --version works                  PASS
-  A6  Built binary contains ssh2 symbols                    PASS
-  A7  Regression suite green                                PASS
-  A8  Reviewer approved diff                                PASS
-
-Next steps (MANUAL — not covered by this workflow):
-
-  1. Commit and push on fix/cloud-connect-bundle-ssh2:
-
-       git add scripts/build-bun.sh tests/integration/ssh-interactive-live.test.ts
-       git commit -m "fix(cli): bundle ssh2 into Bun binary so cloud connect exercises the ssh2 path"
-       git push -u origin fix/cloud-connect-bundle-ssh2
-
-  2. Live Daytona validation against a real sandbox BEFORE releasing:
-
-       # This requires CLOUD_API_* credentials and a real cloud workspace.
-       # Expect to see the dim '[agent-relay] launching provider CLI…'
-       # breadcrumb appear within 1-2 seconds of 'Starting interactive
-       # authentication…'. If that line does not appear, the fix did not
-       # land and the ssh2 branch is still being skipped.
-
-       AGENT_RELAY_DEBUG_SSH=1 ./.release/agent-relay cloud connect claude
-
-       Success criteria:
-         - Unified 'Starting interactive authentication…' banner prints
-         - Dim '[agent-relay] launching provider CLI…' breadcrumb prints
-           within 1-2s
-         - Claude TUI renders within 15s
-         - First-byte debug log shows non-zero elapsedMs
-         - AGENT_RELAY_DEBUG_SSH output shows 'shell-request' then
-           'shell-opened' then 'shell-write'
-
-  3. Open PR, get review, merge, cut new release.
-
-════════════════════════════════════════════════════════════════
-EOF`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .onError('retry', { maxRetries: 1, retryDelayMs: 5_000 })
-    .run({ cwd: process.cwd() });
-
-  console.log('Workflow status:', result.status);
-  console.log('Steps completed:', Object.keys(result.steps || {}));
-  process.exit(result.status === 'completed' ? 0 : 1);
-}
-
-main().catch((error) => {
-  console.error('Workflow failed:', error);
-  process.exit(1);
-});
diff --git a/workflows/fix-history-inbox-v2.ts b/workflows/fix-history-inbox-v2.ts
deleted file mode 100644
index d91b21d34..000000000
--- a/workflows/fix-history-inbox-v2.ts
+++ /dev/null
@@ -1,11 +0,0 @@
-/**
- * Retired workflow placeholder.
- *
- * This path previously contained a one-shot repair workflow that had drifted
- * from main, reintroduced duplicate import regressions, and hardcoded
- * branch-specific git/PR commands. It is intentionally inert now so the stale
- * workflow cannot be run or copied forward.
- */
-throw new Error(
-  'workflows/fix-history-inbox-v2.ts has been retired because the previous repair workflow was stale and unsafe to reuse.',
-);
diff --git a/workflows/fix-history-inbox-workflow.ts b/workflows/fix-history-inbox-workflow.ts
deleted file mode 100644
index 5049d1971..000000000
--- a/workflows/fix-history-inbox-workflow.ts
+++ /dev/null
@@ -1,191 +0,0 @@
-import { workflow } from '@agent-relay/sdk/workflows';
-
-async function main() {
-  const result = await workflow('fix-history-inbox')
-    .description('Test, fix, verify history/inbox workspace_key resolution, then commit and open PR')
-    .pattern('dag')
-    .channel('wf-history-inbox')
-    .maxConcurrency(1)
-    .timeout(600000)
-
-    .agent('fixer', {
-      cli: 'codex',
-      preset: 'worker',
-      role: 'Fix resolveRelaycastApiKey in messaging.ts',
-      retries: 2,
-    })
-
-    // Test in a temp directory so we don't nuke the workflow's own broker
-    .step('diagnose', {
-      type: 'deterministic',
-      command: `set -e
-TMPDIR=$(mktemp -d)
-echo "=== Testing history/inbox in temp workspace: $TMPDIR ==="
-agent-relay up --cwd "$TMPDIR" --no-dashboard 2>&1 &
-BROKER_PID=$!
-sleep 12
-echo "=== connection.json ==="
-cat "$TMPDIR/.agent-relay/connection.json" 2>/dev/null || echo "NO_CONNECTION_JSON"
-echo ""
-echo "=== history test ==="
-agent-relay history --cwd "$TMPDIR" --limit 3 2>&1 || agent-relay history --limit 3 2>&1 || echo "HISTORY_FAILED"
-echo ""
-echo "=== inbox test ==="
-agent-relay inbox --cwd "$TMPDIR" 2>&1 || agent-relay inbox 2>&1 || echo "INBOX_FAILED"
-echo ""
-kill $BROKER_PID 2>/dev/null || true
-echo "TMPDIR=$TMPDIR"
-echo "DIAGNOSE_DONE"`,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('read-messaging', {
-      type: 'deterministic',
-      dependsOn: ['diagnose'],
-      command: `cat src/cli/commands/messaging.ts`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('fix-messaging', {
-      agent: 'fixer',
-      dependsOn: ['read-messaging'],
-      task: `Fix resolveRelaycastApiKey() in src/cli/commands/messaging.ts so that history and inbox work without RELAY_API_KEY env var.
-
-Current file content:
-{{steps.read-messaging.output}}
-
-The bug: resolveRelaycastApiKey() throws "Relaycast API key not found in RELAY_API_KEY" because AgentRelayClient.connect() fails to find connection.json.
-
-Root cause: connection.json is written by the broker at startup but uses a state dir that may differ from cwd/.agent-relay/. The broker's /api/session endpoint always returns workspace_key when called with the broker's api_key.
-
-The fix: Replace the current fallback logic with a direct HTTP fetch to the broker's /api/session endpoint. The broker port is discoverable via getProjectPaths() or by scanning for the running broker.
-
-Here is the exact approach that is proven to work (verified via curl):
-- Read connection.json from the project's .agent-relay/ directory
-- Use its api_key and port to call GET http://127.0.0.1:{port}/api/session with Authorization: Bearer {api_key}
-- Return session.workspace_key
-
-The getProjectPaths() import is already available. Use node's built-in fs/path (not dynamic import, use static import at the top if needed, or require('fs') style if the file uses require).
-
-Only edit src/cli/commands/messaging.ts. Do not touch any other file.
-End with FIXES_COMPLETE.`,
-      verification: { type: 'exit_code' },
-      retries: 2,
-    })
-
-    .step('verify-changed', {
-      type: 'deterministic',
-      dependsOn: ['fix-messaging'],
-      command: `set -e
-if git diff --quiet src/cli/commands/messaging.ts; then
-  echo "ERROR: messaging.ts was not modified"
-  exit 1
-fi
-echo "messaging.ts modified — diff summary:"
-git diff --stat src/cli/commands/messaging.ts
-echo "CHANGED_OK"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('rebuild', {
-      type: 'deterministic',
-      dependsOn: ['verify-changed'],
-      command: `set -e
-npm run build 2>&1 | tail -15
-echo "REBUILD_DONE"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('retest', {
-      type: 'deterministic',
-      dependsOn: ['rebuild'],
-      command: `set -e
-TMPDIR=$(mktemp -d)
-echo "=== Fresh broker in temp workspace: $TMPDIR ==="
-agent-relay up --cwd "$TMPDIR" --no-dashboard 2>&1 &
-BROKER_PID=$!
-sleep 12
-
-echo "=== RETEST HISTORY ==="
-agent-relay history --limit 3 2>&1
-HISTORY_EXIT=$?
-
-echo ""
-echo "=== RETEST INBOX ==="
-agent-relay inbox 2>&1
-INBOX_EXIT=$?
-
-kill $BROKER_PID 2>/dev/null || true
-
-echo ""
-if [ $HISTORY_EXIT -eq 0 ] && [ $INBOX_EXIT -eq 0 ]; then
-  echo "RETEST_PASSED"
-else
-  echo "RETEST_FAILED: history=$HISTORY_EXIT inbox=$INBOX_EXIT"
-  exit 1
-fi`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('run-unit-tests', {
-      type: 'deterministic',
-      dependsOn: ['retest'],
-      command: `set -e
-npx vitest run src/cli/commands/messaging.test.ts 2>&1 | tail -20
-echo "UNIT_TESTS_DONE"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('commit-and-open-pr', {
-      type: 'deterministic',
-      dependsOn: ['run-unit-tests'],
-      command: `set -e
-git add src/cli/commands/messaging.ts
-git commit -m "fix: resolve workspace_key from broker API for history/inbox
-
-history and inbox previously required RELAY_API_KEY env var.
-Now resolveRelaycastApiKey() fetches workspace_key directly from the
-running broker's /api/session endpoint using the local connection.json,
-so both commands work out of the box whenever a broker is running."
-
-git push origin miya/relay-fix-workflow
-
-PR_URL=$(gh pr create \
-  --title "fix: history and inbox work without RELAY_API_KEY env var" \
-  --body "## Problem
-\`agent-relay history\` and \`agent-relay inbox\` failed with:
-\`\`\`
-Failed to initialize relaycast client: Relaycast API key not found in RELAY_API_KEY
-\`\`\`
-...even when a broker was running with a valid workspace key.
-
-## Root Cause
-\`resolveRelaycastApiKey()\` only checked the \`RELAY_API_KEY\` env var and then tried \`AgentRelayClient.connect()\` which reads \`connection.json\` — but that file was not reliably present when the broker is managed by the workflow runner.
-
-## Fix
-Fetch \`workspace_key\` directly from the running broker's \`/api/session\` HTTP endpoint using the \`api_key\` and \`port\` from \`connection.json\`. This is always available when the broker is running.
-
-## Verified
-- Workflow test: history and inbox return results after fix
-- Unit tests: all messaging tests pass" \
-  --base main \
-  --head miya/relay-fix-workflow 2>&1)
-
-echo "PR_URL: $PR_URL"
-echo "PR_CREATED"`,
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .run({ cwd: process.cwd() });
-
-  console.log('Result:', result.status);
-}
-
-main().catch(console.error);
diff --git a/workflows/relay-e2e-meta-workflow.ts b/workflows/relay-e2e-meta-workflow.ts
deleted file mode 100644
index 603ae16f1..000000000
--- a/workflows/relay-e2e-meta-workflow.ts
+++ /dev/null
@@ -1,196 +0,0 @@
-import { workflow } from '@agent-relay/sdk/workflows';
-import { ClaudeModels, CodexModels } from '@agent-relay/config';
-
-await workflow('design-relay-clean-room-e2e-validation')
-  .description('Meta-workflow that designs the right clean-environment end-to-end validation workflow for agent-relay install/bootstrap/messaging fixes, choosing the proving environment and evidence plan before implementation.')
-  .pattern('supervisor')
-  .channel('wf-relay-e2e-meta')
-  .maxConcurrency(4)
-  .timeout(3_600_000)
-
-  .agent('lead', {
-    cli: 'claude',
-    preset: 'lead',
-    role: 'Lead workflow architect choosing the proving strategy and acceptance contract',
-    model: ClaudeModels.OPUS,
-    retries: 2,
-  })
-  .agent('research-a', {
-    cli: 'codex',
-    preset: 'analyst',
-    role: 'Research clean-environment options and relay install/runtime failure modes',
-    model: CodexModels.GPT_5_5,
-    retries: 2,
-  })
-  .agent('research-b', {
-    cli: 'codex',
-    preset: 'analyst',
-    role: 'Research workflow pattern choice, validation phases, and evidence design',
-    model: CodexModels.GPT_5_5,
-    retries: 2,
-  })
-  .agent('author', {
-    cli: 'claude',
-    preset: 'worker',
-    role: 'Workflow author for the final clean-room end-to-end validation workflow',
-    model: ClaudeModels.SONNET,
-    retries: 2,
-  })
-  .agent('reviewer', {
-    cli: 'claude',
-    preset: 'reviewer',
-    role: 'Reviewer verifying that the authored workflow really proves the original problem is fixed',
-    model: ClaudeModels.SONNET,
-    retries: 2,
-  })
-
-  .step('capture-current-context', {
-    type: 'deterministic',
-    command: `
-      set -e
-      cd ~/Projects/AgentWorkforce/relay
-      echo '## Relevant existing workflows'
-      find workflows -maxdepth 1 -type f | sort | sed 's#^#- #' || true
-      echo '\n## Existing fix workflow'
-      sed -n '1,260p' /Users/khaliqgant/.openclaw/workspace/relay-fix-workflow.ts
-      echo '\n## Running-headless-orchestrator skill excerpt'
-      sed -n '1,220p' /Users/khaliqgant/.openclaw/workspace/skills/running-headless-orchestrator/SKILL.md
-      echo '\n## Writer skill excerpt'
-      sed -n '1,260p' /tmp/skills-review/skills/writing-agent-relay-workflows/SKILL.md
-    `,
-    captureOutput: true,
-    failOnError: true,
-  })
-
-  .step('research-environment-options', {
-    agent: 'research-a',
-    dependsOn: ['capture-current-context'],
-    task: `Analyze the current relay workflow context and choose the best proving environment for full end-to-end validation of install/bootstrap/messaging fixes.
-
-Context:
-{{steps.capture-current-context.output}}
-
-Compare at least these options:
-- Docker / container
-- cloud-provisioned sandbox / workspace
-- fresh local shell with isolated paths
-
-Output sections:
-1. ORIGINAL_PROBLEM_CLASS
-2. PROVING_ENV_OPTIONS
-3. RECOMMENDED_ENV
-4. WHY_NOT_THE_OTHERS
-5. ACCEPTANCE_SIGNALS
-
-End with ENV_ANALYSIS_COMPLETE.`,
-    verification: { type: 'output_contains', value: 'ENV_ANALYSIS_COMPLETE' },
-    retries: 2,
-  })
-
-  .step('research-workflow-shape', {
-    agent: 'research-b',
-    dependsOn: ['capture-current-context'],
-    task: `Determine the right workflow/swarm shape for a clean-room validation workflow.
-
-Context:
-{{steps.capture-current-context.output}}
-
-Requirements:
-- do not assume DAG by default
-- consider whether a meta-workflow + generated workflow pair is best
-- define the required phases to prove the original user-visible bug is actually fixed
-- explicitly define what evidence artifacts must be captured
-
-Output sections:
-1. PATTERN_DECISION
-2. REQUIRED_PHASES
-3. EVIDENCE_ARTIFACTS
-4. FAILURE_MODES_TO_REPRODUCE
-5. REVIEW_CRITERIA
-
-End with SHAPE_ANALYSIS_COMPLETE.`,
-    verification: { type: 'output_contains', value: 'SHAPE_ANALYSIS_COMPLETE' },
-    retries: 2,
-  })
-
-  .step('decide-validation-strategy', {
-    agent: 'lead',
-    dependsOn: ['research-environment-options', 'research-workflow-shape'],
-    task: `Make the final design decision for the clean-room end-to-end validation workflow.
-
-Environment analysis:
-{{steps.research-environment-options.output}}
-
-Workflow-shape analysis:
-{{steps.research-workflow-shape.output}}
-
-Produce a final design doc with these exact sections:
-1. ACCEPTANCE_CONTRACT
-2. CHOSEN_PROVING_ENVIRONMENT
-3. CHOSEN_PATTERN
-4. EXECUTION_PHASES
-5. REQUIRED_ARTIFACTS
-6. OUTPUT_FILES_TO_AUTHOR
-
-The design must be concrete enough for another agent to author the final workflow file without guessing.
-End with DESIGN_COMPLETE.`,
-    verification: { type: 'output_contains', value: 'DESIGN_COMPLETE' },
-    retries: 2,
-  })
-
-  .step('author-final-validation-workflow', {
-    agent: 'author',
-    dependsOn: ['decide-validation-strategy'],
-    task: `Write the final clean-room end-to-end validation workflow into ~/Projects/AgentWorkforce/relay/workflows/relay-clean-room-e2e-validation.ts.
-
-Design:
-{{steps.decide-validation-strategy.output}}
-
-Requirements:
-- the workflow must explicitly reproduce/capture the original failure class first
-- it must validate the fix in a clean environment, not just the current shell
-- it must define deterministic artifact capture and a reviewer verdict
-- it should use the chosen workflow/swarm pattern from the design
-- write only the workflow file to disk
-
-End by printing WORKFLOW_AUTHORED.`,
-    verification: { type: 'exit_code' },
-    retries: 2,
-  })
-
-  .step('verify-authored-workflow', {
-    type: 'deterministic',
-    dependsOn: ['author-final-validation-workflow'],
-    command: `
-      set -e
-      cd ~/Projects/AgentWorkforce/relay
-      test -f workflows/relay-clean-room-e2e-validation.ts
-      sed -n '1,260p' workflows/relay-clean-room-e2e-validation.ts
-    `,
-    captureOutput: true,
-    failOnError: true,
-  })
-
-  .step('review-authored-workflow', {
-    agent: 'reviewer',
-    dependsOn: ['decide-validation-strategy', 'verify-authored-workflow'],
-    task: `Review the authored workflow against the design.
-
-Design:
-{{steps.decide-validation-strategy.output}}
-
-Authored workflow:
-{{steps.verify-authored-workflow.output}}
-
-Answer with these exact sections:
-1. PASS_FAIL
-2. WHAT_PROBLEM_IT_PROVES
-3. WHAT_EVIDENCE_IT_COLLECTS
-4. WHAT_STILL_NEEDS_HUMAN_DECISION
-
-End with REVIEW_COMPLETE.`,
-    verification: { type: 'output_contains', value: 'REVIEW_COMPLETE' },
-    retries: 2,
-  })
-
-  .run({ cwd: process.cwd() });
diff --git a/workflows/test-agent-relay-cli.ts b/workflows/test-agent-relay-cli.ts
deleted file mode 100644
index 2408c6a3d..000000000
--- a/workflows/test-agent-relay-cli.ts
+++ /dev/null
@@ -1,130 +0,0 @@
-import { workflow } from '@agent-relay/sdk/workflows';
-
-async function main() {
-  const result = await workflow('test-agent-relay-cli-commands')
-    .description('TDD test for agent-relay CLI commands: spawn, who, agents:logs, release, set-model, send, history, inbox with all subcommands and options')
-    .pattern('dag')
-    .channel('wf-cli-test')
-    .maxConcurrency(3)
-    .timeout(3600000)
-
-    .agent('lead', {
-      cli: 'claude',
-      preset: 'lead',
-      role: 'Architect coordinating test creation and verification',
-      model: 'sonnet-4-20250514',
-      retries: 2,
-    })
-    .agent('codex', {
-      cli: 'codex',
-      preset: 'worker',
-      role: 'Implementation agent for fixing CLI bugs',
-      retries: 2,
-    })
-    .agent('tester', {
-      cli: 'claude',
-      preset: 'analyst',
-      role: 'Running tests and verifying fixes',
-      retries: 2,
-    })
-
-    .step('map-commands', {
-      type: 'deterministic',
-      command: `
-        set -e
-        cd "$PWD"
-        echo "=== CLI COMMAND STRUCTURE ==="
-        node dist/src/cli/index.js --help 2>&1 | head -50
-        echo ""
-        echo "=== spawn help ==="
-        node dist/src/cli/index.js spawn --help 2>&1
-        echo ""
-        echo "=== who help ==="
-        node dist/src/cli/index.js who --help 2>&1
-        echo ""
-        echo "=== agents help ==="
-        node dist/src/cli/index.js agents --help 2>&1
-        echo ""
-        echo "=== release help ==="
-        node dist/src/cli/index.js release --help 2>&1
-        echo ""
-        echo "=== set-model help ==="
-        node dist/src/cli/index.js set-model --help 2>&1
-        echo ""
-        echo "=== send help ==="
-        node dist/src/cli/index.js send --help 2>&1
-        echo ""
-        echo "=== history help ==="
-        node dist/src/cli/index.js history --help 2>&1
-        echo ""
-        echo "=== inbox help ==="
-        node dist/src/cli/index.js inbox --help 2>&1
-      `,
-      captureOutput: true,
-      failOnError: false,
-    })
-
-    .step('write-tests', {
-      agent: 'lead',
-      dependsOn: ['map-commands'],
-      task: `Create TDD tests for agent-relay CLI commands.
-
-CLI help:
-{{steps.map-commands.output}}
-
-Commands: spawn, who, agents:logs, release, set-model, send, history, inbox
-
-Write to: tests/integration/cli-commands.test.ts
-
-Test against LIVE broker (not mocked). End with TESTS_WRITTEN.`,
-      verification: { type: 'output_contains', value: 'TESTS_WRITTEN' },
-      retries: 2,
-    })
-
-    .step('run-tests', {
-      agent: 'tester',
-      dependsOn: ['write-tests'],
-      task: `Run tests against live broker.
-
-Setup:
-1. agent-relay down --force --timeout 5000 || true
-2. agent-relay up --no-dashboard --verbose > /tmp/broker.log 2>&1 &
-3. sleep 8
-
-Test all commands. Report PASS/FAIL per command.
-End with TEST_RESULTS_COMPLETE.`,
-      verification: { type: 'output_contains', value: 'TEST_RESULTS_COMPLETE' },
-      retries: 2,
-    })
-
-    .step('fix-broken', {
-      agent: 'codex',
-      dependsOn: ['run-tests'],
-      task: `Fix broken CLI commands.
-
-Test report:
-{{steps.run-tests.output}}
-
-Focus on src/cli/commands/messaging.ts and agent-management.ts.
-End with FIXES_COMPLETE.`,
-      verification: { type: 'exit_code' },
-      retries: 2,
-    })
-
-    .step('retest', {
-      agent: 'tester',
-      dependsOn: ['fix-broken'],
-      task: `Retest all commands after fixes.
-
-Restart broker and test again.
-End with RETEST_COMPLETE.`,
-      verification: { type: 'output_contains', value: 'RETEST_COMPLETE' },
-      retries: 2,
-    })
-
-    .run({ cwd: process.cwd() });
-
-  console.log('Result:', result.status);
-}
-
-main().catch(console.error);
\ No newline at end of file
diff --git a/workflows/wire-process-backend.ts b/workflows/wire-process-backend.ts
deleted file mode 100644
index dc44d076c..000000000
--- a/workflows/wire-process-backend.ts
+++ /dev/null
@@ -1,314 +0,0 @@
-/**
- * wire-process-backend — Wire ProcessBackend into the SDK runner
- * ================================================================
- *
- * Run:
- *   agent-relay run workflows/wire-process-backend.ts
- *
- * Resume from a specific step (reuses cached outputs from the last run):
- *   START_FROM=build agent-relay run workflows/wire-process-backend.ts
- *   START_FROM=commit agent-relay run workflows/wire-process-backend.ts
- *
- * Resume a specific failed run by ID:
- *   RESUME_RUN_ID=<run-id> agent-relay run workflows/wire-process-backend.ts
- *
- * ────────────────────────────────────────────────────────────────────
- *
- * The cloud repo now exports a ProcessBackend interface:
- *
- *   interface ProcessBackend {
- *     createEnvironment(label): Promise<ProcessEnvironment>
- *   }
- *   interface ProcessEnvironment {
- *     id: string; homeDir: string;
- *     exec(command, opts?): Promise<{ output, exitCode }>
- *     uploadFile(content, path): Promise<void>
- *     destroy(): Promise<void>
- *   }
- *
- * This workflow wires it into the relay SDK so the runner can use it:
- *
- * 1. Add ProcessBackend + ProcessEnvironment interfaces to the SDK
- *    (packages/sdk/src/workflows/types.ts)
- *
- * 2. Accept processBackend in WorkflowRunnerOptions
- *    (packages/sdk/src/workflows/runner.ts)
- *
- * 3. Create a ProcessBackend-backed RunnerStepExecutor when processBackend is
- *    set and no explicit executor is provided.
- *
- * The Rust broker change (replacing Command::spawn with backend.exec) remains
- * a separate follow-up. This TS adapter lets workflow steps run in cloud
- * environments without changing the default local broker path.
- */
-import { workflow } from '@agent-relay/sdk/workflows';
-
-const CHANNEL = 'wf-wire-process-backend';
-const FEATURE_BRANCH = 'feat/process-backend-runner';
-
-async function main() {
-  const result = await workflow('wire-process-backend')
-    .description(
-      'Wire ProcessBackend into the SDK runner so cloud sandboxes can execute workflow steps',
-    )
-    .pattern('dag')
-    .channel(CHANNEL)
-    .maxConcurrency(3)
-    .timeout(1_200_000)
-
-    .agent('impl', {
-      cli: 'claude',
-      role: 'Implements the ProcessBackend wiring in the relay SDK',
-      retries: 2,
-    })
-
-    // ── Phase 1: Read ────────────────────────────────────────────────
-    .step('read-types', {
-      type: 'deterministic',
-      command: 'cat packages/sdk/src/workflows/types.ts',
-      captureOutput: true,
-    })
-
-    .step('read-runner-options', {
-      type: 'deterministic',
-      command: 'sed -n "250,290p" packages/sdk/src/workflows/runner.ts',
-      captureOutput: true,
-    })
-
-    .step('read-runner-constructor', {
-      type: 'deterministic',
-      command: 'sed -n "460,470p" packages/sdk/src/workflows/runner.ts',
-      captureOutput: true,
-    })
-
-    .step('read-runner-fork', {
-      type: 'deterministic',
-      command: 'sed -n "4033,4055p" packages/sdk/src/workflows/runner.ts',
-      captureOutput: true,
-    })
-
-    .step('read-requires-broker', {
-      type: 'deterministic',
-      command: 'sed -n "2710,2730p" packages/sdk/src/workflows/runner.ts',
-      captureOutput: true,
-    })
-
-    .step('read-build-command', {
-      type: 'deterministic',
-      command: 'grep -n "buildNonInteractiveCommand" packages/sdk/src/workflows/runner.ts | head -10',
-      captureOutput: true,
-    })
-
-    .step('read-exports', {
-      type: 'deterministic',
-      command: 'grep -n "export" packages/sdk/src/workflows/index.ts 2>/dev/null || echo "no index.ts barrel"',
-      captureOutput: true,
-    })
-
-    .step('read-tests', {
-      type: 'deterministic',
-      command: 'ls packages/sdk/src/workflows/__tests__/*.test.ts 2>/dev/null | head -10 && echo "---" && cat packages/sdk/src/workflows/__tests__/step-executor.test.ts 2>/dev/null | head -50 || echo "no step-executor test"',
-      captureOutput: true,
-    })
-
-    // ── Phase 2: Implement ───────────────────────────────────────────
-    .step('implement', {
-      agent: 'impl',
-      dependsOn: [
-        'read-types', 'read-runner-options', 'read-runner-constructor',
-        'read-runner-fork', 'read-requires-broker', 'read-build-command',
-        'read-exports', 'read-tests',
-      ],
-      task: `Wire ProcessBackend into the relay SDK runner. This is a BACKWARD COMPATIBLE change — when no processBackend is provided, behavior is identical to today.
-
-## Current code
-
-=== packages/sdk/src/workflows/types.ts (excerpt) ===
-{{steps.read-types.output}}
-
-=== WorkflowRunnerOptions + RunnerStepExecutor (runner.ts:250-290) ===
-{{steps.read-runner-options.output}}
-
-=== Constructor (runner.ts:460-470) ===
-{{steps.read-runner-constructor.output}}
-
-=== The fork (runner.ts:4033-4055) ===
-{{steps.read-runner-fork.output}}
-
-=== requiresBroker check (runner.ts:2710-2730) ===
-{{steps.read-requires-broker.output}}
-
-=== buildNonInteractiveCommand references ===
-{{steps.read-build-command.output}}
-
-=== Exports ===
-{{steps.read-exports.output}}
-
-=== Tests ===
-{{steps.read-tests.output}}
-
-## Changes needed
-
-### 1. Add ProcessBackend interfaces to types.ts
-
-At the end of packages/sdk/src/workflows/types.ts, add:
-
-// ── ProcessBackend: cloud-injected execution environment ─────────────────────
-//
-// Relay owns command construction, auth env, cwd, timeout, and step lifecycle.
-// The backend owns execution environments (create VM, run command, destroy VM).
-// uploadFile is reserved for future file asset staging; current executors run
-// commands directly with env/cwd/timeout passed through exec options.
-
-export interface ProcessBackend {
-  /** Create an isolated execution environment (e.g. a Daytona sandbox). */
-  createEnvironment(label: string): Promise<ProcessEnvironment>;
-}
-
-export interface ProcessEnvironment {
-  /** Unique identifier for this environment. */
-  id: string;
-  /** Home directory inside the environment. */
-  homeDir: string;
-  /** Execute a shell command in the environment. */
-  exec(command: string, opts?: { cwd?: string; env?: Record<string, string>; timeoutSeconds?: number }): Promise<{ output: string; exitCode: number }>;
-  /** Upload a file into the environment. */
-  uploadFile(content: string | Buffer, remotePath: string): Promise<void>;
-  /** Tear down the environment and release resources. */
-  destroy(): Promise<void>;
-}
-
-### 2. Add processBackend to WorkflowRunnerOptions
-
-In packages/sdk/src/workflows/runner.ts, find the WorkflowRunnerOptions interface and add:
-
-  /**
-   * Process backend for remote execution environments.
-   * When set without an explicit executor, the runner wraps it in a
-   * RunnerStepExecutor that creates isolated environments for agent and
-   * deterministic steps. The runner builds CLI commands and passes auth env,
-   * cwd, and timeout; the backend provides create/exec/destroy primitives.
-   *
-   * When both executor and processBackend are set, executor takes precedence.
-   * When neither is set, the broker spawns local child processes (default).
-   */
-  processBackend?: ProcessBackend;
-
-Make sure to import ProcessBackend from the types file at the top of runner.ts.
-
-### 3. Store processBackend and synthesize an executor in the constructor
-
-In the constructor, after the line that sets this.executor, add:
-
-    this.processBackend = options.processBackend;
-
-And add the private field to the class:
-
-  private readonly processBackend?: ProcessBackend;
-
-Then synthesize the ProcessBackend executor only when no explicit executor was
-provided:
-
-    if (!this.executor && this.processBackend) {
-      this.executor = createProcessBackendExecutor(this.processBackend, {
-        env: this.envSecrets,
-      });
-    }
-
-### 4. Add the ProcessBackend executor
-
-Add packages/sdk/src/workflows/process-backend-executor.ts. It should:
-
-- Build non-interactive CLI commands using the existing process-spawner helper.
-- Pass env, cwd, and ceil-rounded timeoutSeconds via ProcessEnvironment.exec options.
-- Shell-escape argv safely before joining into the command string.
-- Reject cli:"api" because API agents do not run as subprocesses.
-- Destroy the environment in a finally block.
-
-### 5. Keep the existing executor fork behavior
-
-Do not add a second processBackend-specific fork. The constructor makes
-this.executor point at the ProcessBackend executor when processBackend is set
-and executor is omitted, so the existing this.executor branch remains the single
-extension point. The default no-executor path still uses spawnAndWait.
-
-After making changes, run:
-  npm run build 2>&1 | tail -20
-  npm test 2>&1 | tail -30
-
-IMPORTANT: Write all changes to disk. Do NOT just output code.`,
-      verification: { type: 'exit_code' },
-    })
-
-    // ── Phase 2b: Verify edits ───────────────────────────────────────
-    .step('verify-edits', {
-      type: 'deterministic',
-      dependsOn: ['implement'],
-      command: [
-        'set -e',
-        'grep -q "ProcessBackend" packages/sdk/src/workflows/types.ts || (echo "MISSING: ProcessBackend in types.ts"; exit 1)',
-        'grep -q "ProcessEnvironment" packages/sdk/src/workflows/types.ts || (echo "MISSING: ProcessEnvironment in types.ts"; exit 1)',
-        'grep -q "processBackend" packages/sdk/src/workflows/runner.ts || (echo "MISSING: processBackend in runner.ts"; exit 1)',
-        'grep -q "createProcessBackendExecutor" packages/sdk/src/workflows/process-backend-executor.ts || (echo "MISSING: ProcessBackend executor"; exit 1)',
-        'if git diff --quiet packages/sdk/src/workflows/types.ts; then echo "types.ts NOT MODIFIED"; exit 1; fi',
-        'if git diff --quiet packages/sdk/src/workflows/runner.ts; then echo "runner.ts NOT MODIFIED"; exit 1; fi',
-        'if git diff --quiet packages/sdk/src/workflows/process-backend-executor.ts; then echo "process-backend-executor.ts NOT MODIFIED"; exit 1; fi',
-        'echo "All expected changes verified"',
-      ].join(' && '),
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 3: Build + test gate ──────────────────────────────────
-    .step('build', {
-      type: 'deterministic',
-      dependsOn: ['verify-edits'],
-      command: 'npm run build 2>&1 | tail -30',
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('run-tests', {
-      type: 'deterministic',
-      dependsOn: ['verify-edits'],
-      command: 'npm test 2>&1 | tail -60',
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    // ── Phase 4: Commit + push + PR ─────────────────────────────────
-    .step('commit', {
-      type: 'deterministic',
-      dependsOn: ['build', 'run-tests'],
-      command: [
-        `git checkout -B ${FEATURE_BRANCH}`,
-        'git add packages/sdk/src/workflows/types.ts packages/sdk/src/workflows/runner.ts packages/sdk/src/workflows/process-backend-executor.ts packages/sdk/src/workflows/index.ts packages/sdk/src/workflows/__tests__/process-backend-executor.test.ts workflows/wire-process-backend.ts',
-        'if git diff --cached --quiet; then echo "NO CHANGES"; exit 1; fi',
-        'git commit -m "feat(sdk): add ProcessBackend executor for workflows"',
-        `git push -u origin ${FEATURE_BRANCH}`,
-      ].join(' && '),
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .step('open-pr', {
-      type: 'deterministic',
-      dependsOn: ['commit'],
-      command: [
-        `gh pr view ${FEATURE_BRANCH} --repo AgentWorkforce/relay --json url -q .url 2>/dev/null && echo 'PR already exists' && exit 0`,
-        `gh pr create --repo AgentWorkforce/relay --base main --head ${FEATURE_BRANCH} --title 'feat(sdk): add ProcessBackend executor for cloud sandbox execution' --body "## Summary\n\nAdds ProcessBackend and ProcessEnvironment interfaces to the SDK, accepts processBackend in WorkflowRunnerOptions, and creates a ProcessBackend-backed RunnerStepExecutor when no explicit executor is provided.\n\n## What this does\n\n- Exports ProcessBackend + ProcessEnvironment from @agent-relay/sdk/workflows\n- WorkflowRunnerOptions accepts optional processBackend field\n- Agent and deterministic steps can execute through ProcessEnvironment.exec\n- env, cwd, and timeoutSeconds are passed through structured exec options\n\n## Boundary\n\n- Relay builds CLI commands and passes auth env, cwd, and timeout metadata\n- ProcessBackend creates environments, executes commands, and destroys environments\n- uploadFile is part of the interface for future file asset staging and is not used by this executor yet\n\n## Test plan\n\n- [x] npm run build passes\n- [x] npm test passes"`,
-      ].join(' || '),
-      captureOutput: true,
-      failOnError: true,
-    })
-
-    .onError('retry', { maxRetries: 2, retryDelayMs: 10_000 })
-    .run({ cwd: process.cwd() });
-
-  console.log(`Run status: ${result.status}`);
-  if (result.status !== 'completed') {
-    process.exit(1);
-  }
-}
-
-main().catch(console.error);