07 - Implementation Plan

Phased build-out of Substrate. This file remains a roadmap, but implemented phases are rewritten to match repository HEAD instead of earlier pre-rename drafts.

Directory Structure

cmd/substrate/main.go
internal/
    domain/                # Session / Plan / Task / Review domain types
    repository/
        interfaces.go      # repository interfaces
        sqlite/            # SQLite implementations
    service/               # state machines, domain rules, transacter-wrapped repos
    orchestrator/          # planning, implementation, review, foreman, resume, session registry
    adapter/
        linear/
        manual/
        gitlab/
        github/
        glab/
        sentry/
        ohmypi/
        claudeagent/
        codex/
    app/                   # adapter + harness wiring, remote detection
    event/                 # persisted channel bus
    gitwork/               # git-work integration and workspace helpers
    config/                # YAML config + secret hydration
    tui/                   # Bubble Tea UI
        views/             # 60+ view and overlay files (incl. overlay_add_repo.go)
        components/        # bunny, input, confirm, overlay frame, toast, etc.
bridge/omp-bridge.ts
migrations/
    001_initial.sql
    002_agent_sessions_canonical.sql
    003_omp_session_meta.sql
    004_sub_plan_planning_round.sql
    005_review_artifacts.sql
~/.substrate/state.db

Phase 0: Project Bootstrap (Week 1)

Shipped.

What exists today:

typed config loading in internal/config/
global path helpers (GlobalDir, GlobalDBPath, ConfigPath, SessionsDir)
defaulting + validation for commit, plan, review, harness, adapters, foreman, and repos
migration runner and migrations/001_initial.sql
cmd/substrate/main.go startup that loads config, runs migrations, wires repos/services/bus/adapters/harnesses, and starts the TUI
secret hydration via config.LoadSecrets

Current config model is richer than the original plan:

harness.default plus per-phase overrides for planning, implementation, review, and foreman
adapter config blocks for ohmypi, claude_code, codex, linear, glab, gitlab, github, and sentry
foreman.question_timeout
per-repo repos.<name>.doc_paths

Phase 1: Core Domain + Persistence (Week 2)

Shipped, with naming updated from older drafts.

Current domain/storage split:

root aggregate is domain.Session
orchestration record is domain.Plan
per-repo plan slice is domain.TaskPlan
repo-scoped harness run is domain.Task
review is domain.ReviewCycle + domain.Critique
questions are domain.Question
liveness is domain.SubstrateInstance

Storage still uses legacy table names:

work_items stores domain.Session
agent_sessions stores domain.Task

Current schema details worth preserving:

plans.faq JSON column exists and backs the Foreman FAQ flow
questions.proposed_answer exists for escalated-question UX
critiques.suggestion exists
agent_sessions.owner_instance_id points at substrate_instances
system_events persists raw domain.SystemEvent rows
github_pull_requests and gitlab_merge_requests store provider-native PR/MR records (migration 005)
session_review_artifacts links work items to provider artifacts with dedup on (workspace_id, work_item_id, provider, provider_artifact_id)
sub_plans.planning_round tracked which planning round last modified each sub-plan (migration 004; dropped in migration 008)- migration 006 migrates OMP-specific session metadata to generic resume_info
migration 007 adds plan supersede model (partial unique index on non-superseded plans) and plan_id to agent_sessions pointing at plans(id)
migration 008 drops sub_plans.planning_round
agent_sessions.resume_info tracks native harness session state as generic resume metadata

SQLite implementations live in internal/repository/sqlite/ and accept generic.SQLXRemote. resources.go still groups transaction-bound repos into a Resources bundle for tests / transactional construction.

Phase 2: Service Layer (Week 2-3)

Shipped.

Current services and names:

SessionService
PlanService
TaskService
ReviewService
QuestionService
WorkspaceService
InstanceService
EventService
GithubPRService
GitlabMRService
SessionReviewArtifactService Important current behavior:
SessionService enforces root-session uniqueness and lifecycle transitions
TaskService owns task lifecycle plus SearchHistory / interrupted-owner queries
QuestionService supports EscalateWithProposal and UpdateProposal
PlanService owns sub-plan transitions and AppendFAQ through the repo boundary
all services use the atomic.Transacter[repository.Resources] pattern: every call runs inside Transact(ctx, func(ctx, res) error { ... }) closures, giving services consistent transaction boundaries and read-after-write consistency

This layer is already using the renamed Session / Task model; older WorkItemService / AgentSessionService wording is obsolete.

Phase 3: Event Bus + Adapter Interfaces (Week 3)

Shipped, but in its current mixed form.

Current reality:

domain.SystemEvent is the persisted event shape
event.Bus is a channel-based pub/sub bus with topic filtering
worktree.creating is the default pre-hook event type
pre-hooks and post-hooks are supported, with default 30s hook timeouts
dispatch is non-blocking; full subscriber buffers yield ErrRetryLater unless a drop handler is installed
adapter OnEvent retries up to 3 times before publishing adapter.error system events and surfacing best-effort TUI warnings
work item adapters subscribe to all events and self-filter in OnEvent
repo lifecycle adapters subscribe only to worktree.created and work_item.completed

Also important: not all events currently flow through the bus. Planning persists lifecycle events directly through EventService.Create (not through the bus). Adapters persist review artifact events the same way.

Phase 4: git-work Integration (Week 3-4)

Shipped.

Current behavior:

workspace identity uses .substrate-workspace
workspace discovery walks upward from cwd
repo discovery looks for direct child directories containing .bare/
planning preflight warns on plain git clones and failed pulls
discoverer pull cooldown (5 minutes) avoids redundant pulls within the same window
pull failures are recorded but do not stop discovery
planning creates <workspace>/.substrate/sessions/<planning-session-id>/plan-draft.md
implementation creates feature worktrees through git-work checkout

Phase 6: Multi-Harness Agent Integration (Week 4-5)

Partially shipped.

Current production-quality path:

ohmypi is the default harness and the only path with verified interactive continuation behavior across planning, implementation, review, and foreman flows.

Currently wired but still parity-limited:

claudeagent
codex

Current router behavior in internal/app/harness.go:

each phase resolves a single harness from config
missing binaries cause that phase to be unavailable rather than silently pretending parity
Resume currently reuses the implementation harness choice
diagnostics are surfaced for settings/TUI consumption

The important naming update here is that the implementation talks about harness phases and AgentHarness instances, not the old single-harness assumption.

6a. oh-my-pi bridge (default, production path)

What is true today:

runtime path is internal/adapter/ohmypi/
package name is still omp
readiness checks validate bridge availability and Bun requirements for source-bridge mode
the bridge emits structured events consumed by the Go harness session wrapper

6b. Claude Code adapter (implemented, parity still limited)

startup and selection are wired
binary presence is checked
do not treat it as equal to oh-my-pi for interactive continuation semantics unless verified by tests/runtime evidence
supports compact and native resume via a TypeScript bridge (@anthropic-ai/claude-agent-sdk, same architecture as OMP bridge)

6c. Codex adapter (implemented, parity still limited)

same current status as Claude Code: selectable and wired, supports messaging via thread resume and native resume, but steering returns ErrSteerNotSupported and compact is not supported

6d. Harness routing, packaging, and validation

Current config shape is YAML, not TOML:

harness:
  default: ohmypi
  phase:
    planning: ohmypi
    implementation: ohmypi
    review: ohmypi
    foreman: ohmypi

Phase 7: Planning Pipeline (Week 5-6)

Shipped.

Current planning flow in internal/orchestrator/planning.go:

transition root Session from ingested to planning
load Workspace
run preflight (Discoverer.PreflightCheck)
pull main/ worktrees best-effort
discover repos and metadata
read workspace-root AGENTS.md
create workspace-local planning session dir and draft path
render planning prompt
start harness planning session
wait for draft file or completion
parse/validate the draft
run correction loop up to plan.max_parse_retries
persist Plan and TaskPlans
transition root Session to plan_review
persist planning events

Current naming details:

planning is launched for a root Session, not a WorkItem type
parsed repo slices are TaskPlan, not SubPlan in the domain model
PlanningContext uses WorkItemSnapshot as a projection name, but it snapshots the Session aggregate

Phase 8: Implementation Orchestrator (Week 6-7)

Shipped.

Current implementation flow in ImplementationService:

requires PlanApproved
loads the root Session and its Workspace
discovers repository paths before mutating root-session state
transitions root Session to implementing
pre-creates unique worktrees sequentially to avoid same-wave races
builds waves from TaskPlan.Order
executes tasks in a wave concurrently via errgroup
creates a durable Task row before launching each harness session
forwards harness events to the bus while the task runs
transitions the root Session to reviewing or failed at the end

Current event nuance:

worktree.creating and worktree.created go through event.Bus
work_item.implementation_started and task start/complete/fail events go through event.Bus via ImplementationService.publishEvent

Phase 9: Foreman + Review Pipeline (Week 7-8)

Shipped in current form.

Foreman

What exists now:

Foreman manages a persistent foreman-phase harness session per plan
StartForemanCmd is triggered from the TUI after plan approval and during review-driven reimplementation loops
questions are serialized through the foreman worker queue
high-confidence answers are persisted immediately and appended to Plan.FAQ
uncertain answers are escalated with Question.ProposedAnswer
the TUI can keep iterating with the foreman before calling ResolveEscalated
question_timeout is configurable through foreman.question_timeout; config default is "0" (documented as indefinite, but runtime falls back to 60 s)
the Foreman is stopped on implementation completion and restarted for follow-up feedback
it counts as a live session in the TUI status bar
the Foreman receives the full composed plan document as its system prompt context

Review

What exists now:

review is modeled as ReviewPipeline.ReviewSession(session domain.Task)
a review harness session is started in foreman mode
output is parsed for CRITIQUE / END_CRITIQUE blocks or NO_CRITIQUES
correction-loop retries reuse the same live review session
major/critical critiques trigger reimplementation decisions via the review result path
review outcome events are published through the bus
review sessions run in SessionModeAgent with a review-only role instruction
parse failures are treated as no critiques (no correction loop in agent mode)

Orchestrator-owned review pipeline

The implementation orchestrator now owns the per-repo review loop:

AutoFeedbackLoop config flag (default true) gates automatic reimplementation after critique
review critiques are reused as feedback when re-triggering implementation
native harness sessions are resumed via ResumeFromSessionID and ResumeInfo for review reimplementation
escalated sub-plans are persisted as failed before the work item is routed to reviewing
bridge answer timeout is handled through ohMyPiSession.SendAnswer

Phase 9b: Resume & Recovery (Week 8)

Shipped.

Current behavior:

InstanceService owns instance CRUD, heartbeat updates, liveness checks, and stale cleanup
orphan reconciliation runs on TUI startup via ReconcileOrphanedTasksCmd — tasks whose owner instance is absent or stale are transitioned to interrupted
Resumption.ResumeSession creates a new Task against the same TaskPlan and worktree
the interrupted task remains interrupted for audit purposes
resume context includes the last 50 log lines from ~/.substrate/sessions/<old-task-id>.log
AgentSessionResumed is published through the bus
AbandonSession terminalizes an interrupted task as failed
superseded interrupted sessions are failed once the replacement task is durable
deleteOrFailPendingSession is the shared cleanup path for failed starts
SessionRegistry.AbortAndDeregister is the idempotent live-session teardown path
DeleteSessionMsg cancels pipelines, aborts registry sessions, stops Foreman when needed, then deletes task/work-item rows and artifacts
graceful agent abort on quit: q / ctrl+c / SIGTERM trigger a confirmation dialog when agents are running

The old InstanceManager has been deleted. Instance reconciliation is now split between InstanceService (row ownership and heartbeat liveness) and the TUI startup orphan-reconciliation command.

Phase 10: Work Item Browsing and Selection (Week 8)

Still roadmap-oriented, but the terminology should be read as follows:

browsing creates root Session records
adapters resolve selections into domain.Session
manual creation remains a separate explicit path

The capability-driven browse contract already exists in internal/adapter/types.go and interfaces.go; remaining work is mostly UI semantics and provider-scope parity.

Phase 11: GitLab / GitHub Adapters and Unified Browse Semantics (Week 8-9)

Mixed state:

GitLab, GitHub, Linear, Manual, Glab, and Sentry adapters exist
work-item tracker mutation and repo-lifecycle automation are already split into different adapter contracts
internal/app/remotedetect routes lifecycle adapters by detected provider
browse/filter parity across all providers remains a roadmap item rather than a finished uniform contract

For Sentry specifically, the roadmap here should now be read alongside 04-adapters.md: 07 owns the broader rollout picture, while 04 owns the shipped Sentry source-adapter contract, auth/config model, browse semantics, and settings integration details.

Phase 12: TUI (Week 9-11)

Substantial portions are shipped.

Current TUI reality to keep in mind:

the default sidebar is root-session / work-item centric
history search uses SessionHistoryEntry
work-item completion and plan approval publish bus events from TUI command helpers
settings pages rebuild services and rewire adapters/harnesses dynamically
workspace init is a TUI flow, not just a CLI-only concern
overview view with action cards, confirm triggers, and superseded-interrupted filtering
session transcript view with structured rendering for assistant, prompt, tool, lifecycle, question, thinking, and foreman output
log overlay with line numbers, wrapping, scroll, and clipboard copy (c key)
source items overlay with split list/preview pane
empty state with animated ASCII bunny component
mouse scroll support (tea.WithMouseCellMotion) in overview, log overlay, source items overlay, and settings
quit confirmation dialog when agents are running (q, ctrl+c, SIGTERM)
duplicate session dialog for choosing between duplicate work items
centralized text input/textarea component with macOS-compatible word-movement bindings
raw key debug logger (SUBSTRATE_KEY_DEBUG)
Add Repository overlay for browsing and cloning remote repositories via GitHub, GitLab, and manual URL
Sidebar filters (All/Active/Needs Attention/Completed), grouping dimensions, and sort direction controls with custom scrollbar
Render caching for sidebar and status bar
Grouped task sidebar entries under section headers
Completed-session action card with Changes keybind
Plan inspect key (i) from planning sessions and plan review
Commit strategy injection into agent system prompts and residual-change commit/push after review pass
Compact-before-critique for review reimplementation when harness supports it

Phase 13: End-to-End Integration (Week 11-12)

Still the umbrella outcome: provider work item -> planning -> approval -> implementation -> review -> completion, with lifecycle automation routed by repository host.

The current codebase already has e2e coverage scaffolding under test/e2e/, but this phase remains the overall integration target rather than a finished “nothing left to do” claim.

Autonomous Validation Strategy

Keep the validation split, but interpret it against current repo structure:

# Unit
 go test ./...
 go test -race ./...
 go vet ./...

# Focused integration / e2e surfaces
 go test -tags=integration ./internal/gitwork/...
 go test -tags=integration ./internal/adapter/ohmypi/...
 go test -tags=integration ./internal/adapter/linear/...
 go test -tags=integration ./internal/adapter/gitlab/...
 go test -tags=integration ./internal/adapter/github/...
 go test -tags=integration ./internal/adapter/glab/...
 go test -tags=integration,e2e -timeout=30m ./test/e2e/...

Risk Register

The main live risks that still match current architecture are:

harness parity drift between oh-my-pi and the alternative harnesses
provider browse/filter semantics diverging across adapters
event-bus partial delivery when ErrRetryLater happens after some subscribers already received an event
SQLite contention and retry behavior under concurrent writes
bridge / CLI output format drift in external tools
foreman question_timeout config default "0" is documented as indefinite but runtime falls back to 60 s

Known Gaps

Current gaps that remain accurate to call out:

event-bus partial-delivery semantics are accepted and require idempotent consumers
no dead-letter store for adapter errors; adapter.error events are persisted but best-effort TUI warnings can be dropped when the message channel is full
pre-hook timeouts cannot kill a misbehaving goroutine that ignores context cancellation
Linux sandboxing for the oh-my-pi bridge remains less mature than the macOS path
some adapter / harness parity claims are intentionally held back until real-binary verification exists

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

07 - Implementation Plan

Directory Structure

Phase 0: Project Bootstrap (Week 1)

Phase 1: Core Domain + Persistence (Week 2)

Phase 2: Service Layer (Week 2-3)

Phase 3: Event Bus + Adapter Interfaces (Week 3)

Phase 4: git-work Integration (Week 3-4)

Phase 6: Multi-Harness Agent Integration (Week 4-5)

6a. oh-my-pi bridge (default, production path)

6b. Claude Code adapter (implemented, parity still limited)

6c. Codex adapter (implemented, parity still limited)

6d. Harness routing, packaging, and validation

Phase 7: Planning Pipeline (Week 5-6)

Phase 8: Implementation Orchestrator (Week 6-7)

Phase 9: Foreman + Review Pipeline (Week 7-8)

Foreman

Review

Orchestrator-owned review pipeline

Phase 9b: Resume & Recovery (Week 8)

Phase 10: Work Item Browsing and Selection (Week 8)

Phase 11: GitLab / GitHub Adapters and Unified Browse Semantics (Week 8-9)

Phase 12: TUI (Week 9-11)

Phase 13: End-to-End Integration (Week 11-12)

Autonomous Validation Strategy

Risk Register

Known Gaps

FilesExpand file tree

07-implementation-plan.md

Latest commit

History

07-implementation-plan.md

File metadata and controls

07 - Implementation Plan

Directory Structure

Phase 0: Project Bootstrap (Week 1)

Phase 1: Core Domain + Persistence (Week 2)

Phase 2: Service Layer (Week 2-3)

Phase 3: Event Bus + Adapter Interfaces (Week 3)

Phase 4: git-work Integration (Week 3-4)

Phase 6: Multi-Harness Agent Integration (Week 4-5)

6a. oh-my-pi bridge (default, production path)

6b. Claude Code adapter (implemented, parity still limited)

6c. Codex adapter (implemented, parity still limited)

6d. Harness routing, packaging, and validation

Phase 7: Planning Pipeline (Week 5-6)

Phase 8: Implementation Orchestrator (Week 6-7)

Phase 9: Foreman + Review Pipeline (Week 7-8)

Foreman

Review

Orchestrator-owned review pipeline

Phase 9b: Resume & Recovery (Week 8)

Phase 10: Work Item Browsing and Selection (Week 8)

Phase 11: GitLab / GitHub Adapters and Unified Browse Semantics (Week 8-9)

Phase 12: TUI (Week 9-11)

Phase 13: End-to-End Integration (Week 11-12)

Autonomous Validation Strategy

Risk Register

Known Gaps