feat: add reference replay runner #8

Merged: drewstone merged 1 commit into main from feat/reference-replay-runner on Apr 25, 2026

Conversation

@drewstone
Contributor

Adds the execution layer around the reference replay scorer so downstream products can run hidden-reference evals end to end.

Changes:
- add `ReferenceReplayCase`, execution scenario, adapter, run, and run record types
- add `runReferenceReplay` with hidden references, per-case scoring, failure capture, and full run scoring
- add in-memory and JSONL run stores
- add run-to-run promotion decision helper
- cover adapter reference hiding, persistence, JSONL round-trip, failure capture, and promotion behavior

Verification:
- `pnpm test -- tests/reference-replay.test.ts`
- `pnpm typecheck`
- `pnpm build`
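For readers without the diff open, here is a minimal sketch of how the pieces described above might fit together. It is not the code from this PR. Every field shape, the synchronous adapter signature, the exact-match `scoreCase`, and the helpers `runReferenceReplaySketch`, `toJsonl`, `fromJsonl`, and `shouldPromote` are assumptions made for illustration. Only the names `ReferenceReplayCase` and `runReferenceReplay` come from the description, and the real runner is probably async.

```typescript
// Hypothetical shapes: the PR names these types but does not show their fields.
interface ReferenceReplayCase {
  id: string;
  input: string;
  reference: string; // must never reach the adapter
}

// What the adapter is allowed to see: the case without its reference.
type VisibleCase = Omit<ReferenceReplayCase, "reference">;

// Assumed adapter contract, kept synchronous for brevity.
type Adapter = (visible: VisibleCase) => string;

interface CaseRecord {
  caseId: string;
  output?: string;
  score: number;
  error?: string; // set when the adapter threw
}

interface RunRecord {
  runId: string;
  cases: CaseRecord[];
  score: number; // mean of per-case scores
}

// Placeholder scorer (exact match); stands in for the real reference replay scorer.
function scoreCase(output: string, reference: string): number {
  return output.trim() === reference.trim() ? 1 : 0;
}

function runReferenceReplaySketch(
  runId: string,
  cases: ReferenceReplayCase[],
  adapter: Adapter,
): RunRecord {
  const records: CaseRecord[] = cases.map((c) => {
    // Strip the reference before handing the case to the adapter.
    const { reference, ...visible } = c;
    try {
      const output = adapter(visible);
      return { caseId: c.id, output, score: scoreCase(output, reference) };
    } catch (err) {
      // Failure capture: record the error and score the case as 0.
      const message = err instanceof Error ? err.message : String(err);
      return { caseId: c.id, score: 0, error: message };
    }
  });
  const total = records.reduce((sum, r) => sum + r.score, 0);
  const score = records.length === 0 ? 0 : total / records.length;
  return { runId, cases: records, score };
}

// JSONL persistence: one run record per line.
function toJsonl(runs: RunRecord[]): string {
  return runs.map((r) => JSON.stringify(r)).join("\n");
}

function fromJsonl(text: string): RunRecord[] {
  return text
    .split("\n")
    .filter((line) => line.trim() !== "")
    .map((line) => JSON.parse(line) as RunRecord);
}

// Assumed promotion rule: promote only if the candidate beats the baseline by more than minGain.
function shouldPromote(baseline: RunRecord, candidate: RunRecord, minGain = 0): boolean {
  return candidate.score - baseline.score > minGain;
}
```

In this sketch the reference is removed by destructuring before the adapter call, so hiding depends on the runner rather than on the adapter behaving well. A throwing adapter produces a record with `error` set instead of aborting the run.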

@drewstone force-pushed the feat/reference-replay-runner branch from 9343d8b to 814a103 on April 25, 2026, 19:56