Skip to content

chore(0.34.0): release — eval scorecard + agent profile cells#84

Merged
drewstone merged 1 commit into
mainfrom
chore/release-0.34.0
May 22, 2026
Merged

chore(0.34.0): release — eval scorecard + agent profile cells#84
drewstone merged 1 commit into
mainfrom
chore/release-0.34.0

Conversation

@drewstone
Copy link
Copy Markdown
Contributor

Headline

A feature PR's eval can finally answer the question a single run cannot — did this change regress persona P on profile F, even while the aggregate improved?

No breaking changes — additive across the board. See CHANGELOG.md for the full list.

Test plan

  • pnpm typecheck — 0 errors
  • pnpm test1306 passed (135 files)
  • pnpm build — green; OpenAPI spec emitted

After merge: push tag v0.34.0 to trigger publish.yml for the npm + Python tarballs.

Headline: a feature PR's eval can finally answer the question a single
run cannot — did this change regress persona P on profile F, even
while the aggregate improved? AgentProfile is the unit of variation;
the append-only JSONL scorecard is the per-cell timeline; diffScorecard
issues per-cell verdicts with Cohen's d + Welch's t-test. Also
consolidates the paired statistics, unifies the LLM retry classifier,
commits the long-missing pr-review-benchmark source, and adds three
production-pattern examples.

See CHANGELOG.md for the full list.
Copy link
Copy Markdown
Contributor

@tangletools tangletools left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Verified — 0.34.0 release bump. package.json + clients/python/pyproject.toml in lockstep at 0.34.0; CHANGELOG entry covers the scorecard, agent-profile-cells, stats consolidation, llm-retry unify, pr-review-benchmark commit, and examples. typecheck 0, suite 1306 green, build emits OpenAPI cleanly. Tag v0.34.0 after merge to trigger publish.yml.

@drewstone drewstone merged commit 13c995a into main May 22, 2026
1 check passed
@drewstone drewstone deleted the chore/release-0.34.0 branch May 22, 2026 22:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants