feat(tooling): local style-variant experimentation script by MarkusNeusinger · Pull Request #6378 · MarkusNeusinger/anyplot

MarkusNeusinger · 2026-05-11T16:39:24Z

Summary

Adds scripts/style-experiment.py — a local sandbox for A/B-testing plot style variants (canvas size, fonts, palette, line weights, ...) on real spec/library combos before committing to a style change. Motivated by the observation that the current 4800×2700 canvas with title=24pt / axis=20pt / tick=16pt / line=3 looks crisp at detail view (1600 px) but shrinks to <8 px effective text at thumbnail / mobile display sizes.

Rather than guess at the right new defaults, this script lets us render the same spec×library through several variants in one shot and compare side-by-side at multiple display widths.

How it works

Mimics agentic/workflows/modules/regen/render.py exactly: uv run python -P <impl>, MPLBACKEND=Agg, ANYPLOT_THEME=<theme>, xvfb fallback for headless bokeh.
Patches the implementation source in place and restores it via try/finally + atexit + SIGINT/SIGTERM/SIGHUP handlers. In-place is required because some impls (highcharts) resolve Path(__file__).parents[3] for shared assets and break if the file is copied to a temp dir.
All output (PNG + HTML + run logs + manifest.json + compare.html) lands under /tmp/anyplot-style-experiments/<timestamp>/ — never inside the repo, never on GCS.
Pre/post git diff --name-only plots/ check guards against patches not being restored on hard kills.

Starter variants (`scripts/style-variants.yaml`)

variant	what it does
`baseline`	unchanged production style
`smaller_canvas`	4800×2700 → 2400×1350 (effective text doubles at thumbnail scale)
`bigger_fonts`	title 24→40pt, axis 20→32pt, tick 16→26pt, line 3→5
`combo_smaller_canvas_modest_fonts`	halved canvas + modest font bump (28/22/18)
`palette_tableau`	swap Okabe-Ito for Tableau-10 (sanity check, not a real proposal)

New variants are pure YAML — no script changes. Patches are {find, replace} literal or {regex, replace} and can be keyed per-library or "*" for all.

Usage

uv run python scripts/style-experiment.py \
    --spec scatter-basic --spec dendrogram-basic \
    --library matplotlib --library plotly \
    --variant baseline --variant smaller_canvas --variant bigger_fonts \
    --theme both --open

The generated compare.html shows rows = (spec, library), columns = variants × {light, dark}, with a dropdown to toggle thumbnail width between 160 / 320 / 640 / 1200 px — the same render evaluated at mobile / grid / detail / large display scale.

Verified

Smoke test on scatter-basic / matplotlib / [baseline, smaller_canvas] / light → baseline produces 4766×2672, smaller_canvas produces 2375×1342 (exactly half), both PNGs render correctly, git status clean on plots/ after the run.
uv.lock is bumped 2.2.0 → 2.3.0 to sync with pyproject.toml.

Test plan

Run on 2–3 specs across matplotlib + plotly + altair with all 5 starter variants
Open compare.html and step through the thumbnail-width toggle to evaluate readability at mobile (160 px) and grid (320 px) scales
Confirm git status stays clean on plots/ after every run
Pick a winning variant (or define a new one in YAML) and follow up with a PR that updates prompts/default-style-guide.md + prompts/library/*.md

https://claude.ai/code/session_01LnqUDs4Fpo1d87qeHbvVx6

Generated by Claude Code

scripts/style-experiment.py is a sandbox for A/B-testing plot style variants (canvas size, font sizes, palette, ...) on real spec/library combos. Mimics agentic/workflows/modules/regen/render.py exactly (uv run -P, MPLBACKEND=Agg, ANYPLOT_THEME, xvfb fallback) but writes all output under /tmp/anyplot-style-experiments/<ts>/ so nothing lands in the repo or GCS. Patches the implementation source in place and restores it via try/finally + atexit + signal handlers - required because some impls (highcharts) resolve Path(__file__).parents[3] for shared assets and break when copied to a temp dir. scripts/style-variants.yaml ships starter variants: baseline, smaller_canvas (4800x2700 -> 2400x1350), bigger_fonts (title 24->40pt, axis 20->32pt, tick 16->26pt, line 3->5), combo_smaller_canvas_modest_fonts, and palette_tableau. New variants are pure YAML - no script changes. Output is a static compare.html with a thumbnail-width toggle (160/320/640/1200 px) so the same render can be evaluated at mobile, grid, and detail display scales side-by-side. uv.lock is bumped 2.2.0 -> 2.3.0 to match pyproject.toml.

Copilot

Pull request overview

Adds a local sandbox tool to render the same spec×library across multiple “style variants” (patch bundles) and themes, producing a side-by-side comparison report to help choose new visual defaults before updating production prompts.

Changes:

Added scripts/style-experiment.py to apply source patches in place, render via uv run python -P, and generate manifest.json + compare.html under /tmp/....
Added scripts/style-variants.yaml with starter style-variant patch sets (canvas sizing, font/line weights, palette swap).
Updated uv.lock to align the lockfile version with pyproject.toml (2.3.0).

Reviewed changes

Copilot reviewed 2 out of 3 changed files in this pull request and generated 5 comments.

File	Description
`uv.lock`	Bumps the editable package version entry to 2.3.0.
`scripts/style-variants.yaml`	Defines YAML-driven style variants and per-library patch rules consumed by the experiment script.
`scripts/style-experiment.py`	New CLI tool to patch implementations, render variants/themes into `/tmp`, and generate a browsable comparison report.

+    pre_dirty = subprocess.run(
+        ["git", "diff", "--name-only", "--", "plots/"],
+        capture_output=True, text=True, cwd=REPO_ROOT,
+    ).stdout.strip()
+    if pre_dirty:
+        print("[style-experiment] WARNING: plots/ has uncommitted changes before run:")
+        print(pre_dirty)
+        print("Patches will still be reverted via try/finally, but verify with `git diff` after.\n")
+


+atexit.register(_restore_all)
+for _sig in (signal.SIGINT, signal.SIGTERM, signal.SIGHUP):
+    try:
+        signal.signal(_sig, _signal_handler)
+    except (ValueError, OSError):
+        pass


+            html.append(f"<div class='group'><h2>{spec} &mdash; {lib}</h2>")
+            html.append("<table><thead><tr><th></th>")
+            for vn in variant_names:
+                html.append(f"<th>{vn}<small>{variant_desc[vn]}</small></th>")
+            html.append("</tr></thead><tbody>")
+            for theme in themes:
+                html.append(f"<tr><th>{theme}</th>")
+                for vn in variant_names:
+                    rec = by_key.get((spec, lib, vn, theme))
+                    html.append(f"<td class='cell {theme}'>")
+                    if rec is None:
+                        html.append("<div class='err'>(no record)</div>")
+                    elif rec.success and rec.png_path:
+                        html.append(f"<a href='{rec.png_path}' target='_blank'>"
+                                    f"<img src='{rec.png_path}' loading='lazy'></a>")
+                        html.append(f"<div class='label'>{rec.duration_sec}s"
+                                    + (f", {rec.patches_applied} patch(es)" if rec.patches_applied else "")
+                                    + "</div>")
+                    else:
+                        err = (rec.error or "unknown error")[:600]
+                        html.append(f"<div class='err'>{err}</div>")
+                    html.append("</td>")


+import json
+import os
+import re
+import shutil


+Outputs land under ``/tmp/anyplot-style-experiments/<timestamp>/`` by default
+- never inside the repo, never on GCS.


codecov · 2026-05-11T16:46:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

- Drop unused `shutil` import. - Make signal trapping cross-platform: build list via getattr so SIGHUP-absence on Windows doesn't raise AttributeError at import. - HTML-escape every interpolated text field in compare.html (spec, library, variant name/description, theme, error/log excerpts) via html.escape, and URL-quote png paths used in href/src. - Replace `git diff --name-only` pre/post checks with `git status --porcelain --untracked-files=all -- plots/` so artifacts written next to __file__ by misbehaving impls are also flagged. - Soften module docstring: the script directs cwd to /tmp and detects leakage via the git status check, but cannot prevent an impl from writing relative to __file__.

Copilot

Pull request overview

Copilot reviewed 2 out of 3 changed files in this pull request and generated 2 comments.

+        except Exception as exc:
+            print(f"[style-experiment] WARNING: failed to restore {path}: {exc}",
+                  file=sys.stderr)
+        _pending_restores.pop(path, None)


+    data = yaml.safe_load(path.read_text())
+    catalog = data.get("variants") or {}
+    missing = [n for n in names if n not in catalog]


…6946) ## Summary - Rescues an orphan post-merge commit (`c8e6828`) from `claude/improve-plot-responsiveness-gHYM5`. PR #6378 was squash-merged on 2026-05-11; a Copilot follow-up fix was pushed to the (closed) branch ~1h later and never landed on main. - Verified missing on main: `isinstance(data, Mapping)` guard is absent, and `_pending_restores.pop` runs unconditionally instead of only on successful restore. ## Changes - **`scripts/style-experiment.py`** - `_restore_all`: skip the `_pending_restores.pop` when the disk write fails, so atexit / signal handlers / `patched_in_place` cleanup can retry. Previously a failed restore silently dropped the entry and could leave the repo patched. - `load_variants`: validate that the loaded YAML is a mapping and that the `variants` key (if present) is also a mapping. Empty/scalar YAML used to crash with a cryptic `AttributeError` on `data.get(...)`. ## Test plan - [x] `python3 -m py_compile scripts/style-experiment.py` passes - [ ] CI green - [ ] `claude/improve-plot-responsiveness-gHYM5` can be deleted after merge Co-authored-by: Claude <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings May 11, 2026 16:39

Copilot started reviewing on behalf of MarkusNeusinger May 11, 2026 16:40 View session

Copilot AI reviewed May 11, 2026

View reviewed changes

claude and others added 2 commits May 11, 2026 16:50

Merge branch 'main' into claude/improve-plot-responsiveness-gHYM5

a6157e5

Copilot AI review requested due to automatic review settings May 11, 2026 19:43

MarkusNeusinger enabled auto-merge (squash) May 11, 2026 19:43

MarkusNeusinger disabled auto-merge May 11, 2026 19:43

MarkusNeusinger enabled auto-merge (squash) May 11, 2026 19:43

Copilot started reviewing on behalf of MarkusNeusinger May 11, 2026 19:44 View session

Copilot AI reviewed May 11, 2026

View reviewed changes

MarkusNeusinger merged commit 5b04bea into main May 11, 2026
11 checks passed

MarkusNeusinger deleted the claude/improve-plot-responsiveness-gHYM5 branch May 11, 2026 19:49

MarkusNeusinger mentioned this pull request May 16, 2026

fix(style-experiment): apply post-merge Copilot feedback from #6378 #6946

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(tooling): local style-variant experimentation script#6378

feat(tooling): local style-variant experimentation script#6378
MarkusNeusinger merged 3 commits into
mainfrom
claude/improve-plot-responsiveness-gHYM5

MarkusNeusinger commented May 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

codecov Bot commented May 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		Outputs land under ``/tmp/anyplot-style-experiments/<timestamp>/`` by default
		- never inside the repo, never on GCS.

Conversation

MarkusNeusinger commented May 11, 2026

Summary

How it works

Starter variants (scripts/style-variants.yaml)

Usage

Verified

Test plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

codecov Bot commented May 11, 2026

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Starter variants (`scripts/style-variants.yaml`)