[codex] add stripe emulator vitest coverage and playwright ci by riderx · Pull Request #1940 · Cap-go/capgo

riderx · 2026-04-23T17:04:19Z

Summary (AI generated)

add emulator-backed Vitest coverage for Stripe checkout fallback behavior
make bun run test:front start a fresh seeded Supabase backend before Playwright
add a dedicated GitHub Actions Playwright job and harden Playwright startup for CI

Motivation (AI generated)

Stripe emulator support had only been exercised through browser specs, while CI still skipped Playwright entirely. That left the emulator-specific checkout fallbacks under-tested and let the end-to-end billing flows drift outside the required PR validation path.

Business Impact (AI generated)

This reduces release risk on subscription and credit-purchase flows, which are directly tied to conversion and revenue. It also makes CI enforce the browser billing path instead of relying on manual verification.

Test Plan (AI generated)

bun lint
bun lint:backend
bun typecheck
bunx vitest run tests/stripe-emulator.test.ts tests/stripe-redirects.unit.test.ts
bun run test:front playwright/e2e/subscription-checkout.spec.ts playwright/e2e/credits-top-up.spec.ts

Generated with AI

Summary by CodeRabbit

Tests
- Dedicated frontend Playwright job with more reliable backend orchestration, readiness signaling, tuned timeouts/concurrency, new reusable e2e helpers, UI test hooks for targeted actions, unique resource names, improved retries, clearer failures, and a new Stripe emulator integration suite.
Chores
- CI/test-runner reliability and reproducibility improvements, including a switched test runner entrypoint and pinned tool setup.

coderabbitai · 2026-04-23T17:04:26Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds a dedicated Playwright CI job and Bun-based test runner, orchestrates a Supabase-backed backend with readiness gating and retries, tightens Playwright CI config and concurrency, adds test IDs and E2E helpers, hardens multiple backend tests, and introduces a Stripe-emulator integration test suite.

Changes

Cohort / File(s)	Summary
CI workflow `\.github/workflows/tests.yml`	Add `test_playwright` job running frontend Playwright; pin `supabase/setup-cli` to a commit SHA (applied to `test_all` too); upload Playwright artifacts on failure.
Playwright entry & runner `package.json`, `scripts/run-playwright-tests.ts`	Switch `test:front` to `bun scripts/run-playwright-tests.ts`; add runner that kills stale backends, starts backend, polls readiness file, forwards signals, runs `bunx playwright test`, and propagates exit codes.
Backend orchestration for Playwright `scripts/serve-backend-playwright.ts`, `scripts/supabase-worktree.ts` (invoked)	Introduce health-check/status checks, start/retry/backoff loop for supabase, optional DB reset, functions health probe (`/functions/v1/ok`), readiness file writing, and robust child-process cleanup.
Playwright config `playwright.config.ts`	Make server reuse CI-aware, raise webServerTimeout on CI, bind to `127.0.0.1`, disable fullyParallel, use explicit `PLAYWRIGHT_WORKERS` (default 1), and increase action/navigation timeouts.
E2E tests & helpers `playwright/e2e/apikeys.spec.ts`, `playwright/e2e/register.spec.ts`	Add helpers for creating API keys and asserting protected-route redirects; use timestamped unique names; replace ad-hoc navigation/waits with helpers.
UI test hooks & types `src/components.d.ts`, `src/components/comp_def.ts`, `src/components/DataTable.vue`, `src/pages/ApiKeys.vue`	Add global `BuildSetupInvite` type; add optional `testId` to `TableAction`; add `addButtonTestId` prop and per-action `data-test` attributes; apply test IDs for API Keys add/delete.
Backend test hardening `tests/audit-logs.test.ts`, `tests/cron_stat_refresh_completion.test.ts`, `tests/queue_load.test.ts`	Switch audit-key creation to Supabase RPC, use `getEndpointUrl` for triggers, increase test timeouts, use per-run queue names, improve retry/error handling and explicit setup/teardown.
Stripe emulator tests `tests/stripe-emulator.test.ts`	Add new Vitest suite that starts a local Stripe emulator (dynamic ports, retries), stubs Supabase admin lookups, and validates subscription and one-time checkout/session flows.

Sequence Diagram

sequenceDiagram
    participant GH as "GitHub Actions"
    participant Runner as "CI runner"
    participant BunScript as "bun scripts/run-playwright-tests.ts"
    participant Backend as "serve-backend-playwright.ts"
    participant SupabaseCLI as "supabase/setup-cli"
    participant Functions as "Functions server"
    participant Playwright as "Playwright runner"

    GH->>Runner: start test_playwright job
    Runner->>BunScript: execute runner
    BunScript->>BunScript: remove old readiness file, kill stale backends
    BunScript->>Backend: spawn backend (env: PLAYWRIGHT_READY_FILE)
    Backend->>SupabaseCLI: query worktree/status & start supabase (retries/backoff)
    SupabaseCLI-->>Backend: return config (API_URL, ports, keys)
    Backend->>Functions: start functions serve
    Functions-->>Backend: respond /functions/v1/ok
    Backend->>BunScript: write readiness file
    BunScript->>BunScript: poll readiness file (<=360s)
    BunScript->>Playwright: spawn Playwright tests (SKIP_BACKEND_START=true)
    Playwright-->>BunScript: exit status
    BunScript->>Backend: send SIGTERM (cleanup)
    Backend->>SupabaseCLI: stop supabase
    BunScript->>Runner: propagate exit code/result

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

[codex] Support Stripe emulator credit top-ups #1882: Overlaps on Playwright backend orchestration and Playwright-runner/config changes.
[codex] Fix post-merge pricing CI and config hardening #1912: Related CI workflow change that pins supabase/setup-cli in GitHub Actions.
feat: rename cron jobs for plan and stats #1221: Related changes to cron trigger endpoint naming and tests referencing cron_stat_* routes.

Poem

🐇
I hopped through scripts at dawn's light,
Cleared readiness, kept signals tight,
Supabase hummed and tests took flight,
Playwright danced through day and night,
Carrots earned — a rabbit's delight!

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately summarizes the two main changes: adding Stripe emulator test coverage and integrating Playwright into CI workflows.
Description check	✅ Passed	The PR description includes a summary, motivation, business impact, and a detailed test plan with checked items, aligning well with the template structure.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch codex/stripe-emulator-playwright-ci

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

codspeed-hq · 2026-04-23T17:05:46Z

Merging this PR will not alter performance

✅ 28 untouched benchmarks

_{Comparing codex/stripe-emulator-playwright-ci (5f93d32) with main (33048cd)}

socket-security · 2026-04-23T21:04:03Z

Review the following changes in direct dependencies. Learn more about Socket for GitHub.

Diff	Package	Supply Chain Security	Vulnerability	Quality	Maintenance	License
	plausible-tracker@0.3.9
	simple-git-hooks@2.13.1
	unplugin-vue-macros@2.14.5
	unplugin-formkit@0.3.0
	vue-turnstile@1.0.11
	vitest@4.1.5
	vite-plugin-environment@1.1.3
	mime@4.1.0
	vue-chartjs@5.3.3
	pinia@3.0.4
	vite-plugin-vue-layouts@0.11.0
	vue-demi@0.14.10
	vite@8.0.8
	stripe@22.1.0
	vue-sonner@2.0.9
	tailwindcss@4.2.4
	vite-plugin-webfont-dl@3.12.0
	vite-plugin-devtools-json@1.0.0
	vite-plugin-vue-devtools@8.1.1
	semver@7.7.4
	vite-plugin-pwa@1.2.0
	unplugin-auto-import@21.0.0
	unplugin-icons@23.0.1
	pg@8.20.0
	zod@4.3.6
	jose@6.2.2
	typescript@6.0.2
	supabase@2.95.0
	vue-router@5.0.6
	unplugin-vue-components@32.0.0
	vue@3.5.32
	vue-tsc@3.2.6
	wrangler@4.84.1
See 3 more rows in the dashboard

View full report

coderabbitai

Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

tests/cron_stat_refresh_completion.test.ts (1)
63-105: ⚠️ Potential issue | 🟡 Minor

Use getEndpointUrl('/triggers/cron_stat_app') instead of BASE_URL for consistency with coding guidelines.

Lines 64 and 99 hit /triggers/cron_stat_app via BASE_URL. Per the coding guidelines ("Use getEndpointUrl(path) test helper to route to correct worker based on endpoint"), these should use getEndpointUrl('/triggers/cron_stat_app') instead. This is already the pattern in tests like tests/version-name-stats.test.ts (line 158) and tests/build_time_tracking.test.ts (line 234).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/cron_stat_refresh_completion.test.ts` around lines 63 - 105, Replace
direct BASE_URL usage with the test helper
getEndpointUrl('/triggers/cron_stat_app') for both fetch calls that create
firstResponse and secondResponse; update the two fetch invocations in the test
"updates app freshness immediately and only marks the org fresh after the last
pending app completes" to call fetch(getEndpointUrl('/triggers/cron_stat_app'),
{ ... }) so the requests follow the project's routing helper convention used
elsewhere (refer to the fetch calls that set firstResponse and secondResponse).

🧹 Nitpick comments (3)

src/pages/ApiKeys.vue (1)

335-338: Make the row delete selector unique.

Every delete button now renders the same data-test="delete-key", which becomes ambiguous as soon as the table has multiple keys. TableAction.testId already supports a function, so deriving it from the row keeps the selector deterministic.

💡 Suggested fix

       {
         icon: IconTrash,
         onClick: (key: Database['public']['Tables']['apikeys']['Row']) => deleteKey(key),
-        testId: 'delete-key',
+        testId: (key: Database['public']['Tables']['apikeys']['Row']) => `delete-key-${key.id}`,
       },

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@src/pages/ApiKeys.vue` around lines 335 - 338, The delete button's data-test
attribute is not unique because TableAction.testId is set to the same string for
every row; change the testId to a function that derives a deterministic unique
selector from the row (e.g., use the row's id) so each delete button is
distinct. Locate the TableAction entry that currently uses IconTrash and
onClick: (key) => deleteKey(key) and replace the static testId with a function
like (key) => `delete-key-${key.id}` (or another unique row field) so test
selectors are unambiguous.

scripts/run-playwright-tests.ts (1)

29-34: Add cwd to spawnSync for consistency with the backend script.

The matching function in scripts/serve-backend-playwright.ts (line 86-92) includes cwd: repoRoot. While pkill doesn't depend on the working directory, adding it maintains consistency and makes the behavior explicit.
🔧 Minor consistency improvement
 function stopExistingPlaywrightBackend() {
   spawnSync('pkill', ['-f', 'supabase-functions.playwright.env'], {
+    cwd: process.cwd(),
     stdio: 'ignore',
     env: process.env,
   })
 }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@scripts/run-playwright-tests.ts` around lines 29 - 34, In
stopExistingPlaywrightBackend(), add cwd: repoRoot to the options object passed
to spawnSync (alongside stdio and env) so the pkill invocation matches the
serve-backend-playwright.ts behavior; locate the spawnSync call inside function
stopExistingPlaywrightBackend and include cwd: repoRoot in that options literal
for consistency with the backend script.

.github/workflows/tests.yml (1)

202-233: Consider uploading Playwright artifacts on failure for easier CI debugging.

The job configuration looks solid with proper Supabase CLI pinning and browser installation. However, when Playwright tests fail in CI, having the HTML report and traces available would significantly ease debugging.

📦 Proposed addition: Upload Playwright artifacts

Add these steps after line 233:

      - name: Upload Playwright report
        uses: actions/upload-artifact@v4
        if: failure()
        with:
          name: playwright-report
          path: playwright-report/
          retention-days: 7

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.github/workflows/tests.yml around lines 202 - 233, The test_playwright job
doesn't upload Playwright artifacts on failure which hinders CI debugging; add
post-test steps after the "Run Playwright tests" step to upload the Playwright
HTML report and traces using actions/upload-artifact@v4 with if: failure(), e.g.
upload the playwright-report/ directory (and optionally playwright-report/traces
or traces/) with a descriptive artifact name like "playwright-report" and a
short retention-days value so failed-run reports and traces are available for
inspection.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@scripts/serve-backend-playwright.ts`:
- Around line 16-20: The Supabase health-check can misreport valid outputs
because the SupabaseStatus interface only lists API_URL, ANON_KEY, and
SERVICE_ROLE_KEY and the health-check logic only checks those exact keys; update
the interface SupabaseStatus to also include PUBLISHABLE_KEY and SECRET_KEY
(aliases for ANON_KEY and SERVICE_ROLE_KEY) and modify the health-check code
that validates ANON_KEY and SERVICE_ROLE_KEY (the check around lines ~74-75) to
accept either the primary keys or their aliases (use ANON_KEY || PUBLISHABLE_KEY
and SERVICE_ROLE_KEY || SECRET_KEY when determining health) so payloads using
alternate names are treated as healthy.

In `@tests/queue_load.test.ts`:
- Around line 133-141: The stress test's Promise.all burst can fail fast on
thrown fetch() exceptions because fetchQueueSync currently only retries non-202
responses; modify fetchQueueSync (the function invoked in
tests/queue_load.test.ts) to wrap its fetch calls in a try/catch and retry on
thrown errors using the same retry loop/logic used in
webhook-queue-processing.test.ts (the try/catch + backoff/retry pattern present
around lines 27-50), honoring the existing maxRetries argument (e.g., maxRetries
= 6) so transient network exceptions are retried instead of aborting the whole
test.

In `@tests/stripe-emulator.test.ts`:
- Line 78: The test's global variable emulator should be nullable so teardown
doesn't throw if setup failed: change the declaration of emulator
(Awaited<ReturnType<typeof createEmulator>>) to allow undefined/null, ensure
beforeAll assigns it as before, and modify the afterAll teardown to check
emulator before calling its close/stop method (e.g., if (emulator) await
emulator.close() or similar). Update any usages in tests to handle the nullable
type where needed.
- Around line 54-75: getFreePort currently closes the probe server and returns
the port, creating a TOCTOU where another process can take the port before the
emulator binds; change this by either (A) returning a still-bound server (keep
the server returned by createServer/listen open and expose its port so the
emulator reuses that listener) or (B) implement a retry-on-EADDRINUSE loop when
binding the emulator: preserve the probe server until the emulator binds or
catch EADDRINUSE and retry getFreePort/bind several times before failing. Update
the code paths referencing getFreePort (the createServer/listen logic) so the
probe socket is not closed before the emulator's listen call (or ensure robust
retries on port collision).
- Around line 4-5: ESLint import ordering fails because the 'emulate' import
must come before the 'vitest' named imports; reorder the top imports so the line
"import { createEmulator } from 'emulate'" appears before "import { afterAll,
afterEach, beforeAll, describe, expect, it, vi } from 'vitest'" to satisfy
perfectionist/sort-imports.

---

Outside diff comments:
In `@tests/cron_stat_refresh_completion.test.ts`:
- Around line 63-105: Replace direct BASE_URL usage with the test helper
getEndpointUrl('/triggers/cron_stat_app') for both fetch calls that create
firstResponse and secondResponse; update the two fetch invocations in the test
"updates app freshness immediately and only marks the org fresh after the last
pending app completes" to call fetch(getEndpointUrl('/triggers/cron_stat_app'),
{ ... }) so the requests follow the project's routing helper convention used
elsewhere (refer to the fetch calls that set firstResponse and secondResponse).

---

Nitpick comments:
In @.github/workflows/tests.yml:
- Around line 202-233: The test_playwright job doesn't upload Playwright
artifacts on failure which hinders CI debugging; add post-test steps after the
"Run Playwright tests" step to upload the Playwright HTML report and traces
using actions/upload-artifact@v4 with if: failure(), e.g. upload the
playwright-report/ directory (and optionally playwright-report/traces or
traces/) with a descriptive artifact name like "playwright-report" and a short
retention-days value so failed-run reports and traces are available for
inspection.

In `@scripts/run-playwright-tests.ts`:
- Around line 29-34: In stopExistingPlaywrightBackend(), add cwd: repoRoot to
the options object passed to spawnSync (alongside stdio and env) so the pkill
invocation matches the serve-backend-playwright.ts behavior; locate the
spawnSync call inside function stopExistingPlaywrightBackend and include cwd:
repoRoot in that options literal for consistency with the backend script.

In `@src/pages/ApiKeys.vue`:
- Around line 335-338: The delete button's data-test attribute is not unique
because TableAction.testId is set to the same string for every row; change the
testId to a function that derives a deterministic unique selector from the row
(e.g., use the row's id) so each delete button is distinct. Locate the
TableAction entry that currently uses IconTrash and onClick: (key) =>
deleteKey(key) and replace the static testId with a function like (key) =>
`delete-key-${key.id}` (or another unique row field) so test selectors are
unambiguous.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 7516dff2-16d3-469e-82d2-902e519e2c00

📥 Commits

Reviewing files that changed from the base of the PR and between 33048cd and 3f6f2b4.

📒 Files selected for processing (15)

.github/workflows/tests.yml
package.json
playwright.config.ts
playwright/e2e/apikeys.spec.ts
playwright/e2e/register.spec.ts
scripts/run-playwright-tests.ts
scripts/serve-backend-playwright.ts
src/components.d.ts
src/components/DataTable.vue
src/components/comp_def.ts
src/pages/ApiKeys.vue
tests/audit-logs.test.ts
tests/cron_stat_refresh_completion.test.ts
tests/queue_load.test.ts
tests/stripe-emulator.test.ts

coderabbitai

🧹 Nitpick comments (5)

playwright/e2e/apikeys.spec.ts (2)

39-41: Prefer resilient deletion verification over exact toast wording.

The row-removal assertion already validates behavior; exact success-copy matching is brittle for i18n/content updates.

Suggested change

-    const toast = page.locator('[data-test="toast"]')
-    await expect(toast).toContainText('API key has been successfully deleted')
+    await expect(page.locator('[data-test="toast"]')).toBeVisible()
     await expect(page.locator('tr', { hasText: keyName })).toHaveCount(0)

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@playwright/e2e/apikeys.spec.ts` around lines 39 - 41, The test currently
asserts an exact toast message which is brittle; replace the precise text check
on the toast (the locator stored in toast via
page.locator('[data-test="toast"]')) with a resilient presence/visibility
assertion (e.g., await expect(toast).toBeVisible() or toHaveCount(1)) and keep
the row-removal assertion (await expect(page.locator('tr', { hasText: keyName
})).toHaveCount(0)); update the assertion referencing toast and remove the
toContainText('API key has been successfully deleted') check.

9-9: Avoid hard-coded localized toast copy in this helper.

This text is translation-driven (toast.success(t('add-api-key')) in src/pages/ApiKeys.vue), so exact matching can cause avoidable CI flakes on locale/copy updates.

Suggested change

-  await expect(page.locator('[data-test="toast"]')).toContainText('Added new API key successfully')
+  await expect(page.locator('[data-test="toast"]')).toBeVisible()

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@playwright/e2e/apikeys.spec.ts` at line 9, The test currently asserts an
exact localized toast message which is brittle; instead, update the assertion on
page.locator('[data-test="toast"]') to avoid hard-coded translation text (see
ApiKeys.vue where toast.success(t('add-api-key')) is used). Replace the exact
toContainText('Added new API key successfully') check with a locale-agnostic
assertion such as verifying the toast is visible and contains non-empty text or
matches a stable pattern (e.g., success state/class plus presence of any text)
so the test no longer depends on the translated copy.

tests/queue_load.test.ts (2)

66-73: Consider it.concurrent() for tests that don't touch shared queue state.

The health check (line 66) and invalid request tests (line 79) don't interact with the shared queueName state and could run concurrently, per coding guidelines.
♻️ Example for health check
-  it('should handle queue consumer health check', async () => {
+  it.concurrent('should handle queue consumer health check', async () => {
As per coding guidelines: "Use it.concurrent() instead of it() when possible to run tests in parallel within the same file, maximizing parallelism for faster CI/CD"

Also applies to: 79-112
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/queue_load.test.ts` around lines 66 - 73, The health-check test "should
handle queue consumer health check" (and the other tests that don't touch shared
queueName state between lines 79-112, e.g., the invalid request tests) should be
converted from serial tests using it(...) to parallel tests using
it.concurrent(...); locate the test declarations by their titles in
tests/queue_load.test.ts and replace the it(...) calls with it.concurrent(...)
so they can run in parallel without altering test logic or shared state
handling.
15-19: Redundant DELETE statements on freshly created queue.

Since the queue is newly created with a unique name per run, the tables pgmq.q_${queueName} and pgmq.a_${queueName} are guaranteed empty. These DELETE statements can be removed.
♻️ Suggested simplification
 beforeAll(async () => {
   await pool.query('SELECT pgmq.create($1)', [queueName])
-  await pool.query(`DELETE FROM pgmq.q_${queueName}`)
-  await pool.query(`DELETE FROM pgmq.a_${queueName}`)
 })
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/queue_load.test.ts` around lines 15 - 19, The beforeAll setup in
tests/queue_load.test.ts redundantly issues DELETEs after creating a fresh
queue; remove the two pool.query calls that delete from `pgmq.q_${queueName}`
and `pgmq.a_${queueName}` so the block only calls `pool.query('SELECT
pgmq.create($1)', [queueName])` (leave `beforeAll`, `pool.query`, and
`queueName` unchanged).

.github/workflows/tests.yml (1)

202-244: Optional: extract duplicated CI bootstrap steps into a reusable unit.

test_all and test_playwright now duplicate cache/checkout/bun install/dependency setup and Supabase template linking. Consider a reusable workflow or composite action to keep these in sync.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.github/workflows/tests.yml around lines 202 - 244, The test_playwright job
duplicates CI bootstrap steps present in test_all (cache/checkout/setup
bun/install dependencies/install Supabase CLI/link templates); extract those
shared steps (e.g., the steps named "Cache Deno dependencies", "Checkout capgo",
"Setup bun", "Install dependencies", "Install Supabase CLI", "Link Supabase
templates") into a reusable workflow or composite action (invocable via
workflow_call or a composite action in .github/actions) and replace the
duplicated sequences in both jobs with a single call to that reusable unit,
leaving job-specific steps like "Install Playwright browser" and "Run Playwright
tests" in test_playwright so each job can still add or override steps as needed.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In @.github/workflows/tests.yml:
- Around line 202-244: The test_playwright job duplicates CI bootstrap steps
present in test_all (cache/checkout/setup bun/install dependencies/install
Supabase CLI/link templates); extract those shared steps (e.g., the steps named
"Cache Deno dependencies", "Checkout capgo", "Setup bun", "Install
dependencies", "Install Supabase CLI", "Link Supabase templates") into a
reusable workflow or composite action (invocable via workflow_call or a
composite action in .github/actions) and replace the duplicated sequences in
both jobs with a single call to that reusable unit, leaving job-specific steps
like "Install Playwright browser" and "Run Playwright tests" in test_playwright
so each job can still add or override steps as needed.

In `@playwright/e2e/apikeys.spec.ts`:
- Around line 39-41: The test currently asserts an exact toast message which is
brittle; replace the precise text check on the toast (the locator stored in
toast via page.locator('[data-test="toast"]')) with a resilient
presence/visibility assertion (e.g., await expect(toast).toBeVisible() or
toHaveCount(1)) and keep the row-removal assertion (await
expect(page.locator('tr', { hasText: keyName })).toHaveCount(0)); update the
assertion referencing toast and remove the toContainText('API key has been
successfully deleted') check.
- Line 9: The test currently asserts an exact localized toast message which is
brittle; instead, update the assertion on page.locator('[data-test="toast"]') to
avoid hard-coded translation text (see ApiKeys.vue where
toast.success(t('add-api-key')) is used). Replace the exact toContainText('Added
new API key successfully') check with a locale-agnostic assertion such as
verifying the toast is visible and contains non-empty text or matches a stable
pattern (e.g., success state/class plus presence of any text) so the test no
longer depends on the translated copy.

In `@tests/queue_load.test.ts`:
- Around line 66-73: The health-check test "should handle queue consumer health
check" (and the other tests that don't touch shared queueName state between
lines 79-112, e.g., the invalid request tests) should be converted from serial
tests using it(...) to parallel tests using it.concurrent(...); locate the test
declarations by their titles in tests/queue_load.test.ts and replace the it(...)
calls with it.concurrent(...) so they can run in parallel without altering test
logic or shared state handling.
- Around line 15-19: The beforeAll setup in tests/queue_load.test.ts redundantly
issues DELETEs after creating a fresh queue; remove the two pool.query calls
that delete from `pgmq.q_${queueName}` and `pgmq.a_${queueName}` so the block
only calls `pool.query('SELECT pgmq.create($1)', [queueName])` (leave
`beforeAll`, `pool.query`, and `queueName` unchanged).

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: d42ed338-b8fc-43c8-ba0e-65dc02192e5f

📥 Commits

Reviewing files that changed from the base of the PR and between b4d66b9 and dee1421.

📒 Files selected for processing (8)

.github/workflows/tests.yml
playwright/e2e/apikeys.spec.ts
scripts/run-playwright-tests.ts
scripts/serve-backend-playwright.ts
src/pages/ApiKeys.vue
tests/cron_stat_refresh_completion.test.ts
tests/queue_load.test.ts
tests/stripe-emulator.test.ts

🚧 Files skipped from review as they are similar to previous changes (4)

src/pages/ApiKeys.vue
tests/stripe-emulator.test.ts
scripts/run-playwright-tests.ts
scripts/serve-backend-playwright.ts

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: dee1421f81

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ffc11c2633

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

sonarqubecloud · 2026-04-24T08:32:46Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5f93d3200b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-04-24T08:35:02Z

+  while (Date.now() - startedAt < timeoutMs) {
+    try {
+      const response = await fetch(targetUrl)
+      if (response.ok)
+        return


Fail fast when functions serve exits before readiness

This readiness loop only polls targetUrl and never checks whether the spawned functions serve process has already exited. If that child dies early (for example from a bad env file or port conflict), the script waits the full PLAYWRIGHT_BACKEND_TIMEOUT_MS (default 360000 ms) before failing, which adds long false hangs to CI/local failures. Check the child exit/signal state inside the loop and throw immediately when it is no longer running.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-24T08:35:02Z

+try {
+  await waitForFunctionsReady(functionsReadyTimeoutMs)
+  childStarted = true


Register signal forwarding before waiting for readiness

The script waits for backend readiness before installing SIGINT/SIGTERM forwarding, so cancellation during startup can terminate this wrapper without propagating cleanup to the spawned functions serve child. That leaves stale edge-runtime processes bound to ports and can break subsequent runs in the same environment. Move signal handler registration to immediately after spawn (or wrap startup in guaranteed child cleanup on signal).

Useful? React with 👍 / 👎.

test(ci): run stripe emulator coverage and playwright

40a522f

test(ci): stabilize playwright and stripe emulator coverage

2176d5d

riderx added 2 commits April 23, 2026 23:39

test(playwright): isolate onboarding redirect assertion

f0d70e6

test(backend): stabilize audit log apikey setup

3f6f2b4

riderx marked this pull request as ready for review April 23, 2026 21:52

ci: pin supabase setup action

b4d66b9

coderabbitai Bot reviewed Apr 23, 2026

View reviewed changes

Comment thread scripts/serve-backend-playwright.ts

Comment thread tests/queue_load.test.ts

Comment thread tests/stripe-emulator.test.ts Outdated

Comment thread tests/stripe-emulator.test.ts

Comment thread tests/stripe-emulator.test.ts Outdated

fix(ci): address pr review follow-ups

dee1421

coderabbitai Bot reviewed Apr 24, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Apr 24, 2026

View reviewed changes

Comment thread scripts/serve-backend-playwright.ts Outdated

Comment thread scripts/serve-backend-playwright.ts

riderx added 2 commits April 24, 2026 09:52

test(backend): streamline queue load checks

547bef0

fix(ci): harden playwright backend startup

ffc11c2

chatgpt-codex-connector Bot reviewed Apr 24, 2026

View reviewed changes

Comment thread scripts/run-playwright-tests.ts Outdated

Comment thread scripts/run-playwright-tests.ts

fix(ci): handle signaled playwright exits

5f93d32

riderx merged commit 57f6829 into main Apr 24, 2026
16 checks passed

riderx deleted the codex/stripe-emulator-playwright-ci branch April 24, 2026 08:34

chatgpt-codex-connector Bot reviewed Apr 24, 2026

View reviewed changes

Uh oh!

Conversation

riderx commented Apr 23, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary (AI generated)

Motivation (AI generated)

Business Impact (AI generated)

Test Plan (AI generated)

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (1 warning)

Uh oh!

codspeed-hq Bot commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will not alter performance

Uh oh!

socket-security Bot commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud Bot commented Apr 24, 2026

Quality Gate passed

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Apr 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

riderx commented Apr 23, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Apr 23, 2026 •

edited

Loading

codspeed-hq Bot commented Apr 23, 2026 •

edited

Loading

socket-security Bot commented Apr 23, 2026 •

edited

Loading