Skip to content

fix: surface 429 rate-limit errors in e2e tests and CLI#1309

Merged
TooTallNate merged 3 commits into
mainfrom
fix/e2e-error-surfacing
Mar 10, 2026
Merged

fix: surface 429 rate-limit errors in e2e tests and CLI#1309
TooTallNate merged 3 commits into
mainfrom
fix/e2e-error-surfacing

Conversation

@TooTallNate
Copy link
Copy Markdown
Member

Summary

  • Fix e2e test suite and CLI to properly surface HTTP errors (especially 429 rate limits) from the encryption key fetch endpoint, instead of producing confusing, unrelated-looking test failures

Problem

When the Vercel run-key endpoint returns a 429, the test failures show misleading assertion errors like expected 133, received "🔒 Encrypted" with no indication of the actual cause. This is due to 4 stacked issues:

  1. awaitCommand() ignores exit codes (packages/core/e2e/utils.ts) — always resolves even when the CLI exits with code 1
  2. cliInspectJson() falls back to {} on empty stdout — silently parses to an empty object
  3. maybeDecryptFields() swallows HTTP errors (packages/cli/src/lib/inspect/hydration.ts) — catches 429 errors and silently falls back to encrypted placeholders
  4. fetchRunKey() produces generic error messages (packages/world-vercel/src/encryption.ts) — drops the response body and status text

Changes

File Change
packages/core/e2e/utils.ts awaitCommand() now rejects on non-zero exit codes with stderr/stdout in the error. cliInspectJson()/cliHealthJson() throw on empty stdout.
packages/cli/src/lib/inspect/hydration.ts maybeDecryptFields() re-throws HTTP errors instead of silently falling back to encrypted placeholders
packages/cli/src/lib/inspect/output.ts Added 429: 'Too Many Requests' to getStatusText()
packages/world-vercel/src/encryption.ts fetchRunKey() includes response body and status text in error messages

Multiple layered issues caused encryption key 429 errors to produce
confusing, unrelated-looking test failures:

- awaitCommand() in e2e utils now rejects on non-zero exit codes
  instead of silently resolving (the most critical fix)
- cliInspectJson/cliHealthJson throw on empty stdout instead of
  falling back to '{}'
- maybeDecryptFields re-throws HTTP errors instead of silently
  falling back to encrypted placeholders
- fetchRunKey includes response body and status text in errors
- Added 429 to CLI status text map
Copilot AI review requested due to automatic review settings March 9, 2026 22:47
@TooTallNate TooTallNate requested a review from a team as a code owner March 9, 2026 22:47
@vercel
Copy link
Copy Markdown
Contributor

vercel Bot commented Mar 9, 2026

@changeset-bot
Copy link
Copy Markdown

changeset-bot Bot commented Mar 9, 2026

🦋 Changeset detected

Latest commit: 36bd024

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 16 packages
Name Type
@workflow/cli Patch
@workflow/world-vercel Patch
workflow Patch
@workflow/world-testing Patch
@workflow/core Patch
@workflow/builders Patch
@workflow/next Patch
@workflow/nitro Patch
@workflow/vitest Patch
@workflow/web-shared Patch
@workflow/astro Patch
@workflow/nest Patch
@workflow/rollup Patch
@workflow/sveltekit Patch
@workflow/vite Patch
@workflow/nuxt Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Mar 9, 2026

🧪 E2E Test Results

Some tests failed

Summary

Passed Failed Skipped Total
✅ ▲ Vercel Production 571 0 67 638
✅ 💻 Local Development 612 0 84 696
✅ 📦 Local Production 612 0 84 696
✅ 🐘 Local Postgres 612 0 84 696
✅ 🪟 Windows 55 0 3 58
❌ 🌍 Community Worlds 118 56 15 189
✅ 📋 Other 147 0 27 174
Total 2727 56 364 3147

❌ Failed Tests

🌍 Community Worlds (56 failed)

mongodb (3 failed):

  • hookWorkflow is not resumable via public webhook endpoint
  • webhookWorkflow
  • concurrent hook token conflict - two workflows cannot use the same hook token simultaneously

redis (2 failed):

  • hookWorkflow is not resumable via public webhook endpoint
  • concurrent hook token conflict - two workflows cannot use the same hook token simultaneously

turso (51 failed):

  • addTenWorkflow
  • addTenWorkflow
  • wellKnownAgentWorkflow (.well-known/agent)
  • should work with react rendering in step
  • promiseAllWorkflow
  • promiseRaceWorkflow
  • promiseAnyWorkflow
  • importedStepOnlyWorkflow
  • hookWorkflow
  • hookWorkflow is not resumable via public webhook endpoint
  • webhookWorkflow
  • sleepingWorkflow
  • parallelSleepWorkflow
  • nullByteWorkflow
  • workflowAndStepMetadataWorkflow
  • fetchWorkflow
  • promiseRaceStressTestWorkflow
  • error handling error propagation workflow errors nested function calls preserve message and stack trace
  • error handling error propagation workflow errors cross-file imports preserve message and stack trace
  • error handling error propagation step errors basic step error preserves message and stack trace
  • error handling error propagation step errors cross-file step error preserves message and function names in stack
  • error handling retry behavior regular Error retries until success
  • error handling retry behavior FatalError fails immediately without retries
  • error handling retry behavior RetryableError respects custom retryAfter delay
  • error handling retry behavior maxRetries=0 disables retries
  • error handling retry behavior workflow completes despite transient 5xx on step_completed
  • error handling catchability FatalError can be caught and detected with FatalError.is()
  • hookCleanupTestWorkflow - hook token reuse after workflow completion
  • concurrent hook token conflict - two workflows cannot use the same hook token simultaneously
  • hookDisposeTestWorkflow - hook token reuse after explicit disposal while workflow still running
  • stepFunctionPassingWorkflow - step function references can be passed as arguments (without closure vars)
  • stepFunctionWithClosureWorkflow - step function with closure variables passed as argument
  • closureVariableWorkflow - nested step functions with closure variables
  • spawnWorkflowFromStepWorkflow - spawning a child workflow using start() inside a step
  • health check (queue-based) - workflow and step endpoints respond to health check messages
  • pathsAliasWorkflow - TypeScript path aliases resolve correctly
  • Calculator.calculate - static workflow method using static step methods from another class
  • AllInOneService.processNumber - static workflow method using sibling static step methods
  • ChainableService.processWithThis - static step methods using this to reference the class
  • thisSerializationWorkflow - step function invoked with .call() and .apply()
  • customSerializationWorkflow - custom class serialization with WORKFLOW_SERIALIZE/WORKFLOW_DESERIALIZE
  • instanceMethodStepWorkflow - instance methods with "use step" directive
  • crossContextSerdeWorkflow - classes defined in step code are deserializable in workflow context
  • stepFunctionAsStartArgWorkflow - step function reference passed as start() argument
  • cancelRun - cancelling a running workflow
  • cancelRun via CLI - cancelling a running workflow
  • pages router addTenWorkflow via pages router
  • pages router promiseAllWorkflow via pages router
  • pages router sleepingWorkflow via pages router
  • hookWithSleepWorkflow - hook payloads delivered correctly with concurrent sleep
  • sleepWithSequentialStepsWorkflow - sequential steps work with concurrent sleep (control)

Details by Category

✅ ▲ Vercel Production
App Passed Failed Skipped
✅ astro 51 0 7
✅ example 51 0 7
✅ express 51 0 7
✅ fastify 51 0 7
✅ hono 51 0 7
✅ nextjs-turbopack 56 0 2
✅ nextjs-webpack 56 0 2
✅ nitro 51 0 7
✅ nuxt 51 0 7
✅ sveltekit 51 0 7
✅ vite 51 0 7
✅ 💻 Local Development
App Passed Failed Skipped
✅ astro-stable 49 0 9
✅ express-stable 49 0 9
✅ fastify-stable 49 0 9
✅ hono-stable 49 0 9
✅ nextjs-turbopack-canary 55 0 3
✅ nextjs-turbopack-stable 55 0 3
✅ nextjs-webpack-canary 55 0 3
✅ nextjs-webpack-stable 55 0 3
✅ nitro-stable 49 0 9
✅ nuxt-stable 49 0 9
✅ sveltekit-stable 49 0 9
✅ vite-stable 49 0 9
✅ 📦 Local Production
App Passed Failed Skipped
✅ astro-stable 49 0 9
✅ express-stable 49 0 9
✅ fastify-stable 49 0 9
✅ hono-stable 49 0 9
✅ nextjs-turbopack-canary 55 0 3
✅ nextjs-turbopack-stable 55 0 3
✅ nextjs-webpack-canary 55 0 3
✅ nextjs-webpack-stable 55 0 3
✅ nitro-stable 49 0 9
✅ nuxt-stable 49 0 9
✅ sveltekit-stable 49 0 9
✅ vite-stable 49 0 9
✅ 🐘 Local Postgres
App Passed Failed Skipped
✅ astro-stable 49 0 9
✅ express-stable 49 0 9
✅ fastify-stable 49 0 9
✅ hono-stable 49 0 9
✅ nextjs-turbopack-canary 55 0 3
✅ nextjs-turbopack-stable 55 0 3
✅ nextjs-webpack-canary 55 0 3
✅ nextjs-webpack-stable 55 0 3
✅ nitro-stable 49 0 9
✅ nuxt-stable 49 0 9
✅ sveltekit-stable 49 0 9
✅ vite-stable 49 0 9
✅ 🪟 Windows
App Passed Failed Skipped
✅ nextjs-turbopack 55 0 3
❌ 🌍 Community Worlds
App Passed Failed Skipped
✅ mongodb-dev 3 0 2
❌ mongodb 52 3 3
✅ redis-dev 3 0 2
❌ redis 53 2 3
✅ turso-dev 3 0 2
❌ turso 4 51 3
✅ 📋 Other
App Passed Failed Skipped
✅ e2e-local-dev-nest-stable 49 0 9
✅ e2e-local-postgres-nest-stable 49 0 9
✅ e2e-local-prod-nest-stable 49 0 9

📋 View full workflow run

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Mar 9, 2026

📊 Benchmark Results

📈 Comparing against baseline from main branch. Green 🟢 = faster, Red 🔺 = slower.

workflow with no steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
💻 Local 🥇 Express 0.031s (-4.7%) 1.006s (~) 0.975s 10 1.00x
💻 Local Nitro 0.033s (-7.7% 🟢) 1.006s (~) 0.973s 10 1.06x
💻 Local Next.js (Turbopack) 0.038s 1.006s 0.967s 10 1.25x
🌐 Redis Next.js (Turbopack) 0.043s 1.005s 0.962s 10 1.41x
🐘 Postgres Next.js (Turbopack) 0.049s 1.011s 0.962s 10 1.61x
🐘 Postgres Nitro 0.055s (+0.7%) 1.014s (~) 0.959s 10 1.79x
🐘 Postgres Express 0.060s (+9.3% 🔺) 1.012s (~) 0.953s 10 1.95x
🌐 MongoDB Next.js (Turbopack) 0.098s 1.009s 0.911s 10 3.21x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Express 0.438s (-9.2% 🟢) 1.848s (-24.5% 🟢) 1.410s 10 1.00x
▲ Vercel Nitro 0.497s (-4.6%) 2.266s (+1.4%) 1.769s 10 1.13x
▲ Vercel Next.js (Turbopack) 0.546s (-13.6% 🟢) 2.440s (+4.2%) 1.894s 10 1.25x

🔍 Observability: Express | Nitro | Next.js (Turbopack)

workflow with 1 step

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
💻 Local 🥇 Express 1.084s (-1.8%) 2.006s (~) 0.923s 10 1.00x
💻 Local Next.js (Turbopack) 1.105s 2.006s 0.902s 10 1.02x
💻 Local Nitro 1.105s (~) 2.007s (~) 0.902s 10 1.02x
🌐 Redis Next.js (Turbopack) 1.105s 2.006s 0.901s 10 1.02x
🐘 Postgres Next.js (Turbopack) 1.129s 2.013s 0.884s 10 1.04x
🐘 Postgres Express 1.135s (+0.6%) 2.012s (~) 0.877s 10 1.05x
🐘 Postgres Nitro 1.138s (-1.5%) 2.012s (~) 0.875s 10 1.05x
🌐 MongoDB Next.js (Turbopack) 1.318s 2.009s 0.691s 10 1.22x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Nitro 1.980s (-6.1% 🟢) 3.449s (-9.2% 🟢) 1.469s 10 1.00x
▲ Vercel Next.js (Turbopack) 2.110s (~) 3.587s (+2.3%) 1.476s 10 1.07x
▲ Vercel Express 2.132s (+5.6% 🔺) 3.277s (-10.6% 🟢) 1.145s 10 1.08x

🔍 Observability: Nitro | Next.js (Turbopack) | Express

workflow with 10 sequential steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
💻 Local 🥇 Express 10.574s (-1.9%) 11.023s (~) 0.449s 3 1.00x
💻 Local Next.js (Turbopack) 10.687s 11.022s 0.336s 3 1.01x
🌐 Redis Next.js (Turbopack) 10.716s 11.024s 0.308s 3 1.01x
💻 Local Nitro 10.787s (~) 11.024s (~) 0.237s 3 1.02x
🐘 Postgres Nitro 10.837s (-0.5%) 11.045s (~) 0.208s 3 1.02x
🐘 Postgres Next.js (Turbopack) 10.844s 11.042s 0.198s 3 1.03x
🐘 Postgres Express 10.889s (+0.7%) 11.042s (~) 0.153s 3 1.03x
🌐 MongoDB Next.js (Turbopack) 12.157s 13.016s 0.858s 3 1.15x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Express 16.137s (-3.6%) 16.972s (-8.7% 🟢) 0.835s 2 1.00x
▲ Vercel Nitro 16.608s (-1.0%) 18.012s (~) 1.404s 2 1.03x
▲ Vercel Next.js (Turbopack) 17.382s (+2.2%) 20.044s (+10.7% 🔺) 2.662s 2 1.08x

🔍 Observability: Express | Nitro | Next.js (Turbopack)

workflow with 25 sequential steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🌐 Redis 🥇 Next.js (Turbopack) 26.508s 27.052s 0.545s 3 1.00x
💻 Local Express 26.845s (-1.2%) 27.060s (-3.5%) 0.215s 3 1.01x
🐘 Postgres Next.js (Turbopack) 26.900s 27.068s 0.169s 3 1.01x
💻 Local Next.js (Turbopack) 26.970s 27.052s 0.082s 3 1.02x
🐘 Postgres Nitro 26.980s (~) 27.396s (+1.2%) 0.416s 3 1.02x
🐘 Postgres Express 27.097s (~) 27.733s (+2.5%) 0.636s 3 1.02x
💻 Local Nitro 27.241s (~) 28.054s (~) 0.813s 3 1.03x
🌐 MongoDB Next.js (Turbopack) 30.397s 31.038s 0.641s 2 1.15x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Nitro 43.085s (-1.1%) 44.262s (-1.8%) 1.177s 2 1.00x
▲ Vercel Express 43.586s (~) 45.092s (~) 1.506s 2 1.01x
▲ Vercel Next.js (Turbopack) 43.749s (~) 45.346s (~) 1.596s 2 1.02x

🔍 Observability: Nitro | Express | Next.js (Turbopack)

workflow with 50 sequential steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🌐 Redis 🥇 Next.js (Turbopack) 52.981s 53.096s 0.115s 2 1.00x
🐘 Postgres Next.js (Turbopack) 53.566s 54.106s 0.540s 2 1.01x
🐘 Postgres Nitro 53.872s (~) 54.093s (~) 0.221s 2 1.02x
🐘 Postgres Express 54.078s (~) 54.105s (~) 0.027s 2 1.02x
💻 Local Express 55.218s (-1.5%) 56.105s (~) 0.887s 2 1.04x
💻 Local Next.js (Turbopack) 55.660s 56.103s 0.443s 2 1.05x
💻 Local Nitro 56.297s (~) 57.106s (~) 0.809s 2 1.06x
🌐 MongoDB Next.js (Turbopack) 60.638s 61.060s 0.422s 2 1.14x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Next.js (Turbopack) 95.771s (-76.2% 🟢) 97.604s (-75.9% 🟢) 1.833s 1 1.00x
▲ Vercel Nitro 98.161s (+2.0%) 99.279s (+1.8%) 1.118s 1 1.02x
▲ Vercel Express 98.897s (+3.5%) 100.172s (+3.3%) 1.275s 1 1.03x

🔍 Observability: Next.js (Turbopack) | Nitro | Express

Promise.all with 10 concurrent steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🌐 Redis 🥇 Next.js (Turbopack) 1.271s 2.006s 0.735s 15 1.00x
🐘 Postgres Nitro 1.351s (-1.9%) 2.012s (~) 0.661s 15 1.06x
🐘 Postgres Express 1.382s (~) 2.013s (~) 0.631s 15 1.09x
💻 Local Express 1.401s (-0.8%) 2.007s (~) 0.605s 15 1.10x
💻 Local Nitro 1.416s (-1.2%) 2.005s (~) 0.590s 15 1.11x
🐘 Postgres Next.js (Turbopack) 1.418s 2.012s 0.595s 15 1.12x
💻 Local Next.js (Turbopack) 1.419s 2.006s 0.587s 15 1.12x
🌐 MongoDB Next.js (Turbopack) 2.132s 3.008s 0.876s 10 1.68x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Express 2.372s (+1.9%) 3.242s (-13.7% 🟢) 0.870s 10 1.00x
▲ Vercel Next.js (Turbopack) 2.601s (-4.3%) 4.046s (+0.9%) 1.445s 8 1.10x
▲ Vercel Nitro 2.797s (+11.4% 🔺) 4.238s (+11.2% 🔺) 1.441s 8 1.18x

🔍 Observability: Express | Next.js (Turbopack) | Nitro

Promise.all with 25 concurrent steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🐘 Postgres 🥇 Next.js (Turbopack) 1.952s 2.514s 0.562s 12 1.00x
🐘 Postgres Nitro 1.993s (-5.6% 🟢) 2.515s (-3.3%) 0.522s 12 1.02x
🐘 Postgres Express 2.048s (+1.7%) 2.601s (~) 0.553s 12 1.05x
💻 Local Express 2.476s (-3.2%) 3.008s (~) 0.532s 10 1.27x
🌐 Redis Next.js (Turbopack) 2.541s 3.008s 0.468s 10 1.30x
💻 Local Nitro 2.635s (-0.9%) 3.007s (~) 0.373s 10 1.35x
💻 Local Next.js (Turbopack) 2.659s 3.008s 0.349s 10 1.36x
🌐 MongoDB Next.js (Turbopack) 4.671s 5.178s 0.507s 6 2.39x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Express 2.542s (-7.3% 🟢) 3.453s (-14.2% 🟢) 0.911s 9 1.00x
▲ Vercel Nitro 2.593s (-10.9% 🟢) 3.760s (-8.9% 🟢) 1.167s 8 1.02x
▲ Vercel Next.js (Turbopack) 2.876s (+3.1%) 4.317s (+10.7% 🔺) 1.441s 7 1.13x

🔍 Observability: Express | Nitro | Next.js (Turbopack)

Promise.all with 50 concurrent steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🐘 Postgres 🥇 Express 3.642s (-2.2%) 4.450s (+3.1%) 0.808s 7 1.00x
🐘 Postgres Next.js (Turbopack) 3.644s 4.310s 0.666s 7 1.00x
🐘 Postgres Nitro 3.777s (~) 4.595s (+3.0%) 0.818s 7 1.04x
🌐 Redis Next.js (Turbopack) 4.050s 4.725s 0.675s 7 1.11x
💻 Local Express 6.923s (-9.9% 🟢) 7.515s (-6.3% 🟢) 0.592s 4 1.90x
💻 Local Next.js (Turbopack) 7.234s 7.765s 0.531s 4 1.99x
💻 Local Nitro 7.790s (+2.6%) 8.023s (~) 0.233s 4 2.14x
🌐 MongoDB Next.js (Turbopack) 9.954s 10.350s 0.396s 3 2.73x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Express 2.940s (+3.2%) 4.303s (+11.7% 🔺) 1.362s 7 1.00x
▲ Vercel Next.js (Turbopack) 3.251s (+14.1% 🔺) 4.603s (+16.6% 🔺) 1.352s 7 1.11x
▲ Vercel Nitro 3.439s (+18.5% 🔺) 4.499s (+4.2%) 1.060s 7 1.17x

🔍 Observability: Express | Next.js (Turbopack) | Nitro

Promise.race with 10 concurrent steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🌐 Redis 🥇 Next.js (Turbopack) 1.249s 2.006s 0.757s 15 1.00x
💻 Local Express 1.374s (-6.2% 🟢) 2.006s (~) 0.631s 15 1.10x
🐘 Postgres Next.js (Turbopack) 1.395s 2.012s 0.617s 15 1.12x
🐘 Postgres Express 1.403s (~) 2.013s (~) 0.610s 15 1.12x
🐘 Postgres Nitro 1.413s (+0.9%) 2.012s (~) 0.599s 15 1.13x
💻 Local Nitro 1.425s (-0.9%) 2.005s (~) 0.580s 15 1.14x
💻 Local Next.js (Turbopack) 1.445s 2.005s 0.560s 15 1.16x
🌐 MongoDB Next.js (Turbopack) 2.176s 3.008s 0.831s 10 1.74x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Nitro 2.062s (-11.9% 🟢) 3.485s (-11.5% 🟢) 1.423s 9 1.00x
▲ Vercel Express 2.150s (-2.6%) 3.177s (-13.4% 🟢) 1.028s 10 1.04x
▲ Vercel Next.js (Turbopack) 2.385s (+13.5% 🔺) 3.976s (+15.9% 🔺) 1.591s 8 1.16x

🔍 Observability: Nitro | Express | Next.js (Turbopack)

Promise.race with 25 concurrent steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🐘 Postgres 🥇 Nitro 2.055s (-1.3%) 2.517s (-6.2% 🟢) 0.462s 12 1.00x
🐘 Postgres Next.js (Turbopack) 2.099s 2.514s 0.415s 12 1.02x
🐘 Postgres Express 2.177s (-2.4%) 2.741s (+2.2%) 0.564s 11 1.06x
🌐 Redis Next.js (Turbopack) 2.501s 3.008s 0.507s 10 1.22x
💻 Local Express 2.515s (-8.0% 🟢) 3.008s (~) 0.493s 10 1.22x
💻 Local Next.js (Turbopack) 2.746s 3.008s 0.262s 10 1.34x
💻 Local Nitro 2.770s (~) 3.010s (~) 0.240s 10 1.35x
🌐 MongoDB Next.js (Turbopack) 4.670s 5.176s 0.506s 6 2.27x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Express 2.165s (-3.3%) 3.121s (-11.8% 🟢) 0.955s 10 1.00x
▲ Vercel Next.js (Turbopack) 2.399s (+0.9%) 3.780s (+10.0% 🔺) 1.381s 8 1.11x
▲ Vercel Nitro 2.592s (+10.5% 🔺) 3.786s (+2.1%) 1.194s 8 1.20x

🔍 Observability: Express | Next.js (Turbopack) | Nitro

Promise.race with 50 concurrent steps

💻 Local Development

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
🐘 Postgres 🥇 Nitro 3.900s (+3.1%) 4.596s (~) 0.696s 7 1.00x
🐘 Postgres Express 3.939s (+5.1% 🔺) 5.022s (+12.9% 🔺) 1.083s 6 1.01x
🌐 Redis Next.js (Turbopack) 4.016s 4.582s 0.566s 7 1.03x
🐘 Postgres Next.js (Turbopack) 4.247s 5.024s 0.777s 7 1.09x
💻 Local Express 7.774s (-5.6% 🟢) 8.266s (-8.4% 🟢) 0.492s 4 1.99x
💻 Local Next.js (Turbopack) 8.094s 8.770s 0.676s 4 2.08x
💻 Local Nitro 8.369s (+1.1%) 9.022s (~) 0.653s 4 2.15x
🌐 MongoDB Next.js (Turbopack) 9.850s 10.349s 0.499s 3 2.53x

▲ Production (Vercel)

World Framework Workflow Time Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Nitro 3.157s (+2.2%) 4.149s (-4.8%) 0.993s 8 1.00x
▲ Vercel Express 3.215s (+8.3% 🔺) 4.046s (-6.6% 🟢) 0.831s 8 1.02x
▲ Vercel Next.js (Turbopack) 3.384s (+18.8% 🔺) 5.032s (+25.1% 🔺) 1.649s 6 1.07x

🔍 Observability: Nitro | Express | Next.js (Turbopack)

Stream Benchmarks (includes TTFB metrics)
workflow with stream

💻 Local Development

World Framework Workflow Time TTFB Slurp Wall Time Overhead Samples vs Fastest
💻 Local 🥇 Express 0.113s (-32.8% 🟢) 1.002s (~) 0.011s (-7.0% 🟢) 1.016s (~) 0.904s 10 1.00x
🌐 Redis Next.js (Turbopack) 0.149s 1.000s 0.002s 1.007s 0.859s 10 1.32x
💻 Local Next.js (Turbopack) 0.149s 1.001s 0.012s 1.018s 0.868s 10 1.33x
💻 Local Nitro 0.174s (+0.9%) 1.003s (~) 0.013s (+9.4% 🔺) 1.019s (~) 0.844s 10 1.55x
🐘 Postgres Next.js (Turbopack) 0.187s 1.001s 0.002s 1.015s 0.828s 10 1.66x
🐘 Postgres Nitro 0.198s (-2.2%) 0.991s (~) 0.001s (-13.3% 🟢) 1.012s (~) 0.814s 10 1.76x
🐘 Postgres Express 0.202s (+8.1% 🔺) 0.998s (~) 0.002s (+15.4% 🔺) 1.013s (~) 0.811s 10 1.79x
🌐 MongoDB Next.js (Turbopack) 0.467s 0.975s 0.001s 1.009s 0.542s 10 4.15x

▲ Production (Vercel)

World Framework Workflow Time TTFB Slurp Wall Time Overhead Samples vs Fastest
▲ Vercel 🥇 Next.js (Turbopack) 1.552s (-9.0% 🟢) 2.021s (-26.2% 🟢) 0.005s (-98.3% 🟢) 2.506s (-27.5% 🟢) 0.953s 10 1.00x
▲ Vercel Express 1.555s (-4.3%) 2.296s (-9.5% 🟢) 0.005s (-19.3% 🟢) 2.640s (-11.8% 🟢) 1.085s 10 1.00x
▲ Vercel Nitro 1.976s (+25.2% 🔺) 2.676s (+3.7%) 0.004s (-12.2% 🟢) 3.263s (+7.3% 🔺) 1.287s 10 1.27x

🔍 Observability: Next.js (Turbopack) | Express | Nitro

Summary

Fastest Framework by World

Winner determined by most benchmark wins

World 🥇 Fastest Framework Wins
💻 Local Express 12/12
🐘 Postgres Next.js (Turbopack) 7/12
▲ Vercel Express 6/12
Fastest World by Framework

Winner determined by most benchmark wins

Framework 🥇 Fastest World Wins
Express 💻 Local 6/12
Next.js (Turbopack) 🌐 Redis 5/12
Nitro 🐘 Postgres 6/12
Column Definitions
  • Workflow Time: Runtime reported by workflow (completedAt - createdAt) - primary metric
  • TTFB: Time to First Byte - time from workflow start until first stream byte received (stream benchmarks only)
  • Slurp: Time from first byte to complete stream consumption (stream benchmarks only)
  • Wall Time: Total testbench time (trigger workflow + poll for result)
  • Overhead: Testbench overhead (Wall Time - Workflow Time)
  • Samples: Number of benchmark iterations run
  • vs Fastest: How much slower compared to the fastest configuration for this benchmark

Worlds:

  • 💻 Local: In-memory filesystem world (local development)
  • 🐘 Postgres: PostgreSQL database world (local development)
  • ▲ Vercel: Vercel production/preview deployment
  • 🌐 Turso: Community world (local development)
  • 🌐 MongoDB: Community world (local development)
  • 🌐 Redis: Community world (local development)
  • 🌐 Jazz: Community world (local development)

📋 View full workflow run

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR improves error surfacing for HTTP failures (especially 429 rate-limit errors) across the e2e test suite, CLI hydration layer, and Vercel encryption key fetch. Previously, rate-limit responses would produce confusing, unrelated test failures with misleading assertion errors. The fix addresses four stacked issues: silent exit code handling in tests, empty stdout fallback, swallowed HTTP errors during decryption, and generic error messages in fetchRunKey.

Changes:

  • awaitCommand() now rejects on non-zero exit codes (with stderr/stdout in the error), and cliInspectJson()/cliHealthJson() throw on empty stdout instead of silently parsing {}.
  • maybeDecryptFields() re-throws HTTP errors instead of falling back to encrypted placeholders, and fetchRunKey() now includes response body and status text in error messages.
  • Added 429: 'Too Many Requests' to the CLI's getStatusText() map.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
packages/core/e2e/utils.ts awaitCommand() rejects on non-zero exit code; cliInspectJson()/cliHealthJson() throw on empty stdout
packages/cli/src/lib/inspect/hydration.ts Re-throws HTTP-related errors from decryption catch block instead of silently falling back
packages/cli/src/lib/inspect/output.ts Adds 429 status code to getStatusText() lookup
packages/world-vercel/src/encryption.ts Enriches fetchRunKey error with response body and status text
.changeset/fix-world-vercel-fetchrunkey-error.md Changeset for @workflow/world-vercel patch
.changeset/fix-cli-429-error-surfacing.md Changeset for @workflow/cli patch

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

// If the key fetch failed due to an HTTP error (e.g. 429 rate limit,
// 500 server error), re-throw so the caller surfaces a clear failure
// instead of silently showing encrypted placeholders.
if (message.includes('HTTP ')) {
Copy link

Copilot AI Mar 9, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The message.includes('HTTP ') heuristic for distinguishing HTTP errors from decryption errors is fragile. If any future error message from maybeDecrypt, importKey, or other code in the try block happens to contain the substring "HTTP ", it will be incorrectly re-thrown instead of falling through to the graceful fallback.

Consider a more robust approach, such as checking for the specific prefix used in fetchRunKey (e.g., message.startsWith('Failed to fetch run key')) or tagging HTTP errors with a custom property (e.g., error.isHttpError = true) in fetchRunKey and checking for that property here.

Suggested change
if (message.includes('HTTP ')) {
const isHttpError =
(err as any)?.isHttpError === true ||
message.startsWith('Failed to fetch run key');
if (isHttpError) {

Copilot uses AI. Check for mistakes.
@TooTallNate TooTallNate merged commit d842ce1 into main Mar 10, 2026
105 checks passed
@TooTallNate TooTallNate deleted the fix/e2e-error-surfacing branch March 10, 2026 07:23
pranaygp added a commit that referenced this pull request Mar 12, 2026
…ignal

* origin/main: (26 commits)
  Fix flaky streamer test ENOENT when chunks directory does not exist yet (#1330)
  Version Packages (beta) (#1325)
  [web-shared] Improve workflow observability event list UX (#1337)
  feat: add `exists` getter to `Run` class (#1336)
  Support client-side tools in DurableAgent (#1329)
  [world-postgres] [world-local] Execute Graphile jobs directly instead of defering to world-local queue (#1334)
  Merge CLAUDE.md into AGENTS.md and symlink CLAUDE.md (#1326)
  [web] Polish loading indicators (#1327)
  Fix flaky webhookWorkflow e2e test by polling instead of fixed sleep (#1328)
  feat: support `deploymentId: 'latest'` in `start()` to resolve most recent deployment (#1317)
  Fix bug where the SWC compiler bug prunes step-only imports in the client-mode transformation
  [web] [world-vercel] Ensure user-passed run IDs are URL encoded and call out self-hosted security (#1322)
  Version Packages (beta) (#1306)
  Remove hard-coded VERCEL_DEPLOYMENT_KEY from nextjs-turbopack workbench (#1319)
  fix(web): move react-router deps to devDependencies (#1265)
  fix(ai): use workspace:* for workflow peer dependency (#1320)
  fix(core): pass resolved deploymentId to getEncryptionKeyForRun in start() (#1318)
  fix: surface 429 rate-limit errors in e2e tests and CLI (#1309)
  fix(world-local): return HTTP 200 instead of 503 for queue timeout re-enqueue signals (#1307)
  [web-shared] [cli] Refactor observability data fetching (#1261)
  ...

# Conflicts:
#	packages/core/e2e/e2e.test.ts
#	packages/web-shared/src/components/sidebar/attribute-panel.tsx
#	workbench/example/workflows/99_e2e.ts
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants