Skip to content
#

headless-testing

Here are 27 public repositories matching this topic...

Extensible benchmarking suite for evaluating AI coding agents on web search tasks. Compare native search vs MCP servers (You.com, expanding) across multiple agents (Claude Code, Gemini, Droid, Codex, expanding) with automated Docker workflows and statistical analysis.

  • Updated Feb 27, 2026
  • TypeScript

Improve this page

Add a description, image, and links to the headless-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the headless-testing topic, visit your repo's landing page and select "manage topics."

Learn more