eval-driven-development

Here are 6 public repositories matching this topic...

mega-edo / mega-security

Security optimization for AI agent systems.

security-optimization agent-security agent-optimization eval-driven-development eval-driven-optimization agent-security-optimization system-prompt-security

Updated May 7, 2026
Python

zircote / autoresearch

Star

Autonomous skill improvement loop for Claude Code plugins — inspired by Karpathy's autoresearch. Modify → evaluate → keep/discard → repeat until convergence. Zero-touch quality iteration at scale.

python convergence quality-assurance autonomous-agents ai-agents karpathy claude-code skill-improvement claude-code-plugin eval-driven-development autoresearch improvement-loop

Updated Mar 27, 2026
Python

shahcolate / Product-Kit

Star

Most AI plugins hope they work. These prove it. Eval-driven Claude plugins for product teams.

product-management claude product-strategy ai-tools llm-as-judge claude-plugin eval-driven-development llm-plugins behavioral-evals

Updated Mar 26, 2026
Python

GeniusTechnoMystic / agentic-swe-grounding-system

Star

Modular self-referencing Markdown grounding system for agentic AI software engineering and architecture

Updated Apr 30, 2026
Python

yosuancrespo / specforge-ai

Star

AI-augmented QA platform for spec-driven development and testing, RAG-grounded analysis, eval-driven development and contract validation across Python, Go, Rust and Solidity.

Updated Apr 2, 2026
Python

SAY-5 / genai-eval

Star

Multilingual GenAI evaluation service across 5 task types and 3 languages, with regression-trend dashboard

multilingual nextjs fastapi llm-eval eval-driven-development

Updated May 7, 2026
Python

Improve this page

Add a description, image, and links to the eval-driven-development topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the eval-driven-development topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

eval-driven-development

Here are 6 public repositories matching this topic...

mega-edo / mega-security

zircote / autoresearch

shahcolate / Product-Kit

GeniusTechnoMystic / agentic-swe-grounding-system

yosuancrespo / specforge-ai

SAY-5 / genai-eval

Improve this page

Add this topic to your repo