Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 7 additions & 1 deletion plugins/agentv-dev/skills/agentv-eval-builder/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,12 @@
---
name: agentv-eval-builder
description: Create and maintain AgentV evaluation files for testing AI agent performance. Use this skill when creating new eval files, adding tests, configuring evaluators, or converting Agent Skills evals.json files to AgentV format.
description: >-
Create and maintain AgentV EVAL.yaml / .eval.yaml evaluation files for testing AI agent output quality.
Use when asked to create new AgentV eval files, add test cases to an existing .eval.yaml,
configure AgentV evaluators (llm-judge, code-judge, rubrics), or convert evals.json to AgentV EVAL YAML format
using `agentv convert`.
Do NOT use for creating SKILL.md files, writing skill definitions, or building skill test suites —
those tasks belong to the skill-creator skill.
---

# AgentV Eval Builder
Expand Down
7 changes: 6 additions & 1 deletion plugins/agentv-dev/skills/agentv-eval-orchestrator/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,11 @@
---
name: agentv-eval-orchestrator
description: Run AgentV evaluations by orchestrating eval subcommands. Use this skill when asked to run evals, evaluate an agent, test prompt quality using agentv, or run Agent Skills evals.json files.
description: >-
Run AgentV evaluations against EVAL.yaml / .eval.yaml / evals.json files using the `agentv prompt eval` and `agentv eval` CLI commands.
Use when asked to run AgentV evals, evaluate agent output quality with AgentV, execute an AgentV evaluation suite,
or orchestrate AgentV eval subcommands.
Do NOT use for creating or modifying SKILL.md files, packaging skills, optimizing skill trigger descriptions,
or measuring skill-creator performance — those tasks belong to the skill-creator skill.
---

# AgentV Eval Orchestrator
Expand Down
9 changes: 8 additions & 1 deletion plugins/agentv-dev/skills/agentv-optimizer/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,13 @@
---
name: agentv-optimizer
description: Optimize agent prompts through evaluation-driven refinement. Five-phase workflow (Discovery → Planning → Optimization → Polish → Handoff) that ensures evaluation integrity and keeps the user in control.
description: >-
Optimize agent task prompts through AgentV evaluation-driven refinement using `agentv prompt eval` and EVAL.yaml files.
Five-phase workflow (Discovery → Planning → Optimization → Polish → Handoff) that iteratively improves prompts
based on AgentV eval scores.
Use when asked to optimize agent performance against AgentV evals, improve prompt quality using AgentV evaluation results,
or run the AgentV optimization loop.
Do NOT use for optimizing SKILL.md trigger descriptions, improving skill discoverability, or editing skill metadata —
those tasks belong to the skill-creator skill.
---

# AgentV Optimizer
Expand Down
8 changes: 7 additions & 1 deletion plugins/agentv-dev/skills/agentv-trace-analyst/SKILL.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,12 @@
---
name: agentv-trace-analyst
description: Analyze AgentV evaluation traces using CLI primitives. Use when asked to inspect eval results, find regressions, identify failure patterns, analyze tool trajectories, compute cost/latency statistics, or reason about agent performance from trace data.
description: >-
Analyze AgentV evaluation traces and result JSONL files using `agentv trace` and `agentv compare` CLI commands.
Use when asked to inspect AgentV eval results, find regressions between AgentV evaluation runs,
identify failure patterns in AgentV trace data, analyze tool trajectories, or compute cost/latency/score statistics
from AgentV result files.
Do NOT use for benchmarking skill trigger accuracy, analyzing skill-creator eval performance,
or measuring skill description quality — those tasks belong to the skill-creator skill.
---

# AgentV Trace Analyst
Expand Down