skill-benchmark-design
Design lightweight benchmarks for agent skills and skillsets. Use when Codex is asked to prove whether a skill improves outputs, define benchmark scenarios, compare baseline versus skill-enabled behavior, or create a skill evaluation plan.
Details
- Path
- mirrors/repos/jeremylongworth-source@AgentSkills/skills/skill-benchmark-design/SKILL.md
- License
- MIT