Skip to content

skill-benchmark-design

Design lightweight benchmarks for agent skills and skillsets. Use when Codex is asked to prove whether a skill improves outputs, define benchmark scenarios, compare baseline versus skill-enabled behavior, or create a skill evaluation plan.

Repository Source folder

Details

Path
mirrors/repos/jeremylongworth-source@AgentSkills/skills/skill-benchmark-design/SKILL.md
License
MIT

FAQ