eval
Evaluate and rank agent results by metric or LLM judge for an AgentHub session. Use when the user runs /hub:eval or asks to score, compare, or pick a winner among completed AgentHub agents.
Details
- Path
- .gemini/skills/eval
- License
- MIT
- Dependencies
- 1