Skip to content

Evaluation

Runs unit, golden-dataset, and LLM-as-judge tests for a skill.

Repository Source folder

Details

Path
skills/Evaluation/SKILL.md

FAQ