generalization-evaluator
Skill by majiayu000
Cross-domain evaluation to estimate generality and detect blind spots. Use when asked to assess broad capability, compare models across domains, or identify missing skills.
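As a rough illustration of what a cross-domain evaluation might compute, the sketch below macro-averages per-domain scores and flags domains that fall well below the overall mean as candidate blind spots. This is not taken from the skill itself: the function name `summarize_domains`, the `blind_spot_margin` threshold, and the assumption that scores lie in [0, 1] are all hypothetical choices made for the example.

```python
from statistics import mean


def summarize_domains(scores, blind_spot_margin=0.15):
    """Summarize per-domain results (hypothetical helper, not part of the skill).

    scores: dict mapping a domain name to a list of per-task scores in [0, 1].
    blind_spot_margin: assumed threshold; a domain is flagged when its mean
        score falls more than this far below the macro-average.
    """
    # Mean score per domain, skipping domains that have no results.
    per_domain = {d: mean(v) for d, v in scores.items() if v}
    if not per_domain:
        raise ValueError("no domain has any scores")

    # Macro-average: every domain counts equally, regardless of task count.
    overall = mean(list(per_domain.values()))

    # Domains scoring well below the macro-average are candidate blind spots.
    blind_spots = sorted(
        d for d, s in per_domain.items() if s < overall - blind_spot_margin
    )
    return {"per_domain": per_domain, "overall": overall, "blind_spots": blind_spots}


if __name__ == "__main__":
    example = {
        "math": [0.9, 0.8],
        "code": [0.85, 0.75],
        "law": [0.3, 0.4],
    }
    print(summarize_domains(example))
```

A macro-average is used here so that a domain with many tasks does not mask a weak domain with few; a weighted average would be an equally reasonable choice depending on the comparison being made.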
Details
- Path: data/generalization-evaluator/SKILL.md