Skip to content

generalization-evaluator

Cross-domain evaluation to estimate generality and detect blind spots. Use when asked to assess broad capability, compare models across domains, or identify missing skills.

Repository Source folder

Details

Path
data/generalization-evaluator/SKILL.md

FAQ