lighteval/mmlu
Question AnsweringENBenchmark
Lighteval/mmlu is a question answering benchmark dataset in EN from lighteval in Parquet format.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About lighteval/mmlu
Dataset Card for MMLU
Dataset Summary
Measuring Massive Multitask Language Understanding by Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, and Jacob Steinhardt (ICLR 2021).
This is a massive multitas...
Details
- Task
- Question Answering
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- lighteval
- Year
- 2023