Skip to content

lighteval/mmlu

Question AnsweringENBenchmark

Lighteval/mmlu is a question answering benchmark dataset in EN from lighteval in Parquet format.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About lighteval/mmlu

Dataset Card for MMLU Dataset Summary Measuring Massive Multitask Language Understanding by Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, and Jacob Steinhardt (ICLR 2021). This is a massive multitas...

Details

Task
Question Answering
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
lighteval
Year
2023
Download

Related Question Answering datasets

FAQ