Skip to content

CohereLabs/Global-MMLU-Lite

General NLPAR, BN, CSBenchmarkapache-2.0

Created by CohereLabs at 2024, the CohereLabs/Global-MMLU-Lite is a General NLP benchmark dataset in AR, BN, CS containing 14,000 records in Parquet format. With 7.2K downloads and 41 likes, it is actively used by the community. It is released under the apache-2.0 license and is a 10K<n<100K-scale dataset.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About CohereLabs/Global-MMLU-Lite

Releases: Version 3.0 (May 2026): GMMLU Lite 3.0 release with 5 new languages: Czech, Hungarian, Italian (updated), Oriya, Slovak and Tajik Version 2.0 (Dec 2025): GMMLU Lite 2.0 release with 3 new languages: Albanian, Burmese and Welsh Versio...

Details

Task
General NLP
Language
AR, BN, CS
Format
Parquet
Rows / instances
14000
Size
10K<n<100K
Creator
CohereLabs
Year
2024
License
apache-2.0
Downloads
7206
Likes
41
Download Homepage

Related General NLP datasets

FAQ