google-research-datasets/mbpp
General NLP | EN | Benchmark | cc-by-4.0
The google-research-datasets/mbpp dataset is an English-language (EN) General NLP resource from google-research-datasets, dated 2026, comprising 1,401 examples. It has 161.7K downloads and 231 likes. It is released under the cc-by-4.0 license and falls in the 1K<n<10K size category.
📊 This dataset is used as an LLM benchmark.
About google-research-datasets/mbpp
Dataset Card for Mostly Basic Python Problems (mbpp)
Dataset Summary
The benchmark consists of around 1,000 crowd-sourced Python programming problems, designed to be solvable by entry-level programmers, covering programming f...
Details
- Task: General NLP
- Language: EN
- Format: Parquet
- Rows / instances: 1,401
- Size: 1K<n<10K
- Creator: google-research-datasets
- Year: 2026
- License: cc-by-4.0
- Downloads: 161,740
- Likes: 231
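
Because the dataset serves as a benchmark of Python programming problems, a typical use is to run a candidate solution against a problem's test assertions. The following is a minimal sketch of that idea, not the benchmark's official harness. It assumes each record carries a `code` string and a `test_list` of assert statements; those field names, the `passes_all_tests` helper, and the example record are illustrative assumptions that do not appear on this page.

```python
from typing import Dict, List


def passes_all_tests(candidate_code: str, tests: List[str]) -> bool:
    """Return True if every assert statement in `tests` holds for the candidate."""
    namespace: Dict[str, object] = {}
    try:
        # Define the candidate's functions in a fresh, separate namespace.
        exec(candidate_code, namespace)
        for test in tests:
            # Each test is a string such as "assert add(1, 2) == 3".
            exec(test, namespace)
    except Exception:
        # A syntax error, runtime error, or failed assertion counts as a failure.
        return False
    return True


# Hypothetical record, hand-written for illustration (not taken from the dataset).
example = {
    "text": "Write a function to add two numbers.",
    "code": "def add(a, b):\n    return a + b",
    "test_list": ["assert add(1, 2) == 3", "assert add(-1, 1) == 0"],
}

if __name__ == "__main__":
    print(passes_all_tests(example["code"], example["test_list"]))
```

Note that `exec` runs arbitrary code in the current process, so model-generated solutions should only be evaluated this way inside a sandbox with time and resource limits.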