microsoft/ms_marco
General NLPENBenchmark
Microsoft/ms_marco is a General NLP-focused benchmark dataset in EN distributed in Parquet format.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About microsoft/ms_marco
Dataset Card for "ms_marco"
Dataset Summary
Starting with a paper released at NIPS 2016, MS MARCO is a collection of datasets focused on deep learning in search.
The first dataset was a question answering dataset featuring 100,000 re...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- microsoft
- Year
- 2022