miracl/miracl-corpus
Text Retrieval | AR, BN, EN
miracl/miracl-corpus is a text retrieval dataset in Arabic (AR), Bengali (BN), and English (EN), published by miracl in Parquet format.
About miracl/miracl-corpus
Dataset Card for MIRACL Corpus
MIRACL 🌍🙌🌏 (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual retrieval dataset that focuses on search across 18 different languages, which collectively encompass over three bil...
Details
- Task: Text Retrieval
- Language: AR, BN, EN
- Format: Parquet
- Rows / instances: N/A
- Creator: miracl
- Year: 2022