Skip to content

miracl/miracl-corpus

Text RetrievalAR, BN, EN

Miracl/miracl-corpus is a text retrieval dataset in AR, BN, EN from miracl in Parquet format.

About miracl/miracl-corpus

Dataset Card for MIRACL Corpus MIRACL 🌍🙌🌏 (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual retrieval dataset that focuses on search across 18 different languages, which collectively encompass over three bil...

Details

Task
Text Retrieval
Language
AR, BN, EN
Format
Parquet
Rows / instances
N/A
Creator
miracl
Year
2022
Download

Related Text Retrieval datasets

FAQ