Skip to content

The Cross-lingual Natural Language Inference corpus (XNLI)

EntailmentMulti-Lingual

The Cross-lingual Natural Language Inference corpus (XNLI) is a entailment-focused dataset in Multi-Lingual that provides 112,5 labeled examples distributed in JSON, Text format.

About The Cross-lingual Natural Language Inference corpus (XNLI)

Dataset contains collection of 5,000 test and 2,500 dev pairs for the MultiNLI corpus. The pairs are annotated with textual entailment and translated into 14 languages: French, Spanish, German, Greek, Bulgarian, Russian, Turkish, Arabic, Vietnamese, Thai, Chinese, Hindi, Swahili and Urdu.

Details

Task
Entailment
Language
Multi-Lingual
Format
JSON, Text
Rows / instances
112,5
Creator
Conneau et al.
Year
2018
Download Paper

Related Entailment datasets

FAQ