European Parliament Proceedings (Europarl)
Text Corpora | Machine Translation | Multi-Lingual
European Parliament Proceedings (Europarl) is a multilingual text corpus for machine translation that provides more than 10 million instances distributed in XML format.
About European Parliament Proceedings (Europarl)
The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 21 European languages.
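Because the corpus is parallel, each sentence in one language is paired with its translation in another. The sketch below shows one way to iterate over such pairs. It assumes a line-aligned plain-text layout, where line i of one file translates line i of the other, rather than the XML format listed on this card. The function name `iter_sentence_pairs` and the sample sentences are hypothetical and are not taken from the dataset.

```python
from typing import Iterable, Iterator, Tuple


def iter_sentence_pairs(
    source_lines: Iterable[str], target_lines: Iterable[str]
) -> Iterator[Tuple[str, str]]:
    """Yield (source, target) pairs from two line-aligned sequences.

    Pairs in which either side is empty after stripping are skipped.
    """
    for src, tgt in zip(source_lines, target_lines):
        src, tgt = src.strip(), tgt.strip()
        if src and tgt:
            yield src, tgt


# Hypothetical sample data standing in for two aligned files.
english = ["Resumption of the session\n", "\n", "Thank you.\n"]
french = ["Reprise de la session\n", "\n", "Merci.\n"]

pairs = list(iter_sentence_pairs(english, french))
```

In practice the two lists could be replaced by open file handles, since the function only needs iterables of lines.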
Details
- Task: Text Corpora, Machine Translation
- Language: Multi-Lingual
- Format: XML
- Rows / instances: 10M+
- Creator: Koehn et al.
- Year: 2002