wmt/wmt19
TranslationCS, DE, EN
Created by wmt at 2022, the wmt/wmt19 is a translation dataset in CS, DE, EN in Parquet format.
About wmt/wmt19
Dataset Card for "wmt19"
Dataset Summary
Warning: There are issues with the Common Crawl corpus data (training-parallel-commoncrawl.tgz):
Non-English files contain many English sentences.
Their "parallel" sentences in E...
Details
- Task
- Translation
- Language
- CS, DE, EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- wmt
- Year
- 2022