wmt/wmt19

Translation · CS, DE, EN

Created by wmt in 2022, wmt/wmt19 is a translation dataset covering Czech (CS), German (DE), and English (EN), distributed in Parquet format.

About wmt/wmt19

Dataset Card for "wmt19"

Dataset Summary

Warning: There are issues with the Common Crawl corpus data (training-parallel-commoncrawl.tgz): Non-English files contain many English sentences. Their "parallel" sentences in E...
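Given the warning above about English sentences appearing in the non-English Common Crawl files, one simple sanity check is to drop pairs whose two sides are the same text. The snippet below is only a minimal sketch of that idea, not the dataset maintainers' method: the function names, the flat `{"de": ..., "en": ...}` record shape, and the sample sentences are all assumptions made for illustration, and the check catches only exact copies (after case and whitespace normalization), not every untranslated sentence.

```python
from typing import Dict, Iterable, Iterator

# Hypothetical record shape for illustration: one dict per sentence pair,
# keyed by language code. The real dataset's schema may differ.
Pair = Dict[str, str]


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def drop_untranslated(
    pairs: Iterable[Pair], src: str = "de", tgt: str = "en"
) -> Iterator[Pair]:
    """Yield only pairs whose source and target differ after normalization.

    A pair with identical sides is most likely an English sentence that was
    copied into the non-English file rather than a real translation.
    """
    for pair in pairs:
        if normalize(pair[src]) != normalize(pair[tgt]):
            yield pair


# Made-up sample data, not taken from the dataset.
sample = [
    {"de": "Guten Morgen.", "en": "Good morning."},
    {"de": "Click here to subscribe.", "en": "Click  here to subscribe."},
    {"de": "Vielen Dank!", "en": "Thank you!"},
]

kept = list(drop_untranslated(sample))
print(f"kept {len(kept)} of {len(sample)} pairs")
```

A stricter filter would run language identification on the non-English side, but that requires an extra library and is left out of this sketch.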

Details

Task: Translation
Language: CS, DE, EN
Format: Parquet
Rows / instances: N/A
Creator: wmt
Year: 2022