HPLT/DocHPLT
Translation | AF, AR, AZ | cc0-1.0
Created by HPLT in 2025, HPLT/DocHPLT is a translation dataset covering AF, AR, AZ, with 124,177,103 records in Parquet format. It has 31.1K downloads and 20 likes. It is released under the cc0-1.0 license and falls in the 100M<n<1B size category.
About HPLT/DocHPLT
DocHPLT: A Massively Multilingual Document-Level Translation Dataset
Existing document-level machine translation resources are only available for a handful of languages, mostly high-resourced ones. To facilitate the training and evaluation of d...
Details
- Task: Translation
- Language: AF, AR, AZ
- Format: Parquet
- Rows / instances: 124,177,103
- Size: 100M<n<1B
- Creator: HPLT
- Year: 2025
- License: cc0-1.0
- Downloads: 31,099
- Likes: 20
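The row count and the size label in the details above can be checked against each other. The sketch below is only an illustration: `size_category` and `SIZE_BUCKETS` are hypothetical names, the bucket boundaries are assumed to follow the power-of-ten labelling that `100M<n<1B` suggests, and the fallback label for larger counts is a placeholder rather than an official category.

```python
# Hypothetical helper: map a row count to a power-of-ten size label.
# The boundaries are an assumption inferred from the "100M<n<1B" label style.
SIZE_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
    (100_000_000, "10M<n<100M"),
    (1_000_000_000, "100M<n<1B"),
]


def size_category(num_rows: int) -> str:
    """Return the label of the first bucket whose upper bound exceeds num_rows."""
    if num_rows < 0:
        raise ValueError("num_rows must be non-negative")
    for upper_bound, label in SIZE_BUCKETS:
        if num_rows < upper_bound:
            return label
    # Placeholder for counts of one billion or more (not an official label).
    return "n>=1B"


# Row count taken from the listing above.
print(size_category(124_177_103))
```

With the listed 124,177,103 rows, the count sits between 100 million and 1 billion, which matches the stated size.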