Skip to content

HPLT/DocHPLT

TranslationAF, AR, AZcc0-1.0

Created by HPLT at 2025, the HPLT/DocHPLT is a translation dataset in AF, AR, AZ containing 124,177,103 records in Parquet format. With 31.1K downloads and 20 likes, it is actively used by the community. It is released under the cc0-1.0 license and is a 100M<n<1B-scale dataset.

About HPLT/DocHPLT

DocHPLT: A Massively Multilingual Document-Level Translation Dataset Existing document-level machine translation resources are only available for a handful of languages, mostly high-resourced ones. To facilitate the training and evaluation of d...

Details

Task
Translation
Language
AF, AR, AZ
Format
Parquet
Rows / instances
124177103
Size
100M<n<1B
Creator
HPLT
Year
2025
License
cc0-1.0
Downloads
31099
Likes
20
Download Homepage

Related Translation datasets

FAQ