Skip to content

HPLT/HPLT2.0_cleaned

Fill MaskText GenerationACE, AF, ALScc0-1.0

HPLT/HPLT2.0_cleaned is a fill mask-focused dataset in ACE, AF, ALS distributed in Parquet format. It is distributed under the cc0-1.0 license and falls in the n>1T size category, and has been downloaded 25.3K times.

About HPLT/HPLT2.0_cleaned

NB: HPLT2.0 is now superseded by a newer release: HPLT3.0 We recommed switching to v3.0, unless you have a compelling reason to stay on 2.0. This is a large-scale collection of web-crawled documents in 191 world languages, produced by the HPLT pro...

Details

Task
Fill Mask, Text Generation
Language
ACE, AF, ALS
Format
Parquet
Rows / instances
N/A
Size
n>1T
Creator
HPLT
Year
2024
License
cc0-1.0
Downloads
25276
Likes
43
Download Homepage

Related Fill Mask, Text Generation datasets

FAQ