HPLT/HPLT2.0_cleaned
Fill Mask | Text Generation | ACE, AF, ALS | cc0-1.0
HPLT/HPLT2.0_cleaned is a dataset for fill-mask and text-generation tasks, listed for the languages ACE, AF, and ALS and distributed in Parquet format. It is released under the cc0-1.0 license, falls in the n>1T size category, and has been downloaded 25.3K times.
About HPLT/HPLT2.0_cleaned
NB: HPLT2.0 is now superseded by a newer release, HPLT3.0.
We recommend switching to v3.0 unless you have a compelling reason to stay on 2.0.
This is a large-scale collection of web-crawled documents in 191 world languages, produced by the HPLT pro...
Details
- Task: Fill Mask, Text Generation
- Language: ACE, AF, ALS
- Format: Parquet
- Rows / instances: N/A
- Size: n>1T
- Creator: HPLT
- Year: 2024
- License: cc0-1.0
- Downloads: 25276
- Likes: 43
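
Because the corpus is in the n>1T size category, downloading it in full is rarely practical. The following is a minimal sketch, not an official loading recipe, of how a few records could be inspected with the Hugging Face `datasets` library in streaming mode. The helper names `stream_kwargs` and `peek` are hypothetical, and the `train` split and any per-language configuration name are assumptions that should be checked against the dataset page before use.

```python
from itertools import islice
from typing import Any, Dict, Iterator, Optional

REPO_ID = "HPLT/HPLT2.0_cleaned"  # repository id as given on this page


def stream_kwargs(config: Optional[str] = None, split: str = "train") -> Dict[str, Any]:
    """Build keyword arguments for datasets.load_dataset in streaming mode.

    `config` is a per-language subset name and `split` defaults to "train";
    both are assumptions, since this page does not list them.
    """
    kwargs: Dict[str, Any] = {"path": REPO_ID, "split": split, "streaming": True}
    if config is not None:
        kwargs["name"] = config
    return kwargs


def peek(config: Optional[str] = None, n: int = 3) -> Iterator[dict]:
    """Yield the first n records without downloading the whole corpus."""
    # Imported lazily so the helper above works without the library installed.
    from datasets import load_dataset  # requires: pip install datasets

    dataset = load_dataset(**stream_kwargs(config))
    yield from islice(dataset, n)
```

Calling `peek` with a configuration name taken from the dataset page and printing each record's keys is a cheap way to learn the actual schema before committing to a larger job.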