emozilla/pg19
General NLP, English
The emozilla/pg19 dataset is an English general-NLP resource published by emozilla in 2023, comprising 28,752 examples (size category 10K<n<100K). It has 11.8K downloads and 18 likes.
About emozilla/pg19
Dataset Card for "pg19"
Parquet version of pg19
Statistics (in # of characters): total_len: 11425076324, average_len: 399450.2595622684
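Because average_len is total_len divided by the number of documents, the two figures above imply a document count. The following is a minimal sketch of that check, using only the numbers quoted on this page; the variable names are illustrative.

```python
# Figures copied from the dataset card and the listing on this page.
total_len = 11_425_076_324        # total characters
average_len = 399_450.2595622684  # mean characters per document
listed_rows = 28_752              # row count listed for the dataset

# average_len = total_len / n, so n = total_len / average_len.
implied_docs = round(total_len / average_len)

print(implied_docs)                # 28602
print(listed_rows - implied_docs)  # 150
```

The implied count is 150 below the listed row count, which suggests the statistics were computed over a subset of the rows, possibly a single split. The card does not say so, so treat that reading as an assumption.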
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: 28,752
- Size: 10K<n<100K
- Creator: emozilla
- Year: 2023
- Downloads: 11,836
- Likes: 18
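Since the dataset is distributed as Parquet under the identifier emozilla/pg19, one plausible way to read it is through the Hugging Face `datasets` library in streaming mode, which avoids downloading all of the text up front. The sketch below is an illustration under assumptions, not a documented recipe: the split name `"train"` and the field name `"text"` are not stated on this page, and `char_stats` and `stream_pg19` are hypothetical helper names.

```python
from typing import Dict, Iterable


def char_stats(records: Iterable[Dict[str, str]], field: str = "text") -> Dict[str, float]:
    """Return total and mean character length over an iterable of records.

    The default field name "text" is an assumption about the schema.
    """
    total = 0
    count = 0
    for record in records:
        total += len(record[field])
        count += 1
    average = total / count if count else 0.0
    return {"total_len": total, "average_len": average, "count": count}


def stream_pg19(split: str = "train"):
    """Stream the dataset from the Hugging Face Hub.

    Requires `pip install datasets` and network access.
    The split name "train" is an assumption.
    """
    # Imported lazily so char_stats works without the library installed.
    from datasets import load_dataset

    return load_dataset("emozilla/pg19", split=split, streaming=True)


# Example (needs network access), looking at only the first five documents:
#   from itertools import islice
#   print(char_stats(islice(stream_pg19(), 5)))
```

Running `char_stats` over a full split would produce figures in the same form as the total_len and average_len statistics on the card, although the card does not say how its own numbers were computed.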