Skip to content

emozilla/pg19

General NLPEnglish

The emozilla/pg19 dataset is a English General NLP resource from emozilla at 2023 comprising 28,752 examples. With 11.8K downloads and 18 likes, it is actively used by the community and is a 10K<n<100K-scale dataset.

About emozilla/pg19

Dataset Card for "pg19" Paraquet version of pg19 Statistics (in # of characters): total_len: 11425076324, average_len: 399450.2595622684

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
28752
Size
10K<n<100K
Creator
emozilla
Year
2023
Downloads
11836
Likes
18
Download Homepage

Related General NLP datasets

FAQ