
deepmind/pg19

Text Generation | EN | apache-2.0

deepmind/pg19 is an English (EN) text generation dataset from deepmind with 28,752 records in Parquet format. It is distributed under the apache-2.0 license, falls in the 10K<n<100K size category, and has been downloaded about 4.3K times.

About deepmind/pg19

This repository contains the PG-19 language modeling benchmark. It includes a set of books, extracted from the Project Gutenberg library, that were published before 1919. It also contains metadata: book titles and publication dates. PG-19 ...
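
As a rough illustration of working with the book metadata described above, the following sketch filters records by publication year and shows one way the dataset might be streamed with the Hugging Face `datasets` library. The field names `short_book_title` and `publication_date`, the `train` split name, and the helper names `published_before` and `stream_pg19` are assumptions made for this example and are not confirmed by this page; the sample rows are hypothetical.

```python
from typing import Iterable, Iterator


def published_before(records: Iterable[dict], year: int) -> Iterator[dict]:
    """Yield records whose publication year is strictly earlier than `year`."""
    for record in records:
        # "publication_date" is an assumed field name holding a year.
        if int(record["publication_date"]) < year:
            yield record


def stream_pg19(split: str = "train"):
    """Stream the dataset from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the helper above works without the `datasets` package.
    from datasets import load_dataset

    # The split name is an assumption; check the dataset page for actual splits.
    return load_dataset("deepmind/pg19", split=split, streaming=True)


# Hypothetical sample rows for illustration only; not taken from the dataset.
sample = [
    {"short_book_title": "Example A", "publication_date": 1850},
    {"short_book_title": "Example B", "publication_date": 1925},
]
early = [r["short_book_title"] for r in published_before(sample, 1919)]
print(early)
```

Streaming avoids downloading every book before reading the first one, which may be convenient for a collection of full-length texts. With network access, something like `published_before(stream_pg19(), 1919)` could be iterated lazily, provided the assumed field names match the actual schema.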

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: 28,752
Size: 10K<n<100K
Creator: deepmind
Year: 2022
License: apache-2.0
Downloads: 4,264
Likes: 60
