deepmind/pg19
Text GenerationENapache-2.0
Deepmind/pg19 is a text generation dataset in EN from deepmind with 28,752 records in Parquet format. It is distributed under the apache-2.0 license and falls in the 10K<n<100K size category, and has been downloaded 4.3K times.
About deepmind/pg19
This repository contains the PG-19 language modeling benchmark.
It includes a set of books extracted from the Project Gutenberg books library, that were published before 1919.
It also contains metadata of book titles and publication dates.
PG-19 ...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- 28752
- Size
- 10K<n<100K
- Creator
- deepmind
- Year
- 2022
- License
- apache-2.0
- Downloads
- 4264
- Likes
- 60