Skip to content

TheBritishLibrary/blbooks

Text GenerationFill MaskOtherDE, EN, ES

TheBritishLibrary/blbooks is a text generation dataset in DE, EN, ES from TheBritishLibrary in Parquet format.

About TheBritishLibrary/blbooks

A dataset comprising of text created by OCR from the 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900. The books cover a wide range of subject areas including philosophy, history, poetry a...

Details

Task
Text Generation, Fill Mask, Other
Language
DE, EN, ES
Format
Parquet
Rows / instances
N/A
Creator
TheBritishLibrary
Year
2022
Download

FAQ