TheBritishLibrary/blbooks
Text GenerationFill MaskOtherDE, EN, ES
TheBritishLibrary/blbooks is a text generation dataset in DE, EN, ES from TheBritishLibrary in Parquet format.
About TheBritishLibrary/blbooks
A dataset comprising of text created by OCR from the 49,455 digitised books, equating to 65,227 volumes (25+ million pages), published between c. 1510 - c. 1900.
The books cover a wide range of subject areas including philosophy, history, poetry a...
Details
- Task
- Text Generation, Fill Mask, Other
- Language
- DE, EN, ES
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- TheBritishLibrary
- Year
- 2022