chcaa/kb-books
General NLPEnglishcc0-1.0
The chcaa/kb-books dataset is a English General NLP resource from chcaa at 2025. With 22.7K downloads and 3 likes, it is actively used by the community. It is released under the cc0-1.0 license and is a 1M<n<10M-scale dataset.
About chcaa/kb-books
open-rdl-books
Dataset Description
Language
dan, dansk, Danish
License
Public Domain, cc0-1.0
Dataset Summary
Documents from the Royal Danish Library published between 1750 and 1930.
The dataset has ea...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1M<n<10M
- Creator
- chcaa
- Year
- 2025
- License
- cc0-1.0
- Downloads
- 22693
- Likes
- 3