applied-ai-018/pretraining_v1-omega_books
General NLP, English
The applied-ai-018/pretraining_v1-omega_books dataset is an English General NLP resource published by applied-ai-018 in 2026, comprising 51,901,183 examples. It has about 364.5K downloads and 7 likes, and it is tagged in the 100M<n<1B size category.
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: 51,901,183
- Size: 100M<n<1B
- Creator: applied-ai-018
- Year: 2026
- Downloads: 364,474
- Likes: 7
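
Because the dataset is distributed as Parquet and has tens of millions of rows, streaming a few records is usually more practical than downloading everything first. The following is a minimal sketch, not an official loading recipe. It assumes the Hugging Face `datasets` package is installed and that the repository exposes a `train` split, which the listing above does not confirm. The helper names `stream_dataset` and `preview` are hypothetical.

```python
from itertools import islice

# Repository ID as given in the listing above.
REPO_ID = "applied-ai-018/pretraining_v1-omega_books"


def stream_dataset(repo_id=REPO_ID, split="train"):
    """Return a lazily streamed dataset.

    Assumption: a split named "train" exists; adjust if the repository differs.
    Requires the `datasets` package and network access.
    """
    # Imported here so that `preview` stays usable without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id, split=split, streaming=True)


def preview(rows, n=3):
    """Collect the first n items from any iterable of rows without exhausting it."""
    return list(islice(rows, n))
```

Calling `preview(stream_dataset())` would fetch only the first three rows. The column names are not stated in the listing, so inspect the keys of a returned row before relying on any field.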