nampdn-ai/tiny-orca-textbooks
Text Generation, EN
nampdn-ai/tiny-orca-textbooks is an English-language text generation dataset distributed in Parquet format.
About nampdn-ai/tiny-orca-textbooks
Textbook-like Dataset: A Comprehensive Resource for Text-Based Skills Development in Small Language Models
This dataset is a collection of 147k synthetic textbooks designed to enhance the text-based skills of small language models. The curricul...
Details
- Task: Text Generation
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: nampdn-ai
- Year: 2023
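
Because the card lists a Parquet-format dataset identified as nampdn-ai/tiny-orca-textbooks, one plausible way to inspect it is through the Hugging Face `datasets` library. The following is a minimal sketch, not a documented loading procedure: it assumes the dataset is hosted on the Hugging Face Hub under that ID and that a `train` split exists. The helper names `load_textbooks` and `preview` are hypothetical, and no column names are assumed.

```python
from itertools import islice

# Assumption: the ID from this card is also the Hugging Face Hub dataset ID.
DATASET_ID = "nampdn-ai/tiny-orca-textbooks"


def load_textbooks(split="train", streaming=True):
    """Return one split of the dataset (the "train" split name is an assumption)."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # streaming=True iterates over records without downloading every file first.
    return load_dataset(DATASET_ID, split=split, streaming=streaming)


def preview(records, n=3):
    """Collect the first n records from any iterable of rows."""
    return list(islice(records, n))
```

Calling `preview(load_textbooks())` would fetch a few records over the network, which is a reasonable first step for checking which fields the dataset exposes before committing to a full download.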