Skip to content

nampdn-ai/tiny-orca-textbooks

Text GenerationEN

Nampdn-ai/tiny-orca-textbooks is a text generation-focused dataset in EN distributed in Parquet format.

About nampdn-ai/tiny-orca-textbooks

Textbook-like Dataset: A Comprehensive Resource for Text-Based Skills Development in Small Language Models This dataset is a collection of 147k synthetic textbooks designed to enhance the text-based skills of small language models. The curricul...

Details

Task
Text Generation
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
nampdn-ai
Year
2023
Download

Related Text Generation datasets

FAQ