hiepp2/tvp4
Text GenerationEN
Created by hiepp2 at 2025, the hiepp2/tvp4 is a text generation dataset in EN containing 698,634 records in Parquet format. With 17.4K downloads and 1 likes, it is actively used by the community and is a n>1T-scale dataset.
About hiepp2/tvp4
Dataset summary
Mixture-of-Thoughts is a curated dataset of 350k verified reasoning traces distilled from DeepSeek-R1. The dataset spans tasks in mathematics, coding, and science, and is designed to teach language models to reason step-by-step....
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- 698634
- Size
- n>1T
- Creator
- hiepp2
- Year
- 2025
- Downloads
- 17405
- Likes
- 1