hiepp2/tvp4

Text Generation, EN

Created by hiepp2 in 2025, hiepp2/tvp4 is an English text generation dataset containing 698,634 records in Parquet format. It has 17,405 downloads and 1 like, and is listed under the n>1T size category.

About hiepp2/tvp4

Dataset summary

Mixture-of-Thoughts is a curated dataset of 350k verified reasoning traces distilled from DeepSeek-R1. The dataset spans tasks in mathematics, coding, and science, and is designed to teach language models to reason step-by-step....

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: 698,634
Size: n>1T
Creator: hiepp2
Year: 2025
Downloads: 17,405
Likes: 1