
glaiveai/reasoning-v1-20m

Text Generation | EN | apache-2.0

Created by glaiveai in 2025, glaiveai/reasoning-v1-20m is an English (EN) text generation dataset containing 22,199,375 records in Parquet format. It has about 3.7K downloads and 236 likes. It is released under the apache-2.0 license and falls in the 10M<n<100M size category.

About glaiveai/reasoning-v1-20m

We are excited to release a synthetic reasoning dataset containing more than 22 million general reasoning questions and responses generated using deepseek-ai/DeepSeek-R1-Distill-Llama-70B. While there have been multiple efforts to build open reasoning datasets ...

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: 22,199,375
Size: 10M<n<100M
Creator: glaiveai
Year: 2025
License: apache-2.0
Downloads: 3,660
Likes: 236
