glaiveai/reasoning-v1-20m
Text GenerationENapache-2.0
Created by glaiveai at 2025, the glaiveai/reasoning-v1-20m is a text generation dataset in EN containing 22,199,375 records in Parquet format. With 3.7K downloads and 236 likes, it is actively used by the community. It is released under the apache-2.0 license and is a 10M<n<100M-scale dataset.
About glaiveai/reasoning-v1-20m
We are excited to release a synthetic reasoning dataset containing 22mil+ general reasoning questions and responses generated using deepseek-ai/DeepSeek-R1-Distill-Llama-70B. While there have been multiple efforts to build open reasoning datasets ...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- 22199375
- Size
- 10M<n<100M
- Creator
- glaiveai
- Year
- 2025
- License
- apache-2.0
- Downloads
- 3660
- Likes
- 236