PrimeIntellect/SYNTHETIC-1-SFT-Data
General NLPEnglish
PrimeIntellect/SYNTHETIC-1-SFT-Data is a General NLP dataset in English from PrimeIntellect in Parquet format.
About PrimeIntellect/SYNTHETIC-1-SFT-Data
SYNTHETIC-1: Two Million Crowdsourced Reasoning Traces from Deepseek-R1
SYNTHETIC-1 is a reasoning dataset obtained from Deepseek-R1, generated with crowdsourced compute and annotated with diverse verifiers such as LLM judges or symbolic mathe...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- PrimeIntellect
- Year
- 2025