nvidia/OpenCodeReasoning
Text Generation, English
Created by nvidia in 2025, nvidia/OpenCodeReasoning is an English text generation dataset in Parquet format.
About nvidia/OpenCodeReasoning
OpenCodeReasoning: Advancing Data Distillation for Competitive Coding
Data Overview
OpenCodeReasoning is the largest reasoning-based synthetic dataset to date for coding, comprising 735,255 samples in Python across 28,319 unique compe...
Details
- Task: Text Generation
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: nvidia
- Year: 2025
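Since the dataset is distributed as Parquet, one plausible way to inspect a few rows is through the Hugging Face `datasets` library. The sketch below is a minimal illustration, not the publisher's documented method: the helper name `peek_samples` is hypothetical, and the configuration name passed to it is an assumption that should be checked against the dataset card.

```python
from itertools import islice

REPO_ID = "nvidia/OpenCodeReasoning"  # dataset ID as given on this page


def peek_samples(config_name, n=3):
    """Return the first n rows of one configuration as a list of dicts.

    config_name is an assumption supplied by the caller; this page does not
    list the available configurations.
    """
    # Imported lazily so the module can be imported without the library installed.
    from datasets import load_dataset

    # streaming=True reads rows lazily instead of downloading every Parquet file.
    splits = load_dataset(REPO_ID, config_name, streaming=True)
    # With no split argument, the result maps split names to iterable datasets;
    # take whichever split comes first rather than guessing its name.
    first_split = next(iter(splits.values()))
    return list(islice(first_split, n))
```

Calling `peek_samples("split_0")` would return up to three rows, assuming a configuration named `split_0` exists; the column names of each row can then be read from the dictionary keys.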