Skip to content

nvidia/OpenCodeReasoning-2

Text GenerationEnglish

Nvidia/OpenCodeReasoning-2 is a text generation-focused dataset in English distributed in Parquet format.

About nvidia/OpenCodeReasoning-2

OpenCodeReasoning-2: A Large-scale Dataset for Reasoning in Code Generation and Critique Dataset Description OpenCodeReasoning-2 is the largest reasoning-based synthetic dataset to date for coding, comprising 1.4M samples in Python a...

Details

Task
Text Generation
Language
English
Format
Parquet
Rows / instances
N/A
Creator
nvidia
Year
2025
Download

Related Text Generation datasets

FAQ