nvidia/Nemotron-CC-Math-v1
Text GenerationEnglish
Nvidia/Nemotron-CC-Math-v1 is a text generation dataset in English from nvidia in Parquet format.
About nvidia/Nemotron-CC-Math-v1
Nemotron-Pre-Training-Dataset-v1 Release
👩💻 Authors: Rabeeh Karimi Mahabadi, Sanjeev Satheesh
📘 Paper: Nemotron-cc-math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
📝 Blog: Nemotron-cc-math blog
Data Overview
We’...
Details
- Task
- Text Generation
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- nvidia
- Year
- 2025