Skip to content

nvidia/Nemotron-CC-Math-v1

Text GenerationEnglish

Nvidia/Nemotron-CC-Math-v1 is a text generation dataset in English from nvidia in Parquet format.

About nvidia/Nemotron-CC-Math-v1

Nemotron-Pre-Training-Dataset-v1 Release 👩‍💻 Authors: Rabeeh Karimi Mahabadi, Sanjeev Satheesh 📘 Paper: Nemotron-cc-math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset 📝 Blog: Nemotron-cc-math blog Data Overview We’...

Details

Task
Text Generation
Language
English
Format
Parquet
Rows / instances
N/A
Creator
nvidia
Year
2025
Download

Related Text Generation datasets

FAQ