nvidia/Nemotron-CC-v2.1

Text Generation, English

Created by nvidia in 2025, nvidia/Nemotron-CC-v2.1 is an English text generation dataset in Parquet format.

About nvidia/Nemotron-CC-v2.1

Dataset description: The Nemotron-Pre-Training-Dataset-v2.1 extends the previously released Nemotron pretraining datasets with refreshed, higher-quality, and more diverse data across...

Details

Task: Text Generation
Language: English
Format: Parquet
Rows / instances: N/A
Creator: nvidia
Year: 2025
