Skip to content

nvidia/Nemotron-Pretraining-Code-v3

Text GenerationCODE

Nvidia/Nemotron-Pretraining-Code-v3 is a text generation dataset in CODE from nvidia in Parquet format.

About nvidia/Nemotron-Pretraining-Code-v3

Nemotron-Pretraining-Code-v3 Dataset Description: The Nemotron-Pretraining-Code-v3 dataset is part of the Nemotron Pretraining Data collection of pretraining datasets. Designed for the NVIDIA Nemotron 3 family of LLMs, this datas...

Details

Task
Text Generation
Language
CODE
Format
Parquet
Rows / instances
N/A
Creator
nvidia
Year
2026
Download

Related Text Generation datasets

FAQ