nvidia/Nemotron-Pretraining-Code-v3
Text GenerationCODE
Nvidia/Nemotron-Pretraining-Code-v3 is a text generation dataset in CODE from nvidia in Parquet format.
About nvidia/Nemotron-Pretraining-Code-v3
Nemotron-Pretraining-Code-v3
Dataset Description:
The Nemotron-Pretraining-Code-v3 dataset is part of the Nemotron Pretraining Data collection of pretraining datasets. Designed for the NVIDIA Nemotron 3 family of LLMs, this datas...
Details
- Task
- Text Generation
- Language
- CODE
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- nvidia
- Year
- 2026