Skip to content

Llama Nemotron Ultra 253B

NVIDIALanguage modeling/generationQuestion answeringQuantitative reasoningCode generationNeural Architecture Search - NASOpen weights

Llama Nemotron Ultra 253B is a language modeling/generation model from NVIDIA released in 2025 with 253000000000.0 parameters.

About Llama Nemotron Ultra 253B

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) which is a derivative of Meta Llama-3.1-405B-Instruct (AKA the reference model). It is a reasoning model that is post trained for reasoning, human chat preferences, and tasks, such as R

Details

Provider
NVIDIA
Task
Language modeling/generation,Question answering,Quantitative reasoning,Code generation,Neural Architecture Search - NAS
Parameters
253000000000.0
Released
2025-03-18
Open weights
Yes
View model source

Explore

FAQ