Llama Nemotron Ultra 253B
NVIDIALanguage modeling/generationQuestion answeringQuantitative reasoningCode generationNeural Architecture Search - NASOpen weights
Llama Nemotron Ultra 253B is a language modeling/generation model from NVIDIA released in 2025 with 253000000000.0 parameters.
About Llama Nemotron Ultra 253B
Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) which is a derivative of Meta Llama-3.1-405B-Instruct (AKA the reference model). It is a reasoning model that is post trained for reasoning, human chat preferences, and tasks, such as R
Details
- Provider
- NVIDIA
- Task
- Language modeling/generation,Question answering,Quantitative reasoning,Code generation,Neural Architecture Search - NAS
- Parameters
- 253000000000.0
- Released
- 2025-03-18
- Open weights
- Yes