Skip to content

Megatron-LM (8.3B)

NVIDIALanguage modeling/generation

Developed by NVIDIA in 2019, Megatron-LM (8.3B) is a language modeling/generation model with 8300000000.0 parameters.

About Megatron-LM (8.3B)

Recent work in language modeling demonstrates that training large transformer models advances the state of the art in Natural Language Processing applications. However, very large models can be quite difficult to train due to memory constraints. In t

Details

Provider
NVIDIA
Task
Language modeling/generation
Parameters
8300000000.0
Released
2019-09-17
Open weights
No
View model source

Explore

FAQ