Megatron-LM (8.3B)
NVIDIALanguage modeling/generation
Developed by NVIDIA in 2019, Megatron-LM (8.3B) is a language modeling/generation model with 8300000000.0 parameters.
About Megatron-LM (8.3B)
Recent work in language modeling demonstrates that training large transformer models advances the state of the art in Natural Language Processing applications. However, very large models can be quite difficult to train due to memory constraints. In t
Details
- Provider
- NVIDIA
- Task
- Language modeling/generation
- Parameters
- 8300000000.0
- Released
- 2019-09-17
- Open weights
- No