ST-MoE
The ST-MoE model is a language modeling/generation model from Google, Google Brain, and Google Research with 269 billion parameters.
About ST-MoE
Scale has opened new frontiers in natural language processing -- but at a high cost. In response, Mixture-of-Experts (MoE) and Switch Transformers have been proposed as an energy-efficient path to even larger and more capable language models. But adv…
Details
- Provider
- Google, Google Brain, Google Research
- Task
- Language modeling/generation
- Parameters
- 269 billion (269,000,000,000)
- Released
- 2022-02-17
- Open weights
- No