Skip to content

ST-MoE

GoogleGoogle BrainGoogle ResearchLanguage modeling/generation

The ST-MoE model is a language modeling/generation model from Google,Google Brain,Google Research with 269000000000.0 parameters.

About ST-MoE

Scale has opened new frontiers in natural language processing -- but at a high cost. In response, Mixture-of-Experts (MoE) and Switch Transformers have been proposed as an energy efficient path to even larger and more capable language models. But adv

Details

Provider
Google,Google Brain,Google Research
Task
Language modeling/generation
Parameters
269000000000.0
Released
2022-02-17
Open weights
No
View model source

Explore

FAQ