
GShard (dense)

Google, Translation

Developed by Google in 2020, GShard (dense) is a translation model with 2.3 billion parameters.

About GShard (dense)

Neural network scaling has been critical for improving the model quality in many real-world machine learning applications with vast amounts of training data and compute. Although this trend of scaling is affirmed to be a sure-fire approach for better ...

Details

Provider: Google
Task: Translation
Parameters: 2.3 billion
Released: 2020-06-30
Open weights: No
