Chinchilla
DeepMind · Language modeling
Developed by DeepMind in 2022, Chinchilla is a language model with 70 billion parameters.
About Chinchilla
We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly undertrained, a consequence of the recent focus on scaling
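The trade-off described above, between model size and number of training tokens under a fixed compute budget, can be illustrated with a rough sketch. It assumes the widely used approximation that training compute is about 6 FLOPs per parameter per token (C ≈ 6·N·D); that approximation, the function names, the token count, and the tokens-per-parameter ratio used below are illustrative assumptions and are not stated on this page.

```python
import math

# Assumption: training compute C ~= 6 * N * D (N parameters, D tokens).
# This is a common rule of thumb, not a figure taken from this page.
FLOPS_PER_PARAM_TOKEN = 6


def training_flops(n_params: float, n_tokens: float) -> float:
    """Approximate training compute in FLOPs for N parameters and D tokens."""
    return FLOPS_PER_PARAM_TOKEN * n_params * n_tokens


def split_budget(compute_flops: float, tokens_per_param: float) -> tuple[float, float]:
    """Split a compute budget into (parameters, tokens) for a chosen D/N ratio.

    With D = r * N and C = 6 * N * D, it follows that N = sqrt(C / (6 * r)).
    The ratio r is a hypothetical input supplied by the caller.
    """
    n_params = math.sqrt(compute_flops / (FLOPS_PER_PARAM_TOKEN * tokens_per_param))
    return n_params, tokens_per_param * n_params


if __name__ == "__main__":
    n_params = 70e9               # parameter count listed on this page
    hypothetical_tokens = 1.4e12  # illustrative token count (assumption)
    budget = training_flops(n_params, hypothetical_tokens)
    print(f"approximate training compute: {budget:.3e} FLOPs")

    # Re-split the same budget at a hypothetical ratio of 20 tokens per parameter.
    n, d = split_budget(budget, tokens_per_param=20)
    print(f"parameters: {n:.3e}, tokens: {d:.3e}")
```

Under this approximation, holding the budget fixed and raising the tokens-per-parameter ratio yields a smaller model trained on more data, which is the direction the abstract argues for.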
Details
- Provider
- DeepMind
- Task
- Language modeling
- Parameters
- 70 billion
- Released
- 2022-03-29
- Open weights
- No