BERT-Large-CAS (PTB+WT2+WT103)
Amazon, Neural Architecture Search - NAS, Language modeling/generation
BERT-Large-CAS (PTB+WT2+WT103) is a Neural Architecture Search (NAS) language model from Amazon, released in 2019, with 395 million parameters.
About BERT-Large-CAS (PTB+WT2+WT103)
The Transformer architecture is superior to RNN-based models in computational efficiency. Recently, GPT and BERT have demonstrated the efficacy of Transformer models on various NLP tasks using language models pre-trained on large-scale corpora. Surprisingl
Details
- Provider: Amazon
- Task: Neural Architecture Search - NAS, Language modeling/generation
- Parameters: 395,000,000
- Released: 2019-04-20
- Open weights: No