TCAN (WT2)
Nanjing UniversityAnt GroupLanguage modeling
TCAN (WT2) is language modeling model published by Nanjing University,Ant Group in 2020 featuring 33000000.0 parameters.
About TCAN (WT2)
With the development of feed-forward models, the default model for sequence modeling has gradually evolved to replace recurrent networks. Many powerful feed-forward models based on convolutional networks and attention mechanism were proposed and show
Details
- Provider
- Nanjing University,Ant Group
- Task
- Language modeling
- Parameters
- 33000000.0
- Released
- 2020-02-28
- Open weights
- No