TCAN (WT2)

Nanjing University · Ant Group · Language modeling

TCAN (WT2) is a language modeling model published by Nanjing University and Ant Group in 2020, with 33 million parameters.

About TCAN (WT2)

With the development of feed-forward models, the default model for sequence modeling has gradually evolved to replace recurrent networks. Many powerful feed-forward models based on convolutional networks and attention mechanism were proposed and show

Details

Provider
Nanjing University, Ant Group
Task
Language modeling
Parameters
33,000,000
Released
2020-02-28
Open weights
No