AWD-LSTM + MoS + Partial Shuffled

University of Texas at Austin | Language modeling | Open weights

AWD-LSTM + MoS + Partial Shuffled is an open-weights language model from the University of Texas at Austin with 35 million parameters.

About AWD-LSTM + MoS + Partial Shuffled

Recently, substantial progress has been made in language modeling by using deep neural networks. However, in practice, large scale neural language models have been shown to be prone to overfitting. In this paper, we present a simple yet highly effective ...

Details

Provider: University of Texas at Austin
Task: Language modeling
Parameters: 35,000,000
Released: 2019-06-10
Open weights: Yes