AWD-LSTM + MoS + Partial Shuffled
University of Texas at Austin | Language modeling | Open weights
AWD-LSTM + MoS + Partial Shuffled is an open-weights language model from the University of Texas at Austin with 35 million parameters.
About AWD-LSTM + MoS + Partial Shuffled
Recently, substantial progress has been made in language modeling by using deep neural networks. However, in practice, large scale neural language models have been shown to be prone to overfitting. In this paper, we present a simple yet highly effect
Details
- Provider: University of Texas at Austin
- Task: Language modeling
- Parameters: 35,000,000
- Released: 2019-06-10
- Open weights: Yes