Skip to content

AWD-LSTM-MoS+PDR + dynamic evaluation (WT2)

IBMLanguage modeling

AWD-LSTM-MoS+PDR + dynamic evaluation (WT2) is language modeling model published by IBM in 2018 featuring 35000000.0 parameters.

About AWD-LSTM-MoS+PDR + dynamic evaluation (WT2)

Highly regularized LSTMs achieve impressive results on several benchmark datasets in language modeling. We propose a new regularization method based on decoding the last token in the context using the predicted distribution of the next token. This bi

Details

Provider
IBM
Task
Language modeling
Parameters
35000000.0
Released
2018-08-14
Open weights
No
View model source

Explore

FAQ