WikiText-103 & 2

Language Modeling | English

WikiText-103 & 2 is an English language modeling dataset from Merity et al. (2016), with 100M+ instances in TOKENS format.

Details

Task: Language Modeling
Language: English
Format: TOKENS
Rows / instances: 100M+
Creator: Merity et al.
Year: 2016
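
As a rough illustration of what a token-level format means in practice, the sketch below counts tokens in a short pre-tokenized string. The sample text and the `count_tokens` helper are hypothetical, and the assumption that tokens are separated by whitespace should be checked against the actual files before relying on it.

```python
from collections import Counter


def count_tokens(text: str) -> Counter:
    """Count whitespace-separated tokens in a pre-tokenized string.

    Assumption: tokens are delimited by whitespace, so punctuation
    marks that stand alone are counted as separate tokens.
    """
    return Counter(text.split())


# Hypothetical sample, not taken from the dataset.
sample = "the cat sat on the mat . the end ."
counts = count_tokens(sample)

total = sum(counts.values())   # total number of tokens
vocab_size = len(counts)       # number of distinct tokens
print(total, vocab_size)
```

The same two figures, total tokens and vocabulary size, are the ones usually quoted when comparing language modeling corpora.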
