Skip to content

EleutherAI/wikitext_document_level

General NLPEnglishcc-by-sa-3.0

The EleutherAI/wikitext_document_level dataset is a English General NLP resource from EleutherAI at 2023. With 52.8K downloads and 18 likes, it is actively used by the community. It is released under the cc-by-sa-3.0 license and is a 10K<n<100K-scale dataset.

About EleutherAI/wikitext_document_level

Wikitext Document Level This is a modified version of https://huggingface.co/datasets/wikitext that returns Wiki pages instead of Wiki text line-by-line. The original readme is contained below. Dataset Card for "wikitext" Dat...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
10K<n<100K
Creator
EleutherAI
Year
2023
License
cc-by-sa-3.0
Downloads
52843
Likes
18
Download Homepage

Related General NLP datasets

FAQ