Skip to content

EleutherAI/lambada_openai

General NLPDE, EN, ESmit

EleutherAI/lambada_openai is a General NLP-focused dataset in DE, EN, ES that provides 30,918 labeled examples distributed in Parquet format. It is distributed under the mit license and falls in the 10K<n<100K size category, and has been downloaded 76.4K times.

About EleutherAI/lambada_openai

Dataset Summary This dataset is comprised of the LAMBADA test split as pre-processed by OpenAI (see relevant discussions here and here). It also contains machine translated versions of the split in German, Spanish, French, and Italian. LAMBADA ...

Details

Task
General NLP
Language
DE, EN, ES
Format
Parquet
Rows / instances
30918
Size
10K<n<100K
Creator
EleutherAI
Year
2022
License
mit
Downloads
76391
Likes
49
Download Homepage

Related General NLP datasets

FAQ