Skip to content

allenai/soda

General NLPENcc-by-4.0

Allenai/soda is a General NLP-focused dataset in EN distributed in Parquet format. It is distributed under the cc-by-4.0 license and falls in the 1M<n<10M size category, and has been downloaded 1.6K times.

About allenai/soda

Dataset Card for 🥤SODA Dataset Summary 🥤SODA is the first publicly available, million-scale, high-quality dialogue dataset covering a wide range of social interactions. Dialogues are distilled from a PLM (InstructGPT; Ouyang et al., ...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Size
1M<n<10M
Creator
allenai
Year
2023
License
cc-by-4.0
Downloads
1581
Likes
154
Download Homepage

Related General NLP datasets

FAQ