laion/OIG

General NLP, English

Created by laion in 2023, laion/OIG is an English-language General NLP dataset distributed in Parquet format.

About laion/OIG

This is the Open Instruction Generalist (OIG) Dataset. It is our attempt to create a large instruction dataset of medium quality along with a smaller high-quality instruction dataset (OIG-small-chip2). The data is in the form of jsonl objects, with ...
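Since the description says the data comes as jsonl objects (one JSON object per line), a minimal sketch of reading such a file with the Python standard library may help. The description above is truncated before it names the fields, so the sample records and the keys "example_field" and "other" below are hypothetical placeholders, not the dataset's actual schema; the helper names iter_jsonl and collect_keys are likewise illustrative.

```python
import io
import json


def iter_jsonl(stream):
    """Yield one parsed JSON value per non-empty line of a text stream."""
    for line_number, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        try:
            yield json.loads(line)
        except json.JSONDecodeError as err:
            # Report which line failed so a damaged file is easy to inspect.
            raise ValueError(f"invalid JSON on line {line_number}: {err}") from err


def collect_keys(records):
    """Return the sorted set of top-level keys seen across dict records."""
    keys = set()
    for record in records:
        if isinstance(record, dict):
            keys.update(record.keys())
    return sorted(keys)


# Hypothetical in-memory sample; the keys are placeholders, not the real OIG schema.
sample = io.StringIO(
    '{"example_field": "first"}\n'
    "\n"
    '{"example_field": "second", "other": 1}\n'
)
records = list(iter_jsonl(sample))
print(len(records))
print(collect_keys(records))
```

For a file on disk, the same helper can be passed an open file handle, for example open(path, encoding="utf-8"). Listing the keys first is a cautious way to discover the real field names before writing any code that depends on them.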

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Creator: laion
Year: 2023