Open-Orca/FLAN

General NLP, EN

Open-Orca/FLAN is an English-language, general-purpose NLP dataset distributed in Parquet format.

About Open-Orca/FLAN

🍮 The WHOLE FLAN Collection! 🍮

Overview

This repository includes the full dataset from the FLAN Collection, totalling ~300GB of Parquet files. It was generated using the official seqio templating from the Google FLAN Collection GitHub repo. The d...

Details

Task: General NLP
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: Open-Orca
Year: 2023