Open-Orca/FLAN
General NLPEN
Open-Orca/FLAN is a General NLP-focused dataset in EN distributed in Parquet format.
About Open-Orca/FLAN
🍮 The WHOLE FLAN Collection! 🍮
Overview
This repository includes the full dataset from the FLAN Collection, totalling ~300GB as parquets.
Generated using the official seqio templating from the Google FLAN Collection GitHub repo.
The d...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- Open-Orca
- Year
- 2023