VMware/open-instruct
Text Generation, EN
VMware/open-instruct is an English-language (EN) text generation dataset distributed in Parquet format.
About VMware/open-instruct
Dataset Card for "open-instruct"
This dataset is a combination of:
- Filtered subset of OpenAssistant/oasst1
- Train split of Mosaic-dolly-hhrlhf (consists of Databricks' dolly-15k dataset and a filtered subset of Anthropic's HH-RLHF)
- Filtered su...
Details
- Task: Text Generation
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: VMware
- Year: 2023
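
Because the dataset is published as Parquet under the `VMware/open-instruct` identifier, one way to inspect it is through the Hugging Face `datasets` library. The snippet below is a minimal sketch, not an official loading recipe: the helper names `load_open_instruct` and `count_by_column` are hypothetical, the `train` split name is an assumption, and the card excerpt above does not state the column names, so any column passed to `count_by_column` should first be checked against the loaded schema.

```python
from collections import Counter

# Dataset identifier as given on the card.
DATASET_ID = "VMware/open-instruct"


def load_open_instruct(split="train"):
    """Download one split of the dataset from the Hugging Face Hub.

    Assumption: a split named "train" exists. Requires the third-party
    `datasets` package and network access, so the import is deferred
    until the function is actually called.
    """
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


def count_by_column(rows, column):
    """Count how often each value of `column` appears in `rows`.

    `rows` is any iterable of dict-like records, such as a loaded split.
    Records that lack the column are counted under None.
    """
    return Counter(row.get(column) for row in rows)
```

A typical session might call `load_open_instruct()`, look at the returned object's `column_names` attribute to see the real schema, and then pass whichever column records the originating dataset (if one exists) to `count_by_column` to see how the combined sources are distributed.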