VMware/open-instruct

Text Generation, EN

VMware/open-instruct is an English-language text generation dataset distributed in Parquet format.

About VMware/open-instruct

Dataset Card for "open-instruct". This dataset is a combination of: a filtered subset of OpenAssistant/oasst1; the train split of Mosaic-dolly-hhrlhf (which consists of Databricks' dolly-15k dataset and a filtered subset of Anthropic's HH-RLHF); filtered su...

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: VMware
Year: 2023
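
As a rough illustration of how a dataset with these details might be loaded, the sketch below assumes that VMware/open-instruct is hosted on the Hugging Face Hub under that identifier, that the third-party `datasets` library is installed, and that a split named "train" exists. None of these points is stated on this page. The helper name `load_open_instruct` is hypothetical.

```python
# Minimal sketch, not an official loading recipe.
# Assumptions: the dataset lives on the Hugging Face Hub under this ID,
# the `datasets` package is installed, and a "train" split exists.

DATASET_ID = "VMware/open-instruct"


def load_open_instruct(split="train"):
    """Download (or reuse a cached copy of) the dataset and return one split."""
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


if __name__ == "__main__":
    ds = load_open_instruct()
    # Inspect the size and schema before relying on any column names.
    print(ds.num_rows)
    print(ds.column_names)
```

Because the row count is listed as N/A above, checking `num_rows` and `column_names` after loading is a sensible first step before building on the data.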