HuggingFaceH4/ultrachat_200k

Text Generation | EN | mit

HuggingFaceH4/ultrachat_200k is an English text generation dataset published by HuggingFaceH4 in 2023, comprising 515,311 examples. It has 57,252 downloads (about 57.3K) and 736 likes. It is released under the MIT license and falls in the 100K<n<1M size category.

About HuggingFaceH4/ultrachat_200k

Dataset Card for UltraChat 200k. Dataset Description: This is a heavily filtered version of the UltraChat dataset and was used to train Zephyr-7B-β, a state-of-the-art 7B chat model. The original dataset consists of 1.4M dialogues gen...

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: 515,311
Size: 100K<n<1M
Creator: HuggingFaceH4
Year: 2023
License: mit
Downloads: 57,252
Likes: 736
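
The details above can be checked against the dataset itself. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that the repository loads with its default configuration. The helper names `size_bucket` and `load_row_counts` are hypothetical, and the bucket boundaries are a simplified assumption about how the size label is derived from the row count.

```python
from typing import Dict

REPO_ID = "HuggingFaceH4/ultrachat_200k"

# Size labels in the style used on the listing, as (exclusive upper bound, label).
# Assumption: exact boundary handling is simplified for illustration.
SIZE_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
]


def size_bucket(num_rows: int) -> str:
    """Map a row count to a size label (hypothetical helper)."""
    for upper, label in SIZE_BUCKETS:
        if num_rows < upper:
            return label
    return "n>10M"  # simplified catch-all, not an official label


def load_row_counts(repo_id: str = REPO_ID) -> Dict[str, int]:
    """Download the dataset and return the row count of each split."""
    # Lazy import so the rest of the module works without the library.
    from datasets import load_dataset

    splits = load_dataset(repo_id)  # DatasetDict keyed by split name
    return {name: split.num_rows for name, split in splits.items()}


if __name__ == "__main__":
    counts = load_row_counts()  # requires network access
    total = sum(counts.values())
    print(counts)
    print(total, size_bucket(total))
```

If the listing is accurate, the total across splits should match the 515,311 rows reported above and land in the 100K<n<1M bucket.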