HuggingFaceH4/ultrachat_200k
Text GenerationENmit
The HuggingFaceH4/ultrachat_200k dataset is a EN text generation resource from HuggingFaceH4 at 2023 comprising 515,311 examples. With 57.3K downloads and 736 likes, it is actively used by the community. It is released under the mit license and is a 100K<n<1M-scale dataset.
About HuggingFaceH4/ultrachat_200k
Dataset Card for UltraChat 200k
Dataset Description
This is a heavily filtered version of the UltraChat dataset and was used to train Zephyr-7B-β, a state of the art 7b chat model.
The original datasets consists of 1.4M dialogues gen...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- 515311
- Size
- 100K<n<1M
- Creator
- HuggingFaceH4
- Year
- 2023
- License
- mit
- Downloads
- 57252
- Likes
- 736