llamafactory/DPO-En-Zh-20k
Text GenerationEN, ZH
Llamafactory/DPO-En-Zh-20k is a text generation-focused dataset in EN, ZH distributed in Parquet format.
About llamafactory/DPO-En-Zh-20k
This dataset is composed by
4,000 examples of argilla/distilabel-capybara-dpo-7k-binarized with chosen score>=4.
3,000 examples of argilla/distilabel-intel-orca-dpo-pairs with chosen score>=8.
3,000 examples of argilla/ultrafeedback-binarized-pre...
Details
- Task
- Text Generation
- Language
- EN, ZH
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- llamafactory
- Year
- 2024