llamafactory/DPO-En-Zh-20k

Text Generation | EN, ZH

llamafactory/DPO-En-Zh-20k is a text generation dataset in English and Chinese (EN, ZH), distributed in Parquet format.

About llamafactory/DPO-En-Zh-20k

This dataset is composed of 4,000 examples of argilla/distilabel-capybara-dpo-7k-binarized with chosen score>=4; 3,000 examples of argilla/distilabel-intel-orca-dpo-pairs with chosen score>=8; 3,000 examples of argilla/ultrafeedback-binarized-pre...
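
The selection rule described above (keep examples whose chosen score meets a threshold, then take a fixed number of them) can be sketched roughly as follows. This is an illustrative sketch only, not the creators' actual pipeline: the field name `chosen_score`, the helper `select_examples`, the toy rows, and the use of seeded random sampling are all assumptions, since the description does not say how the retained examples were picked.

```python
import random


def select_examples(rows, min_score, n, seed=0):
    """Keep rows whose chosen score is >= min_score, then return at most n of them.

    `chosen_score` is a hypothetical field name used for illustration; the
    source datasets may store their ratings under different columns.
    """
    eligible = [row for row in rows if row["chosen_score"] >= min_score]
    if len(eligible) <= n:
        return eligible
    # Seeded sampling is an assumption; the description does not state
    # how the retained examples were chosen among the eligible ones.
    return random.Random(seed).sample(eligible, n)


# Toy stand-in for a preference dataset (not real data).
toy_rows = [{"prompt": f"q{i}", "chosen_score": i % 6} for i in range(60)]

# Analogous to "4,000 examples ... with chosen score>=4", at toy scale.
subset = select_examples(toy_rows, min_score=4, n=5)
```

With real data, the same function would be called once per source dataset, each with its own threshold and target count, and the results concatenated.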

Details

Task: Text Generation
Language: EN, ZH
Format: Parquet
Rows / instances: N/A
Creator: llamafactory
Year: 2024