wenbopan/Chinese-dpo-pairs
General NLPZH
Wenbopan/Chinese-dpo-pairs is a General NLP-focused dataset in ZH distributed in Parquet format.
About wenbopan/Chinese-dpo-pairs
Dataset Card for Chinese-dpo-pairs
Well-curated 10K reference pairs in Chinese. Data are created by GPT-3.5 translation from multiple sources, including:
flan_v2, sharegpt, ultrachat, evol_instruct and false_qa. Sampled from argilla/ultrafeedb...
Details
- Task
- General NLP
- Language
- ZH
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- wenbopan
- Year
- 2024