argilla/distilabel-intel-orca-dpo-pairs
General NLPEnglishapache-2.0
The argilla/distilabel-intel-orca-dpo-pairs dataset is a English General NLP resource from argilla at 2024. With 9.2K downloads and 183 likes, it is actively used by the community. It is released under the apache-2.0 license and is a 10K<n<100K-scale dataset.
About argilla/distilabel-intel-orca-dpo-pairs
distilabel Orca Pairs for DPO
The dataset is a "distilabeled" version of the widely used dataset: Intel/orca_dpo_pairs. The original dataset has been used by 100s of open-source practitioners and models. We knew from fixing UltraFeedback (and b...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 10K<n<100K
- Creator
- argilla
- Year
- 2024
- License
- apache-2.0
- Downloads
- 9177
- Likes
- 183