argilla/dpo-mix-7k
General NLPENmit
Argilla/dpo-mix-7k is a General NLP-focused dataset in EN that provides 7,500 labeled examples distributed in Parquet format. It is distributed under the mit license and falls in the 1K<n<10K size category, and has been downloaded 1.2K times.
About argilla/dpo-mix-7k
Argilla DPO Mix 7K Dataset
A small cocktail combining DPO datasets built by Argilla with distilabel. The goal of this dataset is having a small, high-quality DPO dataset by filtering only highly rated chosen responses.
...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- 7500
- Size
- 1K<n<10K
- Creator
- argilla
- Year
- 2024
- License
- mit
- Downloads
- 1172
- Likes
- 175