Skip to content

argilla/dpo-mix-7k

General NLPENmit

Argilla/dpo-mix-7k is a General NLP-focused dataset in EN that provides 7,500 labeled examples distributed in Parquet format. It is distributed under the mit license and falls in the 1K<n<10K size category, and has been downloaded 1.2K times.

About argilla/dpo-mix-7k

Argilla DPO Mix 7K Dataset A small cocktail combining DPO datasets built by Argilla with distilabel. The goal of this dataset is having a small, high-quality DPO dataset by filtering only highly rated chosen responses. ...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
7500
Size
1K<n<10K
Creator
argilla
Year
2024
License
mit
Downloads
1172
Likes
175
Download Homepage

Related General NLP datasets

FAQ