Skip to content

jondurbin/py-dpo-v0.1

General NLPCODEcc-by-4.0

The jondurbin/py-dpo-v0.1 dataset is a CODE General NLP resource from jondurbin at 2024. With 138 downloads and 56 likes, it is actively used by the community. It is released under the cc-by-4.0 license and is a 1K<n<10K-scale dataset.

About jondurbin/py-dpo-v0.1

Overview DPO dataset meant to enhance python coding abilities. This dataset uses the excellent https://huggingface.co/datasets/Vezora/Tested-22k-Python-Alpaca dataset as the "chosen" responses, given this dataset was already tested and validate...

Details

Task
General NLP
Language
CODE
Format
Parquet
Rows / instances
N/A
Size
1K<n<10K
Creator
jondurbin
Year
2024
License
cc-by-4.0
Downloads
138
Likes
56
Download Homepage

Related General NLP datasets

FAQ