snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset

Text Generation, English

Created by snorkelai in 2024, snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset is an English text generation dataset in Parquet format.

About snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset

Dataset: This is the data used for training the Snorkel model. We use ONLY the prompts from UltraFeedback; no external LLM responses are used. Methodology: Generate 5 response variations for each prompt from a subset of 20,000 using the LLM ...
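
The generation step described above can be sketched roughly as follows. This is a minimal illustration only, not the creators' pipeline: the function names, the random sampling of the subset, and the `placeholder_generate` stand-in for the LLM are all assumptions, and the steps that follow generation are truncated in the description and are not reproduced here.

```python
import random
from typing import Callable, Dict, List

NUM_VARIATIONS = 5     # from the description: 5 response variations per prompt
SUBSET_SIZE = 20_000   # from the description: a subset of 20,000 prompts


def generate_variations(
    prompts: List[str],
    generate: Callable[[str, int], str],
    subset_size: int = SUBSET_SIZE,
    num_variations: int = NUM_VARIATIONS,
    seed: int = 0,
) -> List[Dict[str, object]]:
    """Pick a subset of prompts and collect several candidate responses for each.

    Random sampling is an assumption; the description only says "a subset".
    """
    rng = random.Random(seed)
    k = min(subset_size, len(prompts))
    subset = rng.sample(prompts, k)
    return [
        {"prompt": p, "responses": [generate(p, i) for i in range(num_variations)]}
        for p in subset
    ]


def placeholder_generate(prompt: str, index: int) -> str:
    """Hypothetical stand-in for the LLM call; returns a labelled dummy string."""
    return f"[candidate {index} for: {prompt}]"


if __name__ == "__main__":
    demo = generate_variations(
        ["Explain Parquet.", "What is DPO?", "Summarize this text."],
        placeholder_generate,
        subset_size=2,
    )
    for record in demo:
        print(record["prompt"], len(record["responses"]))
```

In practice `placeholder_generate` would be replaced by a sampling call to the actual model, so that the five responses for a prompt differ from one another.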

Details

Task: Text Generation
Language: English
Format: Parquet
Rows / instances: N/A
Creator: snorkelai
Year: 2024
