snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
Text Generation, English
Created by snorkelai in 2024, snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset is an English text generation dataset in Parquet format.
About snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
Dataset:
This is the data used to train the Snorkel model.
We use ONLY the prompts from UltraFeedback; no external LLM responses are used.
Methodology:
Generate 5 response variations for each prompt from a subset of 20,000 using the LLM ...
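The sampling-and-generation step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the function `generate_candidates`, its parameters, and the stand-in generator `toy_generate` are all hypothetical names introduced here, and the actual LLM and any later steps of the methodology are not shown because the description above is truncated.

```python
import random
from typing import Callable, Dict, List


def generate_candidates(
    prompts: List[str],
    generate: Callable[[str, int], str],
    subset_size: int = 20_000,
    n_variations: int = 5,
    seed: int = 0,
) -> List[Dict[str, object]]:
    """Sample a subset of prompts and collect several responses for each.

    `generate` is a placeholder for a call to an LLM (an assumption here);
    it receives the prompt and the index of the variation.
    """
    rng = random.Random(seed)
    # Never ask for more prompts than are available.
    k = min(subset_size, len(prompts))
    subset = rng.sample(prompts, k)
    return [
        {
            "prompt": prompt,
            "responses": [generate(prompt, i) for i in range(n_variations)],
        }
        for prompt in subset
    ]


def toy_generate(prompt: str, variation: int) -> str:
    # Hypothetical stand-in for a real model call, so the sketch runs offline.
    return f"[variation {variation}] response to: {prompt}"


# Small demonstration with made-up prompts.
example_prompts = [f"prompt {i}" for i in range(10)]
records = generate_candidates(example_prompts, toy_generate, subset_size=3)
```

Each record pairs one sampled prompt with its list of candidate responses; swapping `toy_generate` for a real model call would leave the rest of the structure unchanged.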
Details
- Task: Text Generation
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: snorkelai
- Year: 2024