tasksource/oasst1_pairwise_rlhf_reward
General NLPEN, ES, RU
Tasksource/oasst1_pairwise_rlhf_reward is a General NLP dataset in EN, ES, RU from tasksource in Parquet format.
About tasksource/oasst1_pairwise_rlhf_reward
Dataset Card for "oasst1_pairwise_rlhf_reward"
OASST1 dataset preprocessed for reward modeling:
import pandas as pd
from datasets import load_dataset,concatenate_datasets, Dataset, DatasetDict
import numpy as np
dataset = load_dataset("OpenAss...
Details
- Task
- General NLP
- Language
- EN, ES, RU
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- tasksource
- Year
- 2023