Skip to content

Anthropic/hh-rlhf

General NLPEnglishmit

Anthropic/hh-rlhf is a General NLP dataset in English from Anthropic in Parquet format. It is distributed under the mit license and falls in the 100K<n<1M size category, and has been downloaded 28.2K times.

About Anthropic/hh-rlhf

Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Lear...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
Anthropic
Year
2022
License
mit
Downloads
28227
Likes
1800
Download Homepage

Related General NLP datasets

FAQ