Anthropic/hh-rlhf
General NLPEnglishmit
Anthropic/hh-rlhf is a General NLP dataset in English from Anthropic in Parquet format. It is distributed under the mit license and falls in the 100K<n<1M size category, and has been downloaded 28.2K times.
About Anthropic/hh-rlhf
Dataset Card for HH-RLHF
Dataset Summary
This repository provides access to two different kinds of data:
Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Lear...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 100K<n<1M
- Creator
- Anthropic
- Year
- 2022
- License
- mit
- Downloads
- 28227
- Likes
- 1800