dikw/hh_rlhf_cn
General NLPEnglish
Created by dikw at 2023, the dikw/hh_rlhf_cn is a General NLP dataset in English in Parquet format. With 193 downloads and 79 likes, it is actively used by the community. It is released under the llama2 license and is a 100K<n<1M-scale dataset.
About dikw/hh_rlhf_cn
hh-rlhf中文翻译版本
基于Anthropic论文Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback 开源的helpful 和harmless数据,使用翻译工具进行了翻译。hh_rlhf_train.jsonl 合并中英文训练集数据 清洗过后17万条hh_rlhf_test.jsonl 合并中英文测试集数据 清洗过后9千条harmless_base_cn...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 100K<n<1M
- Creator
- dikw
- Year
- 2023
- License
- llama2
- Downloads
- 193
- Likes
- 79