BelleGroup/train_2M_CN
General NLP · ZH · gpl-3.0
BelleGroup/train_2M_CN is a general-purpose NLP dataset in Chinese (ZH), distributed in Parquet format under the gpl-3.0 license. It falls in the 1M<n<10M size category and has been downloaded about 1.8K times.
About BelleGroup/train_2M_CN
Contents
Contains approximately 2 million Chinese instruction examples generated by the BELLE project.
Example
{
"instruction": "将以下三个句子组合成一个有意义的段落。\n狗是人类最好的朋友。它们非常聪明,可以进行各种活动。如果你喜欢散步,狗可以成为你一起散步的伙伴。",
"input": "",
"output": "狗是人类最好的朋友,它们非常聪明,可以进行各种活动。如果你喜欢散步,狗可以成为你一起散步的伙伴。出门散步是一种良好的锻炼方式,而有狗的陪伴会让散步变得更有趣,并...
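Each record carries three string fields: "instruction", "input", and "output". A minimal sketch of turning one record into a prompt/response pair follows. The helper name build_prompt and the blank-line joining convention are assumptions made for illustration, not a template documented by the BELLE project, and the record used here is an abbreviated stand-in for the sample above.

```python
import json


def build_prompt(record: dict) -> str:
    """Join the instruction with the optional input field.

    Hypothetical helper: the blank-line separator is an assumed
    convention, not one specified by the dataset.
    """
    instruction = record.get("instruction", "").strip()
    extra = record.get("input", "").strip()
    # Many records have an empty "input"; in that case use the instruction alone.
    return f"{instruction}\n\n{extra}" if extra else instruction


# Abbreviated stand-in record; the "output" value is a placeholder,
# not real dataset content.
raw = (
    '{"instruction": "将以下三个句子组合成一个有意义的段落。", '
    '"input": "", '
    '"output": "(placeholder)"}'
)
record = json.loads(raw)
prompt = build_prompt(record)
response = record["output"]
```

Dropping the empty "input" field avoids trailing blank lines in the prompt, which matters if the text is later tokenized for fine-tuning.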
Details
- Task: General NLP
- Language: ZH
- Format: Parquet
- Rows / instances: N/A
- Size: 1M<n<10M
- Creator: BelleGroup
- Year: 2023
- License: gpl-3.0
- Downloads: 1810
- Likes: 110