BelleGroup/train_2M_CN

General NLP | ZH | gpl-3.0

BelleGroup/train_2M_CN is a Chinese (ZH) general NLP dataset in Parquet format. It is distributed under the gpl-3.0 license, falls in the 1M<n<10M size category, and has been downloaded 1.8K times.

About BelleGroup/train_2M_CN

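Content: approximately 2 million Chinese instruction examples generated by the BELLE project. Sample (kept in the original Chinese, truncated in the source): { "instruction": "将以下三个句子组合成一个有意义的段落。\n狗是人类最好的朋友。它们非常聪明,可以进行各种活动。如果你喜欢散步,狗可以成为你一起散步的伙伴。", "input": "", "output": "狗是人类最好的朋友,它们非常聪明,可以进行各种活动。如果你喜欢散步,狗可以成为你一起散步的伙伴。出门散步是一种良好的锻炼方式,而有狗的陪伴会让散步变得更有趣,并...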
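
The sample record above has three string fields: `instruction`, `input`, and `output`. As a minimal sketch of how such a record might be turned into a prompt for training or evaluation, the helper below joins `instruction` with `input` when the latter is non-empty. The function name `build_prompt`, the blank-line separator, and the example record are assumptions made for illustration; they are not part of the dataset or of the BELLE project's own tooling.

```python
from typing import Dict

# Field names taken from the sample record shown on this page.
REQUIRED_FIELDS = ("instruction", "input", "output")


def build_prompt(record: Dict[str, str]) -> str:
    """Join `instruction` and the optional `input` into one prompt string.

    Hypothetical helper: the blank-line separator is an assumption,
    not a format prescribed by the dataset.
    """
    missing = [name for name in REQUIRED_FIELDS if name not in record]
    if missing:
        raise KeyError(f"record is missing fields: {missing}")
    instruction = record["instruction"].strip()
    extra = record["input"].strip()
    # Many records leave `input` empty, as the page's sample does.
    return instruction if not extra else f"{instruction}\n\n{extra}"


# Made-up record for illustration only; it is not taken from the dataset.
example = {
    "instruction": "Translate the sentence into English.",
    "input": "你好,世界。",
    "output": "Hello, world.",
}
prompt = build_prompt(example)
```

If the Hugging Face `datasets` library is installed, the records would typically be fetched with `load_dataset("BelleGroup/train_2M_CN")` and each row passed to a helper like this one; that step is left out here because it requires network access.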

Details

Task: General NLP
Language: ZH
Format: Parquet
Rows / instances: N/A
Size: 1M<n<10M
Creator: BelleGroup
Year: 2023
License: gpl-3.0
Downloads: 1810
Likes: 110