Skip to content

ecnu-icalk/educhat-sft-002-data-osm

General NLPEnglishcc-by-nc-4.0

Created by ecnu-icalk at 2023, the ecnu-icalk/educhat-sft-002-data-osm is a General NLP dataset in English in Parquet format. With 111 downloads and 38 likes, it is actively used by the community. It is released under the cc-by-nc-4.0 license and is a 1M<n<10M-scale dataset.

About ecnu-icalk/educhat-sft-002-data-osm

每条数据由一个存放对话的list和与数据对应的system_prompt组成。list中按照Q,A顺序存放对话。 数据来源为开源数据,使用CleanTool数据清理工具去重。

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
1M<n<10M
Creator
ecnu-icalk
Year
2023
License
cc-by-nc-4.0
Downloads
111
Likes
38
Download Homepage

Related General NLP datasets

FAQ