ecnu-icalk/educhat-sft-002-data-osm
General NLPEnglishcc-by-nc-4.0
Created by ecnu-icalk at 2023, the ecnu-icalk/educhat-sft-002-data-osm is a General NLP dataset in English in Parquet format. With 111 downloads and 38 likes, it is actively used by the community. It is released under the cc-by-nc-4.0 license and is a 1M<n<10M-scale dataset.
About ecnu-icalk/educhat-sft-002-data-osm
每条数据由一个存放对话的list和与数据对应的system_prompt组成。list中按照Q,A顺序存放对话。
数据来源为开源数据,使用CleanTool数据清理工具去重。
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1M<n<10M
- Creator
- ecnu-icalk
- Year
- 2023
- License
- cc-by-nc-4.0
- Downloads
- 111
- Likes
- 38