a686d380/sis-novel
General NLPEnglishopenrail
Created by a686d380 at 2023, the a686d380/sis-novel is a General NLP dataset in English in Parquet format. With 26 downloads and 44 likes, it is actively used by the community. It is released under the openrail license.
About a686d380/sis-novel
这是一个中文H小说数据集,收集自sis001
sis-novel1为中短篇小说,112182项,解压缩后大小5.7GB,数据截止2022年7月
sis-novel2为长篇小说,4555项,解压缩后大小3.6GB,数据截止2023年3月
数据均为未清洗的txt版本,并且可能包含有评论
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- a686d380
- Year
- 2023
- License
- openrail
- Downloads
- 26
- Likes
- 44