
jamesqijingsong/zidian

General NLP | ZH, EN | cc-by-nc-4.0

The jamesqijingsong/zidian dataset is a Chinese and English (ZH, EN) General NLP resource published by jamesqijingsong in 2025. It has 87,899 downloads and 0 likes, is released under the cc-by-nc-4.0 license, and falls in the 1K<n<10K size category.

About jamesqijingsong/zidian

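Timeline:
2018: built as a website, https://zidian.18dao.net
2024: used AI to generate illustrations for the 《國語字典》 (Mandarin character dictionary).
2025: uploaded to Hugging Face as a dataset.

Files in the dataset:
Directory "image/": 4307 files, the original text-to-image PNG images
Directory "image-zidian/": 4307 files, JPG images with text added
Directory "text-zidian/": 4307 files, explanatory text for the images
Directory "pinyin/": 1702 files, ...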
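The per-directory file counts listed above could be checked with a short script. The following is a minimal sketch, not the author's tooling: `count_files_by_directory` and `list_dataset_files` are hypothetical helper names, the sample paths are made up for illustration, and it assumes the repository layout matches the directory listing. `list_repo_files` is the `huggingface_hub` function for listing a repository's files and needs network access.

```python
from collections import Counter


def count_files_by_directory(paths):
    """Count files per top-level directory; root-level files are keyed by ''."""
    counts = Counter()
    for path in paths:
        top, sep, _ = path.partition("/")
        # Keep the trailing slash so keys look like "image/", as in the listing.
        counts[top + "/" if sep else ""] += 1
    return dict(counts)


def list_dataset_files(repo_id="jamesqijingsong/zidian"):
    """Fetch all file paths in the dataset repo (requires network access)."""
    # Imported lazily so the counting helper works without huggingface_hub.
    from huggingface_hub import list_repo_files

    return list_repo_files(repo_id, repo_type="dataset")


# Illustrative paths only; the real file names are not given in the listing.
sample_paths = ["image/a.png", "image/b.png", "image-zidian/a.jpg", "README.md"]
print(count_files_by_directory(sample_paths))
```

To compare against the published counts, one could call `count_files_by_directory(list_dataset_files())` and inspect the entries for "image/", "image-zidian/", "text-zidian/", and "pinyin/".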

Details

Task: General NLP
Language: ZH, EN
Format: Parquet
Rows / instances: N/A
Size: 1K<n<10K
Creator: jamesqijingsong
Year: 2025
License: cc-by-nc-4.0
Downloads: 87899
Likes: 0