jamesqijingsong/zidian
General NLPZH, ENcc-by-nc-4.0
The jamesqijingsong/zidian dataset is a ZH, EN General NLP resource from jamesqijingsong at 2025. With 87.9K downloads and 0 likes, it is actively used by the community. It is released under the cc-by-nc-4.0 license and is a 1K<n<10K-scale dataset.
About jamesqijingsong/zidian
时间线:
2018年搭建成网站 https://zidian.18dao.net
2024年使用AI技術為《國語字典》生成配圖。
2025年上傳到Hugging Face做成數據集。
数据集中的文件:
目录 "image/" 下的文件数量: 4307,文生圖原始png圖片
目录 "image-zidian/" 下的文件数量: 4307,加字後的jpg圖片
目录 "text-zidian/" 下的文件数量: 4307,圖片解釋文字
目录 "pinyin/" 下的文件数量: 1702,...
Details
- Task
- General NLP
- Language
- ZH, EN
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1K<n<10K
- Creator
- jamesqijingsong
- Year
- 2025
- License
- cc-by-nc-4.0
- Downloads
- 87899
- Likes
- 0