DeepGlint-AI/DanQing100M
Zero Shot Image Classification | Image To Text | ZH | cc-by-nc-4.0
DeepGlint-AI/DanQing100M is a Chinese (ZH) dataset for zero-shot image classification and image-to-text tasks. It provides 99,892,381 image-text examples in Parquet format, is distributed under the cc-by-nc-4.0 license, falls in the 10M<n<100M size category, and has been downloaded roughly 1K times.
About DeepGlint-AI/DanQing100M
100M Chinese image-text pairs | 12TB dataset | 2024-2025 web data
DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset
Project Page | Paper | Code
Hengyu Shen∗, Tiancheng Gu∗, Bin Qin, Lan Wu, Yuling Wu, Shuo Tan, ...
Details
- Task: Zero Shot Image Classification, Image To Text
- Language: ZH
- Format: Parquet
- Rows / instances: 99,892,381
- Size: 10M<n<100M
- Creator: DeepGlint-AI
- Year: 2026
- License: cc-by-nc-4.0
- Downloads: 1,013
- Likes: 51
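As a rough sanity check on the figures above, the following Python sketch derives the average storage per example and confirms that the row count lands in the stated size category. It is an illustration only. It assumes "12TB" means about 12 decimal terabytes, and the helper `size_category` and its bucket labels are hypothetical, modeled on the "10M<n<100M" label shown on this card.

```python
# Figures quoted on the dataset card.
NUM_ROWS = 99_892_381
# Assumption: "12TB" is approximate and uses decimal terabytes (10**12 bytes).
TOTAL_BYTES = 12 * 10**12


def size_category(n: int) -> str:
    """Return a size bucket label for n rows (hypothetical helper)."""
    buckets = [
        (10**3, "n<1K"),
        (10**4, "1K<n<10K"),
        (10**5, "10K<n<100K"),
        (10**6, "100K<n<1M"),
        (10**7, "1M<n<10M"),
        (10**8, "10M<n<100M"),
        (10**9, "100M<n<1B"),
    ]
    # Pick the first bucket whose upper bound exceeds n.
    for upper, label in buckets:
        if n < upper:
            return label
    return "n>1B"


# Average on-disk footprint per example under the assumption above.
avg_bytes_per_row = TOTAL_BYTES / NUM_ROWS

if __name__ == "__main__":
    print(size_category(NUM_ROWS))
    print(f"{avg_bytes_per_row / 1000:.1f} kB per example (approx.)")
```

Under these assumptions the average works out to roughly 120 kB per example, and the row count sits just below the 100M boundary, which is why the card reports 10M<n<100M even though the dataset is described as "100M" pairs.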