alibaba-multimodal-industrial-ai/IndustryBench-MIPU
Image To TextVisual Question AnsweringZHmit
Created by alibaba-multimodal-industrial-ai at 2026, the alibaba-multimodal-industrial-ai/IndustryBench-MIPU is a image to text dataset in ZH in Parquet format. With 20.3K downloads and 7 likes, it is actively used by the community. It is released under the mit license and is a 10K<n<100K-scale dataset.
About alibaba-multimodal-industrial-ai/IndustryBench-MIPU
IndustryBench-MIPU: Benchmarking Multi-Image Attribute Value Extraction for Industrial Products
Multi-Image Industrial Product Understanding Benchmark — evaluating MLLMs on structured attribute extraction from real-world industrial product imag...
Details
- Task
- Image To Text, Visual Question Answering
- Language
- ZH
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 10K<n<100K
- Creator
- alibaba-multimodal-industrial-ai
- Year
- 2026
- License
- mit
- Downloads
- 20300
- Likes
- 7