HuggingFaceM4/DoclingMatix
Visual Question AnsweringImage Text To TextEN
Created by HuggingFaceM4 at 2025, the HuggingFaceM4/DoclingMatix is a visual question answering dataset in EN in Parquet format. With 1.9K downloads and 52 likes, it is actively used by the community. It is released under the cdla-permissive-2.0 license and is a 1M<n<10M-scale dataset.
About HuggingFaceM4/DoclingMatix
DoclingMatix
DoclingMatix is a large-scale, multimodal dataset designed for training vision-language models in the domain of document intelligence. It was created specifically for training the SmolDocling model, an ultra-compact model for end-t...
Details
- Task
- Visual Question Answering, Image Text To Text
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1M<n<10M
- Creator
- HuggingFaceM4
- Year
- 2025
- License
- cdla-permissive-2.0
- Downloads
- 1929
- Likes
- 52