HuggingFaceM4/FineVisionMax

Image Text To Text | EN, ZH

HuggingFaceM4/FineVisionMax is an image-text-to-text dataset in English and Chinese (EN, ZH), published by HuggingFaceM4 in Parquet format. It falls in the 10M<n<100M size category and has been downloaded 43.7K times.

About HuggingFaceM4/FineVisionMax

FineVision is a massive collection of datasets with 17.3M images, 24.3M samples, 88.9M turns, and 9.5B answer tokens, designed for training state-of-the-art open Vision-Language Models. More detail can be found in the blog post: ht...
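To give a feel for the scale of these figures, the short sketch below derives a few per-unit averages from the four aggregate counts quoted above. The function name `summarize` and the dictionary keys are illustrative choices, not part of the dataset or any library, and the inputs are the rounded headline numbers, so the results are approximate aggregates rather than official statistics.

```python
# Headline counts quoted in the description (rounded figures).
IMAGES = 17.3e6         # 17.3M images
SAMPLES = 24.3e6        # 24.3M samples
TURNS = 88.9e6          # 88.9M turns
ANSWER_TOKENS = 9.5e9   # 9.5B answer tokens


def summarize(images, samples, turns, answer_tokens):
    """Return aggregate per-unit averages derived from the total counts.

    Hypothetical helper for illustration only; these are ratios of
    totals, not per-example statistics measured on the data itself.
    """
    return {
        "samples_per_image": samples / images,
        "turns_per_sample": turns / samples,
        "answer_tokens_per_turn": answer_tokens / turns,
        "answer_tokens_per_sample": answer_tokens / samples,
    }


ratios = summarize(IMAGES, SAMPLES, TURNS, ANSWER_TOKENS)
for name, value in ratios.items():
    print(f"{name}: {value:.2f}")
```

On these rounded inputs, a sample averages roughly 3.7 turns and a turn averages roughly 107 answer tokens. Because the ratios are taken over totals, they say nothing about how turns or tokens are distributed across individual samples.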

Details

Task: Image Text To Text
Language: EN, ZH
Format: Parquet
Rows / instances: N/A
Size: 10M<n<100M
Creator: HuggingFaceM4
Year: 2025
Downloads: 43689
Likes: 27