HuggingFaceM4/FineVisionMax
Image Text To TextEN, ZH
HuggingFaceM4/FineVisionMax is a image text to text dataset in EN, ZH from HuggingFaceM4 in Parquet format. And falls in the 10M<n<100M size category, and has been downloaded 43.7K times.
About HuggingFaceM4/FineVisionMax
Fine Vision
FineVision is a massive collection of datasets with 17.3M images, 24.3M samples, 88.9M turns, and 9.5B answer tokens, designed for training state-of-the-art open Vision-Language-Models.
More detail can be found in the blog post: ht...
Details
- Task
- Image Text To Text
- Language
- EN, ZH
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 10M<n<100M
- Creator
- HuggingFaceM4
- Year
- 2025
- Downloads
- 43689
- Likes
- 27