Skip to content

ScienceOne-AI/S1-MMAlign

Image To TextVisual Question AnsweringFeature ExtractionEN

ScienceOne-AI/S1-MMAlign is a image to text-focused dataset in EN distributed in Parquet format.

About ScienceOne-AI/S1-MMAlign

S1-MMAlign A Large-Scale Multi-Disciplinary Scientific Multimodal Dataset S1-MMAlign is a large-scale, multi-disciplinary multimodal dataset comprising over 15.5 million high-quality image-text pairs derived from 2.5 million open-access scient...

Details

Task
Image To Text, Visual Question Answering, Feature Extraction
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
ScienceOne-AI
Year
2025
Download

FAQ