mawadalla/scientific-figures-captions-context

Visual Question Answering, Document Question Answering, EN

mawadalla/scientific-figures-captions-context is an English-language dataset for visual question answering and document question answering, distributed in Parquet format.

About mawadalla/scientific-figures-captions-context

Dataset Card for Scientific Figures, Captions, and Context. A novel vision-language dataset of scientific figures taken directly from research papers. We scraped approximately 150k papers, with about 690k figures in total. We extracted each figur...

Details

Task
Visual Question Answering, Document Question Answering
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
mawadalla
Year
2023