mawadalla/scientific-figures-captions-context
Visual Question Answering, Document Question Answering, EN
mawadalla/scientific-figures-captions-context is an English-language dataset for visual question answering and document question answering, distributed in Parquet format.
About mawadalla/scientific-figures-captions-context
Dataset Card for Scientific Figures, Captions, and Context
A novel vision-language dataset of scientific figures taken directly from research papers.
We scraped approximately 150k papers containing about 690k figures in total. We extracted each figur...
Details
- Task: Visual Question Answering, Document Question Answering
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: mawadalla
- Year: 2023