Skip to content

allenai/CoSyn-400K

Visual Question AnsweringEnglishodc-by

Created by allenai at 2025, the allenai/CoSyn-400K is a visual question answering dataset in English containing 408,227 records in Parquet format. With 3K downloads and 50 likes, it is actively used by the community. It is released under the odc-by license and is a 100K<n<1M-scale dataset.

About allenai/CoSyn-400K

CoSyn-400k CoSyn-400k is a collection of synthetic question-answer pairs about very diverse range of computer-generated images. The data was created by using the Claude large language model to generate code that can be executed to render an im...

Details

Task
Visual Question Answering
Language
English
Format
Parquet
Rows / instances
408227
Size
100K<n<1M
Creator
allenai
Year
2025
License
odc-by
Downloads
2997
Likes
50
Download Homepage

Related Visual Question Answering datasets

FAQ