hiyouga/geometry3k

Visual Question Answering | EN | mit

The hiyouga/geometry3k dataset is an English visual question answering resource published by hiyouga in 2025, comprising 3,002 examples. It has 31.3K downloads and 82 likes, is released under the MIT license, and falls in the 1K<n<10K size category.

About hiyouga/geometry3k

This dataset was converted from https://github.com/lupantech/InterGPS using the following script:

    import json
    import os
    from datasets import Dataset, DatasetDict, Sequence
    from datasets import Image as ImageData
    from PIL import Image

    MAPPING = {...
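To inspect the converted data, the following is a minimal sketch rather than an official recipe. It assumes the Hugging Face `datasets` library is installed and that the Hub repository ID matches the page title (`hiyouga/geometry3k`). The helper name `load_geometry3k` and the constants are illustrative and do not come from the dataset card.

```python
REPO_ID = "hiyouga/geometry3k"  # assumption: Hub repository ID equals the page title
EXPECTED_ROWS = 3002            # total row count listed on this page


def load_geometry3k(repo_id: str = REPO_ID):
    """Download the dataset and return it together with its total row count.

    Hypothetical helper for illustration; requires network access.
    """
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # Without a `split` argument, load_dataset returns a DatasetDict keyed by split name.
    dataset = load_dataset(repo_id)
    total_rows = sum(split.num_rows for split in dataset.values())
    return dataset, total_rows
```

Calling `dataset, total = load_geometry3k()` and printing `dataset` shows the available splits and columns, and `total` can be compared against `EXPECTED_ROWS`. Split and column names are not listed on this page, so the sketch does not assume any.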

Details

Task: Visual Question Answering
Language: EN
Format: Parquet
Rows / instances: 3,002
Size: 1K<n<10K
Creator: hiyouga
Year: 2025
License: mit
Downloads: 31,335
Likes: 82