
visheratin/realworldqa

Visual Question Answering | EN

visheratin/realworldqa is an English-language visual question answering dataset published by visheratin and distributed in Parquet format.

About visheratin/realworldqa

RealWorldQA dataset

This is the benchmark dataset released by xAI along with the Grok-1.5 Vision announcement. This benchmark is designed to evaluate basic real-world spatial understanding capabilities of multimodal models. While many of the ...
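As a rough illustration of how a benchmark like this might be used, the sketch below loads the dataset from the Hugging Face Hub and scores model answers by exact match. Several things here are assumptions rather than facts from this page: the split name "test", the helper names `load_realworldqa`, `normalize_answer`, and `exact_match_accuracy` (all hypothetical), and the choice of exact-match scoring, which is a simple stand-in and not necessarily the metric xAI reports.

```python
from typing import Iterable


def load_realworldqa(split: str = "test"):
    """Load the dataset from the Hugging Face Hub (requires network access).

    Assumption: the repository exposes a split named "test". Check the
    dataset page if this call raises an error.
    """
    # Imported lazily so the scoring helpers work without the library.
    from datasets import load_dataset  # pip install datasets

    return load_dataset("visheratin/realworldqa", split=split)


def normalize_answer(text: str) -> str:
    """Lowercase, trim whitespace, and drop trailing periods."""
    return text.strip().lower().rstrip(".").strip()


def exact_match_accuracy(
    predictions: Iterable[str], references: Iterable[str]
) -> float:
    """Fraction of predictions equal to their reference after normalization."""
    pairs = list(zip(predictions, references))
    if not pairs:
        return 0.0
    hits = sum(normalize_answer(p) == normalize_answer(r) for p, r in pairs)
    return hits / len(pairs)
```

A typical use would be to call `load_realworldqa()`, run a multimodal model on each row's image and question, and pass the model outputs and the reference answers to `exact_match_accuracy`. The column names are not stated on this page, so inspect the loaded dataset's `column_names` before indexing into rows.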

Details

Task: Visual Question Answering
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: visheratin
Year: 2024
