Skip to content

yiyangd/pointarena_dataset

Image Text To TextVisual Question AnsweringEN

Yiyangd/pointarena_dataset is a image text to text dataset in EN from yiyangd in Parquet format.

About yiyangd/pointarena_dataset

Molmo2 PointArena SFT Data 26,596 supervised pointing examples used to fine-tune Molmo2-8B (yiyangd/molmo2-8b-ft) into a stronger PointArena solver (76.2% → up from 73.9% base, +2.3 pp). Provenance Each record is (image, query, answe...

Details

Task
Image Text To Text, Visual Question Answering
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
yiyangd
Year
2026
Download

FAQ