yiyangd/pointarena_dataset
Image Text To TextVisual Question AnsweringEN
Yiyangd/pointarena_dataset is a image text to text dataset in EN from yiyangd in Parquet format.
About yiyangd/pointarena_dataset
Molmo2 PointArena SFT Data
26,596 supervised pointing examples used to fine-tune Molmo2-8B
(yiyangd/molmo2-8b-ft)
into a stronger PointArena solver (76.2% → up from 73.9% base, +2.3 pp).
Provenance
Each record is (image, query, answe...
Details
- Task
- Image Text To Text, Visual Question Answering
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- yiyangd
- Year
- 2026