Skip to content

mvp-lab/LLaVA-OneVision-2-Data

Video Text To TextVisual Question AnsweringImage Text To TextEN

Created by mvp-lab at 2026, the mvp-lab/LLaVA-OneVision-2-Data is a video text to text dataset in EN in Parquet format.

About mvp-lab/LLaVA-OneVision-2-Data

LLaVA-OneVision-2-Data Training data for the LLaVA-OneVision-2 multimodal model family, covering large-scale video and spatial reasoning corpora used in mid-training. Dataset Composition Subset Format Description mid_train...

Details

Task
Video Text To Text, Visual Question Answering, Image Text To Text
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
mvp-lab
Year
2026
Download

FAQ