mvp-lab/LLaVA-OneVision-2-Data
Video Text To TextVisual Question AnsweringImage Text To TextEN
Created by mvp-lab at 2026, the mvp-lab/LLaVA-OneVision-2-Data is a video text to text dataset in EN in Parquet format.
About mvp-lab/LLaVA-OneVision-2-Data
LLaVA-OneVision-2-Data
Training data for the LLaVA-OneVision-2 multimodal model family, covering large-scale video and spatial reasoning corpora used in mid-training.
Dataset Composition
Subset
Format
Description
mid_train...
Details
- Task
- Video Text To Text, Visual Question Answering, Image Text To Text
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- mvp-lab
- Year
- 2026