lmms-lab/LLaVA-Video-178K
Visual Question AnsweringVideo Text To TextEN
Lmms-lab/LLaVA-Video-178K is a visual question answering-focused dataset in EN distributed in Parquet format.
About lmms-lab/LLaVA-Video-178K
Dataset Card for LLaVA-Video-178K
Uses
This dataset is used for the training of the LLaVA-Video model. We only allow the use of this dataset for academic research and education purpose. For OpenAI GPT-4 generated data, we recommend t...
Details
- Task
- Visual Question Answering, Video Text To Text
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- lmms-lab
- Year
- 2024