liuhaotian/LLaVA-Pretrain
General NLPEN
Created by liuhaotian at 2023, the liuhaotian/LLaVA-Pretrain is a General NLP dataset in EN in Parquet format.
About liuhaotian/LLaVA-Pretrain
LLaVA Visual Instruct Pretrain Dataset Card
Dataset details
Dataset type:
LLaVA Visual Instruct Pretrain LCS-558K is a subset of LAION/CC/SBU dataset, filtered with a more balanced concept coverage distribution.
Captions are also ass...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- liuhaotian
- Year
- 2023