liuhaotian/LLaVA-Pretrain

General NLP, EN

Created by liuhaotian in 2023, liuhaotian/LLaVA-Pretrain is a General NLP dataset in English, distributed in Parquet format.

About liuhaotian/LLaVA-Pretrain

LLaVA Visual Instruct Pretrain Dataset Card

Dataset details

Dataset type: LLaVA Visual Instruct Pretrain LCS-558K is a subset of LAION/CC/SBU dataset, filtered with a more balanced concept coverage distribution. Captions are also ass...

Details

Task: General NLP
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: liuhaotian
Year: 2023