Skip to content

Salesforce/blip3-ocr-200m

General NLPEN

Created by Salesforce at 2024, the Salesforce/blip3-ocr-200m is a General NLP dataset in EN in Parquet format.

About Salesforce/blip3-ocr-200m

BLIP3-OCR-200M Dataset Overview The BLIP3-OCR-200M dataset is designed to address the limitations of current Vision-Language Models (VLMs) in processing and interpreting text-rich images, such as documents and charts. Traditional ima...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
Salesforce
Year
2024
Download

Related General NLP datasets

FAQ