Salesforce/blip3-ocr-200m
General NLPEN
Created by Salesforce at 2024, the Salesforce/blip3-ocr-200m is a General NLP dataset in EN in Parquet format.
About Salesforce/blip3-ocr-200m
BLIP3-OCR-200M Dataset
Overview
The BLIP3-OCR-200M dataset is designed to address the limitations of current Vision-Language Models (VLMs) in processing and interpreting text-rich images, such as documents and charts. Traditional ima...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- Salesforce
- Year
- 2024