Skip to content

getomni-ai/ocr-benchmark

General NLPEnglishmit

The getomni-ai/ocr-benchmark dataset is a English General NLP resource from getomni-ai at 2025. With 1.5K downloads and 72 likes, it is actively used by the community. It is released under the mit license and is a 1K<n<10K-scale dataset.

About getomni-ai/ocr-benchmark

OmniAI OCR Benchmark A comprehensive benchmark that compares OCR and data extraction capabilities of different multimodal LLMs such as gpt-4o and gemini-2.0, evaluating both text and JSON extraction accuracy. Benchmark Results (Feb 2025) | Sour...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
1K<n<10K
Creator
getomni-ai
Year
2025
License
mit
Downloads
1515
Likes
72
Download Homepage

Related General NLP datasets

FAQ