getomni-ai/ocr-benchmark
General NLP | English | MIT
The getomni-ai/ocr-benchmark dataset is an English General NLP resource published by getomni-ai in 2025. It has 1,515 downloads (about 1.5K) and 72 likes. It is released under the MIT license and falls in the 1K<n<10K size category.
About getomni-ai/ocr-benchmark
OmniAI OCR Benchmark
A comprehensive benchmark that compares OCR and data extraction capabilities of different multimodal LLMs such as gpt-4o and gemini-2.0, evaluating both text and JSON extraction accuracy.
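To make "text and JSON extraction accuracy" concrete, here is a minimal sketch of how such scores could be computed. It is an illustration only, not the benchmark's actual scoring code: the helper names (`flatten`, `json_accuracy`, `text_similarity`) are hypothetical, JSON accuracy is assumed to be the fraction of expected leaf fields reproduced exactly, and text accuracy is approximated with the standard library's `difflib.SequenceMatcher` ratio.

```python
from difflib import SequenceMatcher


def flatten(obj, prefix=""):
    """Flatten nested dicts/lists into a {path: leaf_value} mapping."""
    if isinstance(obj, dict):
        leaves = {}
        for key, value in obj.items():
            path = f"{prefix}.{key}" if prefix else str(key)
            leaves.update(flatten(value, path))
        return leaves
    if isinstance(obj, list):
        leaves = {}
        for index, value in enumerate(obj):
            leaves.update(flatten(value, f"{prefix}[{index}]"))
        return leaves
    return {prefix: obj}


def json_accuracy(expected, predicted):
    """Fraction of expected leaf fields that the prediction matches exactly.

    Assumption: every leaf field carries equal weight; the real benchmark
    may weight or compare fields differently.
    """
    expected_leaves = flatten(expected)
    predicted_leaves = flatten(predicted)
    if not expected_leaves:
        return 1.0  # nothing to extract, so nothing can be wrong
    correct = sum(
        1
        for path, value in expected_leaves.items()
        if path in predicted_leaves and predicted_leaves[path] == value
    )
    return correct / len(expected_leaves)


def text_similarity(expected, predicted):
    """Character-level similarity in [0, 1] (a stand-in for an edit-distance score)."""
    return SequenceMatcher(None, expected, predicted).ratio()


if __name__ == "__main__":
    # Hypothetical ground truth and model output for one document.
    truth = {"invoice": {"id": "A-1", "total": 42.5}, "items": [{"sku": "x"}, {"sku": "y"}]}
    output = {"invoice": {"id": "A-1", "total": 42.0}, "items": [{"sku": "x"}, {"sku": "y"}]}
    print(json_accuracy(truth, output))
    print(text_similarity("Total due: 42.50", "Total due: 42.00"))
```

Scoring leaf fields independently means a single wrong value lowers the score proportionally instead of failing the whole document, which is one reasonable way to compare models that are mostly right.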
Benchmark Results (Feb 2025) | Sour...
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Size: 1K<n<10K
- Creator: getomni-ai
- Year: 2025
- License: MIT
- Downloads: 1,515
- Likes: 72