Skip to content

nvidia/OCR-Synthetic-Multilingual-v1

Object DetectionImage To TextEN, JA, KO

Nvidia/OCR-Synthetic-Multilingual-v1 is a object detection-focused dataset in EN, JA, KO distributed in Parquet format.

About nvidia/OCR-Synthetic-Multilingual-v1

OCR-Synthetic-Multilingual-v1 Dataset Description Large-scale synthetically generated OCR training dataset for multilingual text detection and recognition. The data was produced using a heavily modified and extended version o...

Details

Task
Object Detection, Image To Text
Language
EN, JA, KO
Format
Parquet
Rows / instances
N/A
Creator
nvidia
Year
2026
Download

Related Object Detection, Image To Text datasets

FAQ