nvidia/OCR-Synthetic-Multilingual-v1
Object Detection, Image To Text | EN, JA, KO
nvidia/OCR-Synthetic-Multilingual-v1 is an object detection and image-to-text (OCR) dataset in English, Japanese, and Korean (EN, JA, KO), distributed in Parquet format.
About nvidia/OCR-Synthetic-Multilingual-v1
OCR-Synthetic-Multilingual-v1
Dataset Description
Large-scale synthetically generated OCR training dataset for multilingual text detection and recognition. The data was produced using a heavily modified and extended version o...
Details
- Task: Object Detection, Image To Text
- Language: EN, JA, KO
- Format: Parquet
- Rows / instances: N/A
- Creator: nvidia
- Year: 2026