gksriharsha/chitralekha
Image To TextTEmit
Gksriharsha/chitralekha is a image to text-focused dataset in TE that provides 65,876,177 labeled examples distributed in Parquet format. It is distributed under the mit license and falls in the 10M<n<100M size category, and has been downloaded 46K times.
About gksriharsha/chitralekha
Chitralekha
Dataset Details
Dataset Version
Some of the fonts do not have proper letters/rendering of different telugu letter combinations. Those have been removed as much as I can find them. If there are any other mistake...
Details
- Task
- Image To Text
- Language
- TE
- Format
- Parquet
- Rows / instances
- 65876177
- Size
- 10M<n<100M
- Creator
- gksriharsha
- Year
- 2023
- License
- mit
- Downloads
- 45979
- Likes
- 5