jordanparker6/publaynet
Image To Text, EN
The jordanparker6/publaynet dataset is an English image-to-text dataset published by jordanparker6 in Parquet format. Its license is listed as "other", it falls in the 10K<n<100K size category, and it has been downloaded about 1.7K times.
About jordanparker6/publaynet
PubLayNet
PubLayNet is a large dataset of document images whose layouts are annotated with both bounding boxes and polygonal segmentations. The documents come from the PubMed Central Open Access Subset (commercial use collection). The ...
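The two annotation types named above are related: a bounding box is the tightest axis-aligned rectangle around a polygonal segmentation. The sketch below illustrates that relationship only. It assumes a COCO-style layout, with a polygon stored as a flat `[x1, y1, x2, y2, ...]` list and a box as `(x, y, width, height)`; this card does not confirm that layout, and `polygon_to_bbox` and the sample polygon are hypothetical names and values chosen for illustration.

```python
from typing import List, Tuple


def polygon_to_bbox(polygon: List[float]) -> Tuple[float, float, float, float]:
    """Return (x, y, width, height) of the tightest axis-aligned box
    around a flat [x1, y1, x2, y2, ...] polygon (assumed layout)."""
    if len(polygon) < 6 or len(polygon) % 2 != 0:
        raise ValueError("expected a flat list of at least 3 (x, y) points")
    xs = polygon[0::2]  # even positions hold x coordinates
    ys = polygon[1::2]  # odd positions hold y coordinates
    x_min, y_min = min(xs), min(ys)
    return (x_min, y_min, max(xs) - x_min, max(ys) - y_min)


# Hypothetical rectangular region, not taken from the dataset.
example_polygon = [10.0, 20.0, 110.0, 20.0, 110.0, 70.0, 10.0, 70.0]
example_bbox = polygon_to_bbox(example_polygon)
```

A check like this can be useful when validating annotations, since a stored box that disagrees with the box derived from its polygon usually points to a conversion error.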
Details
- Task: Image To Text
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Size: 10K<n<100K
- Creator: jordanparker6
- Year: 2022
- License: other
- Downloads: 1659
- Likes: 37