jordanparker6/publaynet

Image To Text, EN

jordanparker6/publaynet is an English-language image-to-text dataset published by jordanparker6 in Parquet format. Its license is listed as "other", it falls in the 10K<n<100K size category, and it has been downloaded about 1.7K times.

About jordanparker6/publaynet

PubLayNet is a large dataset of document images whose layout is annotated with both bounding boxes and polygonal segmentations. The source of the documents is the PubMed Central Open Access Subset (commercial use collection). The ...
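
To illustrate how the two annotation types mentioned above relate, here is a minimal sketch that derives an axis-aligned bounding box from a polygonal segmentation. It assumes a COCO-style layout, where a polygon is a flat list `[x1, y1, x2, y2, ...]` and a box is `(x, y, width, height)`; the column names and exact schema of this Parquet copy are not shown on this page, so the function name `polygon_to_bbox` and the sample coordinates are hypothetical.

```python
from typing import List, Tuple


def polygon_to_bbox(polygon: List[float]) -> Tuple[float, float, float, float]:
    """Return (x, y, width, height) enclosing a flat [x1, y1, x2, y2, ...] polygon.

    Assumes a COCO-style flat coordinate list; this layout is an assumption,
    not something confirmed for this particular Parquet export.
    """
    if len(polygon) < 6 or len(polygon) % 2 != 0:
        raise ValueError("expected a flat list of at least three (x, y) pairs")
    xs = polygon[0::2]  # every even index is an x coordinate
    ys = polygon[1::2]  # every odd index is a y coordinate
    x_min, y_min = min(xs), min(ys)
    return (x_min, y_min, max(xs) - x_min, max(ys) - y_min)


# Hypothetical rectangular text region, for illustration only.
example_polygon = [10.0, 20.0, 110.0, 20.0, 110.0, 70.0, 10.0, 70.0]
print(polygon_to_bbox(example_polygon))
```

A box computed this way is the tightest axis-aligned rectangle around the polygon, which is handy for checking that the two annotation types agree on a given region.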

Details

Task: Image To Text
Language: EN
Format: Parquet
Rows / instances: N/A
Size: 10K<n<100K
Creator: jordanparker6
Year: 2022
License: other
Downloads: 1659
Likes: 37