Skip to content

upstage/dp-bench

General NLPEnglish

The upstage/dp-bench dataset is a English General NLP resource from upstage at 2024.

About upstage/dp-bench

DP-Bench: Document Parsing Benchmark Document parsing refers to the process of converting complex documents, such as PDFs and scanned images, into structured text formats like HTML and Markdown. It is especially useful as a preprocessor ...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Creator
upstage
Year
2024
Download

Related General NLP datasets

FAQ