upstage/dp-bench
General NLPEnglish
The upstage/dp-bench dataset is a English General NLP resource from upstage at 2024.
About upstage/dp-bench
DP-Bench: Document Parsing Benchmark
Document parsing refers to the process of converting complex documents, such as PDFs and scanned images, into structured text formats like HTML and Markdown.
It is especially useful as a preprocessor ...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- upstage
- Year
- 2024