sraimund/MapPool

General NLP, English

sraimund/MapPool is an English-language dataset, categorized as General NLP, distributed in Parquet format.

About sraimund/MapPool

MapPool - Bubbling up an extremely large corpus of maps for AI

MapPool is a dataset of 75 million potential maps and textual captions. It has been derived from CommonPool, a dataset consisting of 12 billion text-image pairs from the Internet....

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Creator: sraimund
Year: 2024
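
Since the dataset is distributed as Parquet, a first look at its records might be taken along the following lines. This is only a sketch under assumptions that this page does not confirm: that the identifier `sraimund/MapPool` resolves on the Hugging Face Hub, that the `datasets` package is installed, and that a split named `train` exists. The helper name `peek` is hypothetical, and no column names are assumed.

```python
REPO_ID = "sraimund/MapPool"  # dataset identifier as given on this page


def peek(n=3, split="train"):
    """Return the first n records as a list of dicts.

    Assumptions (not confirmed by this page): the `datasets` package is
    installed, REPO_ID is hosted on the Hugging Face Hub, and a split
    named "train" exists.
    """
    from itertools import islice

    # Imported lazily so that defining this helper needs no extra packages;
    # calling it requires `datasets` and network access.
    from datasets import load_dataset

    # streaming=True iterates over the records without downloading
    # the whole dataset first.
    stream = load_dataset(REPO_ID, split=split, streaming=True)
    return list(islice(stream, n))
```

Calling `peek()` and printing the result would show which columns the records actually carry. If the split name differs, pass it explicitly, for example `peek(split="...")` with the name reported by the hosting page.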