sraimund/MapPool
General NLP, English
sraimund/MapPool is an English-language dataset for general NLP tasks, distributed in Parquet format.
About sraimund/MapPool
MapPool - Bubbling up an extremely large corpus of maps for AI
MapPool is a dataset of 75 million potential maps and textual captions. It has been derived from CommonPool, a dataset consisting of 12 billion text-image pairs from the Internet....
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: sraimund
- Year: 2024