wikimedia/wit_base
Image To TextText RetrievalAF, AN, AR
Created by wikimedia at 2022, the wikimedia/wit_base is a image to text dataset in AF, AN, AR in Parquet format.
About wikimedia/wit_base
Dataset Card for WIT
Dataset Summary
Wikimedia's version of the Wikipedia-based Image Text (WIT) Dataset, a large multimodal multilingual dataset.
From the official blog post:
The core training data is taken from the Wikipedia Image...
Details
- Task
- Image To Text, Text Retrieval
- Language
- AF, AN, AR
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- wikimedia
- Year
- 2022