wikimedia/wit_base

Image To Text, Text Retrieval; AF, AN, AR

Created by wikimedia in 2022, wikimedia/wit_base is an image-to-text dataset with text in AF, AN, AR, distributed in Parquet format.

About wikimedia/wit_base

Dataset Card for WIT

Dataset Summary: Wikimedia's version of the Wikipedia-based Image Text (WIT) Dataset, a large multimodal multilingual dataset. From the official blog post: The core training data is taken from the Wikipedia Image...

Details

Task: Image To Text, Text Retrieval
Language: AF, AN, AR
Format: Parquet
Rows / instances: N/A
Creator: wikimedia
Year: 2022