ranjaykrishna/visual_genome
Image To TextObject DetectionVisual Question AnsweringEN
Ranjaykrishna/visual_genome is a image to text-focused dataset in EN distributed in Parquet format.
About ranjaykrishna/visual_genome
Visual Genome enable to model objects and relationships between objects.
They collect dense annotations of objects, attributes, and relationships within each image.
Specifically, the dataset contains over 108K images where each image has an averag...
Details
- Task
- Image To Text, Object Detection, Visual Question Answering
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- ranjaykrishna
- Year
- 2022