A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT)
Question AnsweringVisualEnglish
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT) is a question answering dataset in English from Johnson et al. with 999,968 questions; 100,000 images records in JSON format.
About A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT)
Visual question answering dataset contains 100,000 images and 999,968 questions.
Details
- Task
- Question Answering, Visual
- Language
- English
- Format
- JSON
- Rows / instances
- 999,968 questions; 100,000 images
- Creator
- Johnson et al.
- Year
- 2016