kdexd/red_caps
Image To Text, EN
kdexd/red_caps is an English-language image-to-text dataset distributed in Parquet format.
About kdexd/red_caps
RedCaps is a large-scale dataset of 12M image-text pairs collected from Reddit.
Images and captions from Reddit depict and describe a wide variety of objects and scenes.
The data is collected from a manually curated set of subreddits (350 total),
...
Details
- Task: Image To Text
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: kdexd
- Year: 2022
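
The description above says the dataset consists of image-text pairs gathered from a curated set of subreddits. As a rough illustration of that shape, here is a minimal sketch that models one pair and counts pairs per subreddit. The class name `ImageTextPair`, its field names, and the sample records are all hypothetical placeholders chosen for this example; they are not taken from the dataset's actual schema.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ImageTextPair:
    # Hypothetical field names, for illustration only; the real
    # Parquet columns may be named differently.
    image_url: str
    caption: str
    subreddit: str


def pairs_per_subreddit(pairs):
    """Count how many image-text pairs each subreddit contributes."""
    return Counter(pair.subreddit for pair in pairs)


# Placeholder records, not real RedCaps rows.
sample = [
    ImageTextPair("https://example.com/a.jpg", "placeholder caption a", "subreddit_a"),
    ImageTextPair("https://example.com/b.jpg", "placeholder caption b", "subreddit_a"),
    ImageTextPair("https://example.com/c.jpg", "placeholder caption c", "subreddit_b"),
]

counts = pairs_per_subreddit(sample)
for name, n in sorted(counts.items()):
    print(f"{name}: {n}")
```

The same counting function would work on rows read from the Parquet files once they are mapped to whatever column names the files actually use.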