apple/DataCompDR-12M
Text To ImageImage To TextEN
Created by apple at 2024, the apple/DataCompDR-12M is a text to image dataset in EN in Parquet format. With 2.2K downloads and 36 likes, it is actively used by the community. It is released under the apple-amlr license and is a 10M<n<100M-scale dataset.
About apple/DataCompDR-12M
Dataset Card for DataCompDR-12M
This dataset contains synthetic captions, embeddings, and metadata for DataCompDR-12M.
The metadata has been generated using pretrained image-text models on a 12M subset of DataComp-1B.
For details on how to us...
Details
- Task
- Text To Image, Image To Text
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 10M<n<100M
- Creator
- apple
- Year
- 2024
- License
- apple-amlr
- Downloads
- 2160
- Likes
- 36