Skip to content

SwayStar123/preprocessed_commoncatalog-cc-by_DCAE

Text To ImageEN

SwayStar123/preprocessed_commoncatalog-cc-by_DCAE is a text to image-focused dataset in EN distributed in Parquet format.

About SwayStar123/preprocessed_commoncatalog-cc-by_DCAE

The images are resized and then encoded with the DC-AE f32 autoencoder. The resizing is done with a bucketmanager with base resolution 512x512, minimum side length 256, maximum side length 1024, all sides are divisible by 32 ofcourse as they neede...

Details

Task
Text To Image
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
SwayStar123
Year
2025
Download

Related Text To Image datasets

FAQ