Skip to content

SwayStar123/preprocessed_commoncatalog-cc-by

General NLPEN

SwayStar123/preprocessed_commoncatalog-cc-by is a General NLP-focused dataset in EN distributed in Parquet format.

About SwayStar123/preprocessed_commoncatalog-cc-by

I also seperately provide just the prompts in prompts.json keys are the image_id, and the values are the captions generated Captions generated by moondream: vikhyatk/moondream2 Latents generated by SDXL VAE: madebyollin/sdxl-vae-fp16-fix Embedding...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
SwayStar123
Year
2024
Download

Related General NLP datasets

FAQ