SwayStar123/preprocessed_commoncatalog-cc-by
SwayStar123/preprocessed_commoncatalog-cc-by is an English-language dataset, catalogued under General NLP and distributed in Parquet format.
About SwayStar123/preprocessed_commoncatalog-cc-by
I also separately provide just the prompts in prompts.json.
The keys are the image_id values, and the values are the generated captions.
Captions generated by moondream: vikhyatk/moondream2
Latents generated by SDXL VAE: madebyollin/sdxl-vae-fp16-fix
Embedding...
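The prompts.json layout described above (image_id keys mapped to caption values) can be read with the standard library alone. The following is a minimal sketch, not the author's own loading code: the helper name `load_prompts` is hypothetical, and it assumes the file is a single JSON object and that it has already been downloaded to a local path.

```python
import json
from pathlib import Path


def load_prompts(path):
    """Read a JSON object mapping image_id -> caption into a dict.

    Assumption: the file holds one top-level JSON object, as the
    description of prompts.json suggests. Anything else is rejected.
    """
    with Path(path).open(encoding="utf-8") as f:
        prompts = json.load(f)
    if not isinstance(prompts, dict):
        raise ValueError("expected a JSON object keyed by image_id")
    return prompts


def caption_for(prompts, image_id):
    """Return the caption for image_id, or None if the id is absent."""
    # JSON object keys are always strings, so normalize the lookup key.
    return prompts.get(str(image_id))
```

Calling `load_prompts("prompts.json")` and then `caption_for(prompts, some_id)` returns the caption string for a known id and `None` otherwise; the local path and the id are placeholders for whatever the downloaded file actually contains.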
Details
- Task: General NLP
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: SwayStar123
- Year: 2024