
KangLiao/Puffin-4M

Text To Image, Image To Text, Image To 3D, Image To Image, English

The KangLiao/Puffin-4M dataset is an English-language resource for text-to-image, image-to-text, image-to-3D, and image-to-image tasks, published by KangLiao in 2025.

About KangLiao/Puffin-4M

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Project Page | GitHub | Hugging Face | Paper

Dataset Details

Datasets and benchmarks that s...

Details

Task: Text To Image, Image To Text, Image To 3D, Image To Image
Language: English
Format: Parquet
Rows / instances: N/A
Creator: KangLiao
Year: 2025

FAQ