KangLiao/Puffin-4M
Text To ImageImage To TextImage To 3DImage To ImageEnglish
The KangLiao/Puffin-4M dataset is a English text to image resource from KangLiao at 2025.
About KangLiao/Puffin-4M
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
📖 Project Page | 🖥️ GitHub | 🤗 Hugging Face | 📑 Paper
Dataset Details
Datasets and benchmarks that s...
Details
- Task
- Text To Image, Image To Text, Image To 3D, Image To Image
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- KangLiao
- Year
- 2025