Oryx 34B
Tsinghua UniversityTencentNanyang Technological UniversityVisual question answeringVideo compressionImage captioningVideo descriptionLanguage modeling/generationOpen weights
Oryx 34B is visual question answering model published by Tsinghua University,Tencent,Nanyang Technological University in 2024 featuring 34000000000.0 parameters.
About Oryx 34B
Visual data comes in various forms, ranging from small icons of just a few pixels to long videos spanning hours. Existing multi-modal LLMs usually standardize these diverse visual inputs to a fixed resolution for visual encoders and yield similar num
Details
- Provider
- Tsinghua University,Tencent,Nanyang Technological University
- Task
- Visual question answering,Video compression,Image captioning,Video description,Language modeling/generation
- Parameters
- 34000000000.0
- Released
- 2024-09-19
- Open weights
- Yes