ViT-Huge/14
Google BrainGoogle ResearchImage representationOpen weights
The ViT-Huge/14 model is an open-weights image representation model from Google Brain,Google Research with 632000000.0 parameters.
About ViT-Huge/14
While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used
Details
- Provider
- Google Brain,Google Research
- Task
- Image representation
- Parameters
- 632000000.0
- Released
- 2020-10-22
- Open weights
- Yes