
ViT-Huge/14

Google Brain, Google Research, Image representation, Open weights

ViT-Huge/14 is an open-weights image representation model from Google Brain and Google Research with 632 million parameters.

About ViT-Huge/14

While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used

Details

Provider
Google Brain, Google Research
Task
Image representation
Parameters
632,000,000
Released
2020-10-22
Open weights
Yes