ViT-Base/32

Google Brain | Image representation | Open weights

Developed by Google Brain in 2020, ViT-Base/32 is an image representation model with 86 million parameters and openly available weights.

About ViT-Base/32

While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used

Details

Provider: Google Brain
Task: Image representation
Parameters: 86,000,000
Released: 2020-10-22
Open weights: Yes