ViT-Base/32
Google Brain · Image representation · Open weights
Developed by Google Brain in 2020, ViT-Base/32 is an image representation model with 86 million parameters and openly available weights.
About ViT-Base/32
While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional networks, or used
Details
- Provider: Google Brain
- Task: Image representation
- Parameters: 86,000,000
- Released: 2020-10-22
- Open weights: Yes