Skip to content

ByT5-XXL

GoogleGoogle ResearchLanguage modelingOpen weights

Developed by Google,Google Research in 2021, ByT5-XXL is a language modeling model with 12900000000.0 parameters with openly available weights.

About ByT5-XXL

Most widely-used pre-trained language models operate on sequences of tokens corresponding to word or subword units. By comparison, token-free models that operate directly on raw text (bytes or characters) have many benefits: they can process text in

Details

Provider
Google,Google Research
Task
Language modeling
Parameters
12900000000.0
Released
2021-05-28
Open weights
Yes
View model source

Explore

FAQ