Rostlab/prot_t5_xl_uniref50

The Rostlab/prot_t5_xl_uniref50 model is a protein language model for feature extraction and fine-tuning on downstream tasks.

About Rostlab/prot_t5_xl_uniref50

The ProtT5-XL-UniRef50 model is based on the t5-3b model and was pretrained on a large corpus of protein sequences in a self-supervised fashion. It was trained on uppercase amino acids, so it only works with sequences written in capital letters. The model can be used for protein feature extraction, for example to extract features of a given protein sequence in PyTorch, or it can be fine-tuned on downstream tasks. For feature extraction, it is better to use the features from the encoder rather than from the decoder. The model was trained on UniRef50, a dataset consisting of 45 million protein sequences. The rare,
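
As a rough illustration of encoder-side feature extraction in PyTorch, the sketch below loads only the encoder through the Hugging Face transformers library. The helper names prepare_sequence and embed are hypothetical, the mean pooling over residues is an illustrative choice rather than something the description above prescribes, and separating residues with spaces is an assumption about the input format the tokenizer expects.

```python
import re


def prepare_sequence(sequence: str) -> str:
    """Uppercase a protein sequence and separate residues with spaces.

    The model only works with capital-letter amino acids; the space
    separation is an assumption about the tokenizer's expected input.
    """
    cleaned = re.sub(r"\s+", "", sequence).upper()
    return " ".join(cleaned)


def embed(sequences, model_name="Rostlab/prot_t5_xl_uniref50"):
    """Return one mean-pooled encoder embedding per sequence (hypothetical helper)."""
    # Imported lazily so prepare_sequence works without these packages installed.
    import torch
    from transformers import T5EncoderModel, T5Tokenizer

    tokenizer = T5Tokenizer.from_pretrained(model_name, do_lower_case=False)
    # T5EncoderModel loads the encoder stack only, so no decoder is involved.
    model = T5EncoderModel.from_pretrained(model_name).eval()

    batch = tokenizer(
        [prepare_sequence(s) for s in sequences],
        padding=True,
        return_tensors="pt",
    )
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state  # (batch, tokens, hidden)

    # Average over non-padding positions; the end-of-sequence token is included.
    mask = batch["attention_mask"].unsqueeze(-1).to(hidden.dtype)
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)
```

Calling embed(["mktayiak"]) would return a tensor with one row per input sequence. The first call downloads the model weights, which are large because the model is based on t5-3b, so a GPU or a machine with ample memory is advisable.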