SEBIS/legal_t5_small_trans_sv_fr

SEBIS/legal_t5_small_trans_sv_fr is a T5-based machine translation model for translating legal text from Swedish to French.

About SEBIS/legal_t5_small_trans_sv_fr

This model is based on the t5-small model and was trained on a large corpus of parallel text. It scales the T5 baseline down by using d_model = 512, d_ff = 2,048, 8-headed attention, and only 6 layers each in the encoder and decoder, which gives this variant about 60 million parameters. The model can be used to translate legal texts from Swedish to French. The model description also gives a total of approximately 220M parameters; that figure does not fit the dimensions listed here and matches the larger t5-base configuration instead. The model was trained on a single TPU Pod V3-8 for 250K steps in total, using sequence length 512 and batch size 4096. The optimizer is AdaFactor with an inverse square root learning rate schedule.
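The parameter count can be checked by hand from the dimensions quoted above. The following is a minimal sketch, not the authors' code: the function name `t5_param_count` is hypothetical, and the vocabulary size (32,128), per-head key/value size (64), number of relative-position buckets (32), and tied input/output embeddings are assumptions taken from the standard t5-small configuration, not from this page. This model's own vocabulary may differ.

```python
def t5_param_count(d_model=512, d_ff=2048, n_heads=8, n_layers=6,
                   d_kv=64, vocab_size=32128, rel_buckets=32):
    """Estimate the parameter count of a T5-style encoder-decoder.

    d_model, d_ff, n_heads and n_layers come from the description above;
    d_kv, vocab_size and rel_buckets are assumed t5-small defaults.
    """
    inner = n_heads * d_kv                # total width of all attention heads
    attn = 4 * d_model * inner            # q, k, v and output projections (no biases)
    ffn = 2 * d_model * d_ff              # two feed-forward projections (no biases)
    norm = d_model                        # one scale vector per layer norm

    enc_layer = attn + ffn + 2 * norm     # self-attention + feed-forward
    dec_layer = 2 * attn + ffn + 3 * norm # self-attention + cross-attention + feed-forward

    rel_bias = rel_buckets * n_heads      # relative position bias, stored once per stack
    encoder = n_layers * enc_layer + rel_bias + norm  # plus final layer norm
    decoder = n_layers * dec_layer + rel_bias + norm  # plus final layer norm

    embedding = vocab_size * d_model      # shared with the output layer (assumed tied)
    return embedding + encoder + decoder


if __name__ == "__main__":
    print(f"small-sized configuration: {t5_param_count():,} parameters")
```

Under these assumptions the small configuration comes to roughly 60 million parameters. Calling the same function with the t5-base dimensions (`d_model=768, d_ff=3072, n_heads=12, n_layers=12`) gives a little over 220 million.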