Question 1

What is the ceshine/TinyBERT_L-4_H-312_v2-distill-AllNLI model?

Accepted Answer

This model is distilled from the bert-base-nli-stsb-mean-tokens pre-trained model from Sentence-Transformers. The embedding vector is obtained by mean (average) pooling of the last layer's hidden states.…
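
To make the pooling step concrete, here is a minimal sketch of mean pooling over one sentence's token vectors. It is an illustration under stated assumptions, not the model's actual code: the function name mean_pool and the toy 2-dimensional vectors are hypothetical, and skipping padded positions through an attention mask is an assumption about how the average is taken, since the answer above only says the last layer's hidden states are averaged.

```python
from typing import List


def mean_pool(hidden_states: List[List[float]], attention_mask: List[int]) -> List[float]:
    """Average the token vectors of one sentence, skipping padded positions.

    hidden_states: one vector per token position (last-layer outputs).
    attention_mask: 1 for a real token, 0 for padding (assumed convention).
    """
    if len(hidden_states) != len(attention_mask):
        raise ValueError("hidden_states and attention_mask must have the same length")

    # Keep only the vectors that belong to real tokens.
    kept = [vec for vec, keep in zip(hidden_states, attention_mask) if keep]
    if not kept:
        raise ValueError("attention_mask selects no tokens")

    # Average each dimension across the kept token vectors.
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


# Toy input: three token positions, the last one is padding and is ignored.
toy_hidden = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
toy_mask = [1, 1, 0]
embedding = mean_pool(toy_hidden, toy_mask)
print(embedding)  # [2.0, 3.0]
```

In practice the token vectors would come from the model's last layer rather than a hand-written list, and the result is a single fixed-size vector per sentence regardless of sentence length.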

Question 2

Who created ceshine/TinyBERT_L-4_H-312_v2-distill-AllNLI?

Accepted Answer

Publisher information for ceshine/TinyBERT_L-4_H-312_v2-distill-AllNLI is not recorded in our dataset.
