Question 1

What is the microsoft/MiniLM-L12-H384-uncased model?

Accepted Answer

MiniLM is a distilled model from the paper "MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers" Please note: This checkpoint can be an inplace substitution for BERT and it needs to be fine-tuned before use! We present the dev results on SQuAD 2.0 and several GLUE benchmar…

Question 2

Who created microsoft/MiniLM-L12-H384-uncased?

Accepted Answer

Publisher information for microsoft/MiniLM-L12-H384-uncased is not recorded in our dataset.

microsoft/MiniLM-L12-H384-uncased

About microsoft/MiniLM-L12-H384-uncased

Explore

FAQ