Skip to content

huawei-noah/TinyBERT_General_4L_312D

Huawei-noah/TinyBERT_General_4L_312D is a machine learning model.

About huawei-noah/TinyBERT_General_4L_312D

TinyBERT is 7.5x smaller and 9.4x faster on inference than BERT-base . It performs a novel transformer distillation at both the pre-training and task-specific learning stages . In general distillation, we use the original Bert-base without fine-tuning as the teacher and a large-scale text corpus as the learning data . For more details about the techniques, refer to our paper:TinyBERT: Distilling BERT for Natural Language Understanding. The paper is published by Jiao, Xiaoqi and Yin, Yichun and Shang, Lifeng and Jiang, Xin and Chen, Xiao and Li, Linlin and Wang,,
View model source

Explore

FAQ