lanwuwei/GigaBERT-v3-Arabic-and-English
Lanwuwei/GigaBERT-v3-Arabic-and-English is machine learning model.
About lanwuwei/GigaBERT-v3-Arabic-and-English
GigaBERT-v3 is a customized bilingual BERT for English and Arabic . It was pre-trained in a large-scale corpus (Gigaword+Oscar+Wikipedia) with 10B tokens . It shows state-of-the-art zero-shot transfer performance from English to Arabic on information extraction (IE) tasks . More details can be found in the following paper: The 2020 Conference on Empirical Methods on Natural Language Processing (EMNLP) The paper is published in the Proceedings of the 2020 Conference of the EMNLP Conference on the ENCNLP. For more information on the paper, visit www.inproceedings.com/,