Skip to content

bionlp/bluebert_pubmed_uncased_L-24_H-1024_A-16

Bionlp/bluebert_pubmed_uncased_L-24_H-1024_A-16 is machine learning model.

About bionlp/bluebert_pubmed_uncased_L-24_H-1024_A-16

A BERT model pre-trained on PubMed abstracts was used to pre-train the BlueBERT models . The corpus contains around4000M words extracted from the PubMed ASCII code version . The model was trained using pre-processed pre-training data . The code is used to tokenize the text using the NLTK Treebank tokenizer . The tool shows the results of research conducted in the Computational Biology Branch, NCBI . The information produced on this website is not intended for direct diagnostic use or medical decision-making without review and oversight by a clinical professional . The National Institutes of Health does not independently verify the validity or utility of the information produced by this tool. NIH does,
View model source

Explore

FAQ