Skip to content

bionlp/bluebert_pubmed_mimic_uncased_L-12_H-768_A-12

The bionlp/bluebert_pubmed_mimic_uncased_L-12_H-768_A-12 model is a machine learning model.

About bionlp/bluebert_pubmed_mimic_uncased_L-12_H-768_A-12

The BlueBERT model was pre-trained on PubMed abstracts and clinical notes (MIMIC-III) The model was used to pre-train the models using pre-processed texts . The corpus contains around4000M words extracted from the PubMed ASCII code version . The data and codes were used to train the models . The model's training procedure was done using the NLTK Treebank tokenizer (tokenize) tokenizing the text . The code is used to lowercasing the text and removing speical chars \x00-\x7F . The information produced on this website is not intended for direct diagnostic use or medical decision-making without review and oversight by a,
View model source

Explore

FAQ