Question 1

What is the nyu-mll/roberta-med-small-1M-3 model?

Accepted Answer

We release 3 models with lowest perplexities for each pretraining data size out of 25 runs (or 10 in the case of 1B tokens) The data reproduces that of BERT: We combine English Wikipedia and a reproduction of BookCorpus using texts from smashwords in a ratio of approximately 3:1 .…

Question 2

Who created nyu-mll/roberta-med-small-1M-3?

Accepted Answer

Publisher information for nyu-mll/roberta-med-small-1M-3 is not recorded in our dataset.

nyu-mll/roberta-med-small-1M-3

About nyu-mll/roberta-med-small-1M-3

Explore

FAQ