aliosm/ComVE-gpt2-large

aliosm/ComVE-gpt2-large is a text-generation model that explains why a given natural language statement goes against common sense.

About aliosm/ComVE-gpt2-large

The model generates a reason why a given natural language statement goes against common sense, and the raw model can be used directly for text generation. It achieved BLEU scores of 16.5110 and 15.9299 on the development and test sets, respectively, of SemEval-2020 Task 4: Commonsense Validation and Explanation. It was trained on an Nvidia Tesla P100 GPU on the Google Colab platform with a learning rate of 5e-5, 5 epochs, a maximum sequence length of 128, and a batch size of 64. It is based on the ComVE dataset, which contains 10K statements that go against common sense, each paired with three reference reasons.
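As a rough illustration of using the raw model for text generation, the sketch below loads it through the Hugging Face `transformers` text-generation pipeline and samples a few candidate reasons for one statement. Several things here are assumptions rather than details from this page: the helper names `build_prompt` and `generate_reasons` are hypothetical, the `separator` argument is a placeholder for whatever marker (if any) the model expects between the statement and the generated reason, and reusing 128 as the generation length limit simply mirrors the training sequence length.

```python
def build_prompt(statement: str, separator: str = "") -> str:
    """Build the text fed to the model for one statement.

    `separator` is a hypothetical placeholder: if the model expects a
    marker between the statement and the reason, pass it here.
    """
    return f"{statement.strip()} {separator}".strip()


def generate_reasons(statement: str, num_reasons: int = 3,
                     max_length: int = 128, separator: str = "") -> list[str]:
    """Sample `num_reasons` candidate explanations for `statement`."""
    # Imported lazily so build_prompt stays usable without transformers.
    from transformers import pipeline

    # Downloads the model weights on first use.
    generator = pipeline("text-generation", model="aliosm/ComVE-gpt2-large")
    outputs = generator(
        build_prompt(statement, separator),
        max_length=max_length,             # assumption: mirrors the training limit
        num_return_sequences=num_reasons,
        do_sample=True,                    # sampling is needed for distinct outputs
        return_full_text=False,            # return only the generated continuation
    )
    return [out["generated_text"].strip() for out in outputs]


# Example call (not run here because it downloads the model):
# for reason in generate_reasons("Chicken can swim in water."):
#     print(reason)
```

Sampling three outputs mirrors the three reference reasons per statement in the dataset, but the number is arbitrary. Outputs vary between runs because sampling is enabled.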