aliosm/ComVE-gpt2
The aliosm/ComVE-gpt2 model is a machine learning model.
About aliosm/ComVE-gpt2
The model is able to generate a reason why a given natural language statement is against commonsense . You can use the raw model for text generation to generate reasons why natural language statements are against common sense . Newer versions has some issue in text generation and the model repeats the last token generated again and again . The model achieved 14.0547/13.6534 BLEU scores on SemEval2020 Task4: Commonsense Validation and Explanation development and testing dataset. The model was trained on Nvidia Tesla P100 GPU from Google Colab platform with 5e-5 learning rate, 5 epochs, 128 maximum sequence length and 64 batch size. It is based on the Com,