Skip to content

Paraphrase and Semantic Similarity in Twitter (PIT)

ClassificationEnglish

Paraphrase and Semantic Similarity in Twitter (PIT) is a classification-focused dataset in English that provides 18,762 labeled examples distributed in Text format.

About Paraphrase and Semantic Similarity in Twitter (PIT)

Dataset focuses on whether tweets have (almost) same meaning/information or not.

Details

Task
Classification
Language
English
Format
Text
Rows / instances
18,762
Creator
Xu et al.
Year
2015
Download Paper

Related Classification datasets

FAQ