Armenian Paraphrase Detection Corpus (ARPA)

Paraphrase Identification, Armenian

Armenian Paraphrase Detection Corpus (ARPA) is an Armenian-language dataset for paraphrase identification. It provides 2,36 labeled examples in TSV format.

About Armenian Paraphrase Detection Corpus (ARPA)

The dataset was built for paraphrase detection in Armenian. It was collected from news texts: articles published over the last 10 years on the Hetq and Panarmenian news websites.

Details

Task: Paraphrase Identification
Language: Armenian
Format: TSV
Rows / instances: 2,36
Creator: Malajyan et al.
Year: 2020
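
Since the corpus is distributed as TSV, a minimal sketch of reading sentence pairs with Python's standard `csv` module may be useful. The column names (`sentence1`, `sentence2`, `label`), the 0/1 label encoding, and the sample rows are all assumptions made for illustration; this page does not document the actual ARPA file layout, so adjust them to match the real header.

```python
import csv
import io

# Invented placeholder rows, NOT taken from the corpus. The column names and
# the 0/1 label encoding are hypothetical; check the real file's header.
SAMPLE_TSV = (
    "sentence1\tsentence2\tlabel\n"
    "Նա գնաց տուն։\tՆա վերադարձավ տուն։\t1\n"
    "Այսօր անձրև է գալիս։\tԵս սիրում եմ սուրճ։\t0\n"
)


def read_pairs(handle):
    """Yield (sentence1, sentence2, label) tuples from a tab-separated file."""
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        yield row["sentence1"], row["sentence2"], int(row["label"])


pairs = list(read_pairs(io.StringIO(SAMPLE_TSV)))
paraphrases = [p for p in pairs if p[2] == 1]
print(len(pairs), len(paraphrases))
```

To read a downloaded file instead of the in-memory sample, pass `open(path, encoding="utf-8", newline="")` to `read_pairs`, where `path` is wherever the TSV was saved.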

FAQ