mteb/sts22-crosslingual-sts

Sentence Similarity · ARA, CMN, DEU

Created by mteb in 2022, mteb/sts22-crosslingual-sts is a sentence similarity dataset in Arabic (ARA), Mandarin Chinese (CMN), and German (DEU), distributed in Parquet format.

About mteb/sts22-crosslingual-sts

STS22.v2 is an MTEB (Massive Text Embedding Benchmark) dataset for SemEval 2022 Task 8: Multilingual News Article Similarity. Version 2 updates STS22 by removing pairs in which one of the entries contains empty sentences. Task categor...
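
The Version 2 filter described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the code MTEB used: the field names `sentence1`, `sentence2`, and `score`, the helper names, the sample rows, and the choice to treat whitespace-only text as empty are all assumptions made for this example.

```python
def has_text(value):
    """Return True if value is a string with at least one non-whitespace character."""
    return isinstance(value, str) and value.strip() != ""


def drop_empty_pairs(pairs, left_key="sentence1", right_key="sentence2"):
    """Keep only the pairs in which both entries contain text.

    The key names are hypothetical defaults; adjust them to the real column names.
    """
    return [
        pair
        for pair in pairs
        if has_text(pair.get(left_key)) and has_text(pair.get(right_key))
    ]


# Made-up rows for illustration only; they are not taken from the dataset.
sample_pairs = [
    {"sentence1": "Der Markt erholt sich.", "sentence2": "Die Börse legt zu.", "score": 3.0},
    {"sentence1": "", "sentence2": "Ein Artikel ohne Gegenstück.", "score": 1.0},
    {"sentence1": "Nur Leerzeichen folgen.", "sentence2": "   ", "score": 2.0},
]

filtered_pairs = drop_empty_pairs(sample_pairs)
print(len(filtered_pairs))
```

Only the first sample row survives, because the second has an empty first entry and the third has a whitespace-only second entry.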

Details

Task
Sentence Similarity
Language
ARA, CMN, DEU
Format
Parquet
Rows / instances
N/A
Creator
mteb
Year
2022