Mewsli-9 

Entity Linking · Multi-Lingual

Mewsli-9 is a multilingual entity linking dataset that provides 289,087 labeled examples distributed in TSV format.

About Mewsli-9 

The dataset consists of entity mentions linked to Wikidata, extracted from Wikinews articles. It covers 9 diverse languages, 5 language families, and 6 writing systems. It features many Wikidata entities that do not appear in English Wikipedia, thereby incentivizing research into multilingual entity linking against Wikidata at large. Languages: Japanese, German, Spanish, Arabic, Serbian, Turkish, Persian, Tamil, and English.
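
Since the data ships as TSV, a first sanity check might be to count mentions per language. The following is a minimal sketch only: the column names (`lang`, `mention`, `qid`) and the sample rows are hypothetical and chosen for illustration, so check the header of the actual release files before adapting it.

```python
import csv
import io
from collections import Counter


def count_mentions_by_language(tsv_file, lang_column="lang"):
    """Count rows per language in a tab-separated file that has a header row.

    `lang_column` is an assumed column name, not one confirmed by the dataset.
    """
    # QUOTE_NONE keeps stray quote characters inside mention text intact.
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    return Counter(row[lang_column] for row in reader)


# Tiny in-memory sample with made-up rows; real files will differ.
sample = io.StringIO(
    "lang\tmention\tqid\n"
    "de\tBerlin\tQ64\n"
    "es\tMadrid\tQ2807\n"
    "de\tHamburg\tQ1055\n"
)
counts = count_mentions_by_language(sample)
print(counts["de"], counts["es"])
```

To run this against a real file, pass an open file handle instead of the `StringIO` sample, for example `open(path, encoding="utf-8", newline="")`, where `path` points to wherever the TSV was downloaded.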

Details

Task: Entity Linking
Language: Multi-Lingual
Format: TSV
Rows / instances: 289,087
Creator: Botha et al.
Year: 2020