Mewsli-9
Entity Linking, Multi-Lingual
Mewsli-9 is a multilingual entity linking dataset that provides 289,087 labeled examples distributed in TSV format.
About Mewsli-9
The dataset consists of entity mentions linked to WikiData, extracted from WikiNews articles. It covers 9 diverse languages, 5 language families, and 6 writing systems. It features many WikiData entities that do not appear in English Wikipedia, thereby incentivizing research into multilingual entity linking against WikiData at large. Languages: Japanese, German, Spanish, Arabic, Serbian, Turkish, Persian, Tamil, and English.
Details
- Task: Entity Linking
- Language: Multi-Lingual
- Format: TSV
- Rows / instances: 289,087
- Creator: Botha et al.
- Year: 2020
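
Because the data is distributed as TSV, a first look at it can be done with Python's standard `csv` module. The following is a minimal sketch, not the dataset's official loader: the column names (`docid`, `mention`, `qid`) and the sample rows are assumptions made for illustration, and the real files may use a different header layout.

```python
import csv
import io
from collections import Counter


def count_mentions_per_entity(tsv_file, entity_column="qid"):
    """Count rows per entity ID in a TSV stream that has a header row.

    `entity_column` is an assumed column name; adjust it to the real header.
    """
    # QUOTE_NONE keeps quote characters inside mention text from being
    # interpreted as CSV quoting.
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    counts = Counter()
    for row in reader:
        counts[row[entity_column]] += 1
    return counts


# Tiny in-memory sample standing in for a real file (illustrative rows only).
sample = io.StringIO(
    "docid\tmention\tqid\n"
    "doc-1\tBerlin\tQ64\n"
    "doc-1\tGermany\tQ183\n"
    "doc-2\tBerlin\tQ64\n"
)
counts = count_mentions_per_entity(sample)
print(counts.most_common(1))  # [('Q64', 2)]
```

For a file on disk, the same function can be given a handle opened with `open(path, newline="", encoding="utf-8")`, which matters here because the mentions span six writing systems.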