Skip to content

GEM/wiki_lingua

SummarizationAR, CS, DEcc-by-nc-sa-3.0

Created by GEM at 2022, the GEM/wiki_lingua is a summarization dataset in AR, CS, DE in Parquet format. With 1.4K downloads and 50 likes, it is actively used by the community. It is released under the cc-by-nc-sa-3.0 license.

About GEM/wiki_lingua

WikiLingua is a large-scale multilingual dataset for the evaluation of crosslingual abstractive summarization systems. The dataset includes ~770k article and summary pairs in 18 languages from WikiHow. The gold-standard article-summary alignments ...

Details

Task
Summarization
Language
AR, CS, DE
Format
Parquet
Rows / instances
N/A
Creator
GEM
Year
2022
License
cc-by-nc-sa-3.0
Downloads
1380
Likes
50
Download Homepage

Related Summarization datasets

FAQ