
esdurmus/wiki_lingua

Summarization | AR, CS, DE | cc-by-3.0

esdurmus/wiki_lingua is a summarization dataset from esdurmus covering Arabic (AR), Czech (CS), and German (DE), with 273,824 records in Parquet format. It is distributed under the cc-by-3.0 license, falls in the 100K<n<1M size category, and has been downloaded 744 times.

About esdurmus/wiki_lingua

Dataset Card for "wiki_lingua"

Dataset Summary

We introduce WikiLingua, a large-scale, multilingual dataset for the evaluation of cross-lingual abstractive summarization systems. We extract article and summary pairs in 18 languages f...

Details

Task: Summarization
Language: AR, CS, DE
Format: Parquet
Rows / instances: 273,824
Size: 100K<n<1M
Creator: esdurmus
Year: 2022
License: cc-by-3.0
Downloads: 744
Likes: 53

