esdurmus/wiki_lingua
SummarizationAR, CS, DEcc-by-3.0
Esdurmus/wiki_lingua is a summarization dataset in AR, CS, DE from esdurmus with 273,824 records in Parquet format. It is distributed under the cc-by-3.0 license and falls in the 100K<n<1M size category, and has been downloaded 744 times.
About esdurmus/wiki_lingua
Dataset Card for "wiki_lingua"
Dataset Summary
We introduce WikiLingua, a large-scale, multilingual dataset for the evaluation of cross-lingual abstractive summarization systems. We extract article and summary pairs in 18 languages f...
Details
- Task
- Summarization
- Language
- AR, CS, DE
- Format
- Parquet
- Rows / instances
- 273824
- Size
- 100K<n<1M
- Creator
- esdurmus
- Year
- 2022
- License
- cc-by-3.0
- Downloads
- 744
- Likes
- 53