reciTAL/mlsum

Summarization, Translation, Text Classification | DE, ES, FR

The reciTAL/mlsum dataset is a summarization resource in German, Spanish, and French (DE, ES, FR) from reciTAL, dated 2022.

About reciTAL/mlsum

We present MLSUM, the first large-scale MultiLingual SUMmarization dataset. Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five different languages -- namely, French, German, Spanish, Russian, Turkish. Together with En...

Details

Task: Summarization, Translation, Text Classification
Language: DE, ES, FR
Format: Parquet
Rows / instances: N/A
Creator: reciTAL
Year: 2022
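
The details above suggest how the data might be loaded in practice. The following is a minimal sketch, not an official recipe. It assumes the dataset is published on the Hugging Face Hub under the identifier reciTAL/mlsum, that each language is exposed as a configuration named by its lowercase code, and that the third-party `datasets` package is installed. The helper names `normalize_language` and `load_mlsum` are hypothetical and introduced only for illustration.

```python
# Languages listed on this page; other configurations may exist but are not assumed here.
SUPPORTED_LANGUAGES = {"de", "es", "fr"}


def normalize_language(code: str) -> str:
    """Return a lowercase language code, or raise if it is not listed on this page."""
    lang = code.strip().lower()
    if lang not in SUPPORTED_LANGUAGES:
        raise ValueError(
            f"Unsupported language {code!r}; expected one of {sorted(SUPPORTED_LANGUAGES)}"
        )
    return lang


def load_mlsum(code: str, split: str = "train"):
    """Load one language split (hypothetical helper).

    Assumption: the Hub identifier is "reciTAL/mlsum" and the configuration
    name equals the lowercase language code. Requires network access.
    """
    # Imported lazily so the validation helper works without the package installed.
    from datasets import load_dataset

    return load_dataset("reciTAL/mlsum", normalize_language(code), split=split)
```

Calling `load_mlsum("DE")` would then request the German training split, provided the assumptions above hold. Validating the code first keeps a typo from triggering a slow failed download.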
FAQ