reciTAL/mlsum
SummarizationTranslationText ClassificationDE, ES, FR
The reciTAL/mlsum dataset is a DE, ES, FR summarization resource from reciTAL at 2022.
About reciTAL/mlsum
We present MLSUM, the first large-scale MultiLingual SUMmarization dataset.
Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five different languages -- namely, French, German, Spanish, Russian, Turkish.
Together with En...
Details
- Task
- Summarization, Translation, Text Classification
- Language
- DE, ES, FR
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- reciTAL
- Year
- 2022