Skip to content

Summarization Datasets

There are 18 summarization datasets in our directory. Each links to its source, paper, and download — browse the full list below or filter by language.

Summarization is the task of condensing a longer document into a shorter version that retains its key points. We catalog 18 datasets for it.

Updated June 2026

What languages do summarization datasets cover?

Explore other dataset tasks

Frequently asked questions