Skip to content

PleIAs/YouTube-Commons

Text GenerationEN, FR, ES

Created by PleIAs at 2024, the PleIAs/YouTube-Commons is a text generation dataset in EN, FR, ES in Parquet format.

About PleIAs/YouTube-Commons

📺 YouTube-Commons 📺 YouTube-Commons is a collection of audio transcripts of 2,063,066 videos shared on YouTube under a CC-By license. Content The collection comprises 22,709,724 original and automatically translated transcripts from ...

Details

Task
Text Generation
Language
EN, FR, ES
Format
Parquet
Rows / instances
N/A
Creator
PleIAs
Year
2024
Download

Related Text Generation datasets

FAQ