PleIAs/YouTube-Commons
Text Generation (EN, FR, ES)
Created by PleIAs in 2024, PleIAs/YouTube-Commons is a text generation dataset in English, French, and Spanish, distributed in Parquet format.
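Because the data is distributed as Parquet, one way to peek at a few rows without downloading everything is to stream it. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that the dataset exposes a `train` split; the helper name `stream_rows` and the split name are assumptions for illustration, not something stated on this page.

```python
REPO_ID = "PleIAs/YouTube-Commons"  # dataset identifier as given in the title


def stream_rows(limit: int = 3, split: str = "train"):
    """Return the first `limit` rows of the dataset as a list of dicts.

    Hypothetical helper: the split name "train" is an assumption.
    Streaming avoids downloading every Parquet file up front.
    """
    # Imported lazily so this module can be loaded without `datasets` installed.
    from datasets import load_dataset

    ds = load_dataset(REPO_ID, split=split, streaming=True)
    rows = []
    for i, row in enumerate(ds):
        if i >= limit:
            break
        rows.append(row)
    return rows
```

Calling `stream_rows(2)` requires network access; inspecting the keys of the first returned row is a simple way to discover the actual column names, which are not listed here.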
About PleIAs/YouTube-Commons
📺 YouTube-Commons 📺
YouTube-Commons is a collection of audio transcripts from 2,063,066 videos shared on YouTube under a CC-BY license.
Content
The collection comprises 22,709,724 original and automatically translated transcripts from ...
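To put the two figures in relation, the short sketch below divides the transcript count by the video count. It uses only the numbers quoted above; the function name `transcripts_per_video` is introduced here for illustration.

```python
# Figures quoted in the dataset description.
VIDEOS = 2_063_066
TRANSCRIPTS = 22_709_724


def transcripts_per_video(transcripts: int, videos: int) -> float:
    """Average number of transcripts (original plus translated) per video."""
    if videos <= 0:
        raise ValueError("videos must be positive")
    return transcripts / videos


ratio = transcripts_per_video(TRANSCRIPTS, VIDEOS)
print(f"{ratio:.2f} transcripts per video on average")
```

This works out to roughly 11 transcripts per video, which is consistent with each video carrying its original transcript plus several automatic translations.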
Details
- Task: Text Generation
- Language: EN, FR, ES
- Format: Parquet
- Rows / instances: N/A
- Creator: PleIAs
- Year: 2024