facebook/voxpopuli
Automatic Speech Recognition | EN, DE, FR | cc0-1.0
The facebook/voxpopuli dataset is an automatic speech recognition resource covering EN, DE, and FR, published by facebook in 2022 and comprising 1,255,237 examples. It has 18.4K downloads and 153 likes. It is released under the cc0-1.0 license and falls in the 1M<n<10M size category.
About facebook/voxpopuli
Dataset Card for VoxPopuli
Dataset Summary
VoxPopuli is a large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation.
The raw data is collected from 2009-2020 European Parliament e...
Details
- Task: Automatic Speech Recognition
- Language: EN, DE, FR
- Format: Parquet
- Rows / instances: 1,255,237
- Size: 1M<n<10M
- Creator: facebook
- Year: 2022
- License: cc0-1.0
- Downloads: 18,439
- Likes: 153
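
As a rough illustration of how the details above might be used, the sketch below records the listed metadata and wraps a streaming load with the Hugging Face `datasets` library. The helper names `in_size_bucket` and `stream_split` are hypothetical, and the sketch assumes that each language is exposed as a config named by its lowercase code and that a `train` split exists; neither is stated in this listing.

```python
# Metadata copied from the listing above.
REPO_ID = "facebook/voxpopuli"
LANGUAGES = ("en", "de", "fr")  # assumption: configs use lowercase language codes
NUM_ROWS = 1_255_237


def in_size_bucket(n, low=1_000_000, high=10_000_000):
    """Return True when n falls in the listing's 1M<n<10M size category."""
    return low < n < high


def stream_split(language="en", split="train"):
    """Hypothetical helper: lazily stream one language config of the dataset.

    Assumes the `datasets` library is installed and that the split name
    "train" exists; both are assumptions, not facts from this listing.
    """
    if language not in LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")
    # Imported here so the metadata helpers work without `datasets` installed.
    from datasets import load_dataset

    # streaming=True avoids downloading the full corpus up front.
    return load_dataset(REPO_ID, language, split=split, streaming=True)
```

Calling `stream_split("de")` would return an iterable dataset that fetches examples on demand, which is usually preferable for a corpus of this size to downloading everything first.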