facebook/voxpopuli

Automatic Speech Recognition | EN, DE, FR | cc0-1.0

The facebook/voxpopuli dataset is an automatic speech recognition resource covering English, German, and French (EN, DE, FR), published by facebook in 2022 and comprising 1,255,237 examples. It has 18.4K downloads and 153 likes, is released under the cc0-1.0 license, and falls in the 1M<n<10M size category.

About facebook/voxpopuli

Dataset Card for VoxPopuli. Dataset Summary: VoxPopuli is a large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation. The raw data is collected from 2009-2020 European Parliament e...

Details

Task: Automatic Speech Recognition
Language: EN, DE, FR
Format: Parquet
Rows / instances: 1,255,237
Size: 1M<n<10M
Creator: facebook
Year: 2022
License: cc0-1.0
Downloads: 18,439
Likes: 153
