disco-eth/EuroSpeech
Automatic Speech Recognition, Text To Speech; DE, EN, MT
disco-eth/EuroSpeech is a dataset for automatic speech recognition and text to speech in DE, EN, and MT that provides 12,260,891 labeled examples in Parquet format. Its license is listed as "other", it falls in the 10M<n<100M size category, and it has been downloaded 24,250 times.
About disco-eth/EuroSpeech
EuroSpeech Dataset
Dataset Description
EuroSpeech is a large-scale multilingual speech corpus containing high-quality aligned parliamentary speech across 22 European languages. The dataset was constructed by processing parliamentary ...
Details
- Task: Automatic Speech Recognition, Text To Speech
- Language: DE, EN, MT
- Format: Parquet
- Rows / instances: 12,260,891
- Size: 10M<n<100M
- Creator: disco-eth
- Year: 2025
- License: other
- Downloads: 24,250
- Likes: 94
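
The size category and the row count listed above can be cross-checked with a few lines of Python. This is only a minimal sketch: the helper names `parse_bound` and `in_size_category` are hypothetical, and the suffix table (K, M, B, T as powers of one thousand) is an assumption about how the `10M<n<100M` notation is meant to be read.

```python
import re

# Assumed meaning of the suffixes used in size-category labels.
_SUFFIX = {"": 1, "K": 1_000, "M": 1_000_000, "B": 1_000_000_000, "T": 1_000_000_000_000}


def parse_bound(token: str) -> int:
    """Convert a bound such as '10M' into an integer (hypothetical helper)."""
    match = re.fullmatch(r"(\d+)([KMBT]?)", token.strip())
    if match is None:
        raise ValueError(f"unrecognized bound: {token!r}")
    number, suffix = match.groups()
    return int(number) * _SUFFIX[suffix]


def in_size_category(rows: int, category: str) -> bool:
    """Return True if rows lies strictly inside a 'LOW<n<HIGH' category."""
    low, middle, high = category.split("<")
    if middle.strip() != "n":
        raise ValueError(f"unrecognized category: {category!r}")
    return parse_bound(low) < rows < parse_bound(high)


# Row count and size category as listed in the details above.
print(in_size_category(12_260_891, "10M<n<100M"))
```

With the values from the listing, 12,260,891 lies between 10,000,000 and 100,000,000, so the stated size category is consistent with the stated row count.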