Skip to content

disco-eth/EuroSpeech

Automatic Speech RecognitionText To SpeechDE, EN, MT

Disco-eth/EuroSpeech is a automatic speech recognition-focused dataset in DE, EN, MT that provides 12,260,891 labeled examples distributed in Parquet format. It is distributed under the other license and falls in the 10M<n<100M size category, and has been downloaded 24.3K times.

About disco-eth/EuroSpeech

EuroSpeech Dataset Dataset Description EuroSpeech is a large-scale multilingual speech corpus containing high-quality aligned parliamentary speech across 22 European languages. The dataset was constructed by processing parliamentary ...

Details

Task
Automatic Speech Recognition, Text To Speech
Language
DE, EN, MT
Format
Parquet
Rows / instances
12260891
Size
10M<n<100M
Creator
disco-eth
Year
2025
License
other
Downloads
24250
Likes
94
Download Homepage

Related Automatic Speech Recognition, Text To Speech datasets

FAQ