Speech Recognition Datasets
There are 5 speech recognition datasets in our directory. Each links to its source, paper, and download — browse the full list below or filter by language.
Speech Recognition is the task of transcribing spoken audio into written text. We catalog 5 datasets for it.
Updated June 2026
- Common VoiceSpeech RecognitionMulti-Lingual
- Microsoft Information-Seeking Conversation (MISC) datasetSpeech Recognition, Dialogue, VisualEnglish
- Microsoft Speech Language Translation Corpus (MSLT)Speech Recognition, Machine TranslationMulti-Lingual
- Voices Obscured in Complex Environmental Settings (VOiCES)Speech RecognitionEnglish
- VoxCelebSpeech Recognition, VisualMulti-Lingual