zhifeixie/Voices-in-the-Wild-2M
Automatic Speech RecognitionEN, ZH
Zhifeixie/Voices-in-the-Wild-2M is a automatic speech recognition-focused dataset in EN, ZH distributed in Parquet format.
About zhifeixie/Voices-in-the-Wild-2M
Voices in the Wild
Project Page | Paper | GitHub
Voices in the Wild (Voices-in-the-Wild-2M) is a large-scale automatic speech recognition (ASR) dataset designed for robustness training and evaluation under diverse, real-world acoustic condition...
Details
- Task
- Automatic Speech Recognition
- Language
- EN, ZH
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- zhifeixie
- Year
- 2026