speechcolab/gigaspeech
Automatic Speech Recognition, Text To Speech, Text To Audio, EN
speechcolab/gigaspeech is an English-language dataset focused on automatic speech recognition, distributed in Parquet format.
About speechcolab/gigaspeech
Dataset Card for GigaSpeech
Dataset Description
GigaSpeech is an evolving, multi-domain English speech recognition corpus with 10,000 hours of high-quality labeled audio suitable for supervised training. The transcribed audio data is...
Details
- Task: Automatic Speech Recognition, Text To Speech, Text To Audio
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: speechcolab
- Year: 2022
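Given the details above, a minimal sketch of streaming a few examples with the Hugging Face `datasets` library might look like the following. The repository id comes from this page; the subset name `"xs"`, the `"train"` split, and the helper name `stream_examples` are assumptions for illustration, and access may require accepting the dataset's terms and logging in first.

```python
from typing import Any, Dict, Iterator

DATASET_ID = "speechcolab/gigaspeech"  # repository id as listed on this page
SUBSET = "xs"  # assumption: name of a small subset; check the dataset card


def stream_examples(limit: int = 3, subset: str = SUBSET) -> Iterator[Dict[str, Any]]:
    """Yield up to `limit` examples without downloading the full corpus."""
    # Imported lazily so this module can be imported without the library installed.
    from datasets import load_dataset

    # streaming=True iterates over the remote files instead of downloading them all.
    # Assumption: a "train" split exists for the chosen subset.
    ds = load_dataset(DATASET_ID, subset, split="train", streaming=True)
    for i, example in enumerate(ds):
        if i >= limit:
            break
        yield example


if __name__ == "__main__":
    # Print only the field names, since the exact schema is not stated on this page.
    for example in stream_examples():
        print(sorted(example.keys()))
```

Streaming is used here because the corpus is described as 10,000 hours of audio, so inspecting a handful of rows first is usually cheaper than a full download.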