Skip to content

speechcolab/gigaspeech

Automatic Speech RecognitionText To SpeechText To AudioEN

Speechcolab/gigaspeech is a automatic speech recognition-focused dataset in EN distributed in Parquet format.

About speechcolab/gigaspeech

Dataset Card for Gigaspeech Dataset Description GigaSpeech is an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training. The transcribed audio data is...

Details

Task
Automatic Speech Recognition, Text To Speech, Text To Audio
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
speechcolab
Year
2022
Download

Related Automatic Speech Recognition, Text To Speech, Text To Audio datasets

FAQ