Question 1

What is the speechcolab/gigaspeech dataset?

Accepted Answer

Dataset Card for Gigaspeech

Dataset Description

GigaSpeech is an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training. The transcribed audio data is...

Question 2

Is speechcolab/gigaspeech a benchmark?

Accepted Answer

speechcolab/gigaspeech is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download speechcolab/gigaspeech?

Accepted Answer

speechcolab/gigaspeech is available at its source: https://huggingface.co/datasets/speechcolab/gigaspeech.

speechcolab/gigaspeech

About speechcolab/gigaspeech

Details

Related Automatic Speech Recognition, Text To Speech, Text To Audio datasets

FAQ