Skip to content

speechcolab/gigaspeech2

Automatic Speech RecognitionTH, ID, VI

Speechcolab/gigaspeech2 is a automatic speech recognition dataset in TH, ID, VI from speechcolab in Parquet format.

About speechcolab/gigaspeech2

Dataset Card for GigaSpeech 2 Dataset Description GigaSpeech 2 is an evolving, large-scale, multi-domain, and multilingual ASR corpus focusing on low-resource languages. GigaSpeech 2 raw comprises about 30,000 hours of automatically ...

Details

Task
Automatic Speech Recognition
Language
TH, ID, VI
Format
Parquet
Rows / instances
N/A
Creator
speechcolab
Year
2024
Download

Related Automatic Speech Recognition datasets

FAQ