speechcolab/gigaspeech2
Automatic Speech RecognitionTH, ID, VI
Speechcolab/gigaspeech2 is a automatic speech recognition dataset in TH, ID, VI from speechcolab in Parquet format.
About speechcolab/gigaspeech2
Dataset Card for GigaSpeech 2
Dataset Description
GigaSpeech 2 is an evolving, large-scale, multi-domain, and multilingual ASR corpus focusing on low-resource languages. GigaSpeech 2 raw comprises about 30,000 hours of automatically ...
Details
- Task
- Automatic Speech Recognition
- Language
- TH, ID, VI
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- speechcolab
- Year
- 2024