Skip to content

legacy-datasets/common_voice

Automatic Speech RecognitionAB, AR, AS

Legacy-datasets/common_voice is a automatic speech recognition dataset in AB, AR, AS from legacy-datasets in Parquet format.

About legacy-datasets/common_voice

Common Voice is Mozilla's initiative to help teach machines how real people speak. The dataset currently consists of 7,335 validated hours of speech in 60 languages, but we’re always adding more voices and languages.

Details

Task
Automatic Speech Recognition
Language
AB, AR, AS
Format
Parquet
Rows / instances
N/A
Creator
legacy-datasets
Year
2022
Download

Related Automatic Speech Recognition datasets

FAQ