Skip to content

bofenghuang/stt-pseudo-labeled-whisper-large-v3-multilingual

General NLPEnglishcc-by-3.0

Bofenghuang/stt-pseudo-labeled-whisper-large-v3-multilingual is a General NLP dataset in English from bofenghuang in Parquet format. It is distributed under the cc-by-3.0 license, and has been downloaded 14.2K times.

About bofenghuang/stt-pseudo-labeled-whisper-large-v3-multilingual

This collection includes over 189,000 hours of speech-to-text data in seven languages: English, French, Spanish, Portuguese, Italian, German, and Dutch All segments were initially sorted by their IDs (timestamps). Adjacent segments from the same s...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Creator
bofenghuang
Year
2024
License
cc-by-3.0
Downloads
14156
Likes
4
Download Homepage

Related General NLP datasets

FAQ