bofenghuang/stt-pseudo-labeled-whisper-large-v3-multilingual
General NLPEnglishcc-by-3.0
Bofenghuang/stt-pseudo-labeled-whisper-large-v3-multilingual is a General NLP dataset in English from bofenghuang in Parquet format. It is distributed under the cc-by-3.0 license, and has been downloaded 14.2K times.
About bofenghuang/stt-pseudo-labeled-whisper-large-v3-multilingual
This collection includes over 189,000 hours of speech-to-text data in seven languages: English, French, Spanish, Portuguese, Italian, German, and Dutch
All segments were initially sorted by their IDs (timestamps). Adjacent segments from the same s...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- bofenghuang
- Year
- 2024
- License
- cc-by-3.0
- Downloads
- 14156
- Likes
- 4