1111xxx/zoengjyutgaai
Automatic Speech RecognitionText To SpeechText GenerationFeature ExtractionAudio To AudioAudio ClassificationText To AudioYUEcc0-1.0
1111xxx/zoengjyutgaai is a automatic speech recognition dataset in YUE from 1111xxx with 116,587 records in Parquet format. It is distributed under the cc0-1.0 license and falls in the 10K<n<100K size category, and has been downloaded 59.5K times.
About 1111xxx/zoengjyutgaai
張悦楷講古語音數據集
English
呢個係張悦楷講《三國演義》、《水滸傳》、《走進毛澤東的最後歲月》、《鹿鼎記》語音數據集。張悦楷係廣州最出名嘅講古佬 / 粵語説書藝人。佢從上世紀七十年代開始就喺廣東各個收音電台度講古,佢把聲係好多廣州人嘅共同回憶。本數據集收集嘅係佢最知名嘅四部作品。
數據集用途:
TTS(語音合成)訓練集
ASR(語音識別)訓練集或測試集
各種語言學、文學研究
直接聽嚟欣賞藝術!
TTS 效果演示:https://huggingface.co/spaces/...
Details
- Task
- Automatic Speech Recognition, Text To Speech, Text Generation, Feature Extraction, Audio To Audio, Audio Classification, Text To Audio
- Language
- YUE
- Format
- Parquet
- Rows / instances
- 116587
- Size
- 10K<n<100K
- Creator
- 1111xxx
- Year
- 2026
- License
- cc0-1.0
- Downloads
- 59476
- Likes
- 0