SparkAudio/voxbox

Text To Speech | ZH, EN

SparkAudio/voxbox is a text-to-speech dataset in Chinese (ZH) and English (EN) from SparkAudio, provided in Parquet format.

About SparkAudio/voxbox

VoxBox: This dataset is a curated collection of bilingual speech corpora annotated with clean transcriptions and rich metadata, including age, gender, and emotion.

Dataset Structure:

.
├── audios/
│   └── aishell-3/ # Aud...
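
As a rough illustration of how the metadata described above might be used, the following sketch selects records by attribute from a few made-up rows. The field names (`text`, `language`, `age`, `gender`, `emotion`), the sample values, and the helper `filter_rows` are all assumptions for illustration. This page does not show the actual VoxBox schema, which may differ.

```python
# Hypothetical records: field names and values are illustrative assumptions,
# not the actual VoxBox schema.
SAMPLE_ROWS = [
    {"text": "你好", "language": "ZH", "age": "adult", "gender": "female", "emotion": "neutral"},
    {"text": "Hello there", "language": "EN", "age": "adult", "gender": "male", "emotion": "happy"},
    {"text": "Good morning", "language": "EN", "age": "youth", "gender": "female", "emotion": "happy"},
]


def filter_rows(rows, **criteria):
    """Return the rows whose fields equal every given criterion (hypothetical helper)."""
    return [
        row for row in rows
        if all(row.get(key) == value for key, value in criteria.items())
    ]


# Select English utterances labeled with a "happy" emotion.
english_happy = filter_rows(SAMPLE_ROWS, language="EN", emotion="happy")
print([row["text"] for row in english_happy])
```

The same kind of filter could be applied to rows read from the Parquet files once the real column names are known.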

Details

Task: Text To Speech
Language: ZH, EN
Format: Parquet
Rows / instances: N/A
Creator: SparkAudio
Year: 2025