davidchan/anim400k
Text To SpeechAutomatic Speech RecognitionAudio To AudioAudio ClassificationText ClassificationVideo ClassificationSummarizationEN, JA
The davidchan/anim400k dataset is a EN, JA text to speech resource from davidchan at 2024. With 231 downloads and 41 likes, it is actively used by the community and is a 100K<n<1M-scale dataset.
About davidchan/anim400k
Anim-400K: A dataset designed from the ground up for automated dubbing of video
What is Anim-400K?
Anim-400K is a large-scale dataset of aligned audio-video clips in both the English and Japanese languages. It is comprised of over 4...
Details
- Task
- Text To Speech, Automatic Speech Recognition, Audio To Audio, Audio Classification, Text Classification, Video Classification, Summarization
- Language
- EN, JA
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 100K<n<1M
- Creator
- davidchan
- Year
- 2024
- Downloads
- 231
- Likes
- 41