Skip to content

davidchan/anim400k

Text To SpeechAutomatic Speech RecognitionAudio To AudioAudio ClassificationText ClassificationVideo ClassificationSummarizationEN, JA

The davidchan/anim400k dataset is a EN, JA text to speech resource from davidchan at 2024. With 231 downloads and 41 likes, it is actively used by the community and is a 100K<n<1M-scale dataset.

About davidchan/anim400k

Anim-400K: A dataset designed from the ground up for automated dubbing of video What is Anim-400K? Anim-400K is a large-scale dataset of aligned audio-video clips in both the English and Japanese languages. It is comprised of over 4...

Details

Task
Text To Speech, Automatic Speech Recognition, Audio To Audio, Audio Classification, Text Classification, Video Classification, Summarization
Language
EN, JA
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
davidchan
Year
2024
Downloads
231
Likes
41
Download Homepage

FAQ