Skip to content

cvssp/WavCaps

General NLPENcc-by-4.0

Created by cvssp at 2023, the cvssp/WavCaps is a General NLP dataset in EN in Parquet format. With 5.3K downloads and 55 likes, it is actively used by the community. It is released under the cc-by-4.0 license and is a n<1K-scale dataset.

About cvssp/WavCaps

WavCaps WavCaps is a ChatGPT-assisted weakly-labelled audio captioning dataset for audio-language multimodal research, where the audio clips are sourced from three websites (FreeSound, BBC Sound Effects, and SoundBible) and a sound event detect...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Size
n<1K
Creator
cvssp
Year
2023
License
cc-by-4.0
Downloads
5270
Likes
55
Download Homepage

Related General NLP datasets

FAQ