nvidia/AudioSkills

Audio Text To Text | EN

nvidia/AudioSkills is an audio-text-to-text dataset in English (EN) from NVIDIA, distributed in Parquet format.

About nvidia/AudioSkills

AudioSkills-XL Dataset

Dataset Description

AudioSkills-XL is a large-scale audio question-answering (AQA) dataset designed to develop (large) audio-language models on expert-level reasoning and problem-sol...

Details

Task: Audio Text To Text
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: nvidia
Year: 2025