nvidia/AudioSkills
Audio Text To Text | EN
nvidia/AudioSkills is an English (EN) audio-text-to-text dataset from nvidia, distributed in Parquet format.
About nvidia/AudioSkills
AudioSkills-XL Dataset
Dataset Description
AudioSkills-XL is a large-scale audio question-answering (AQA) dataset designed to develop (large) audio-language models on expert-level reasoning and problem-sol...
Details
- Task: Audio Text To Text
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: nvidia
- Year: 2025