jamescalam/youtube-transcriptions
Question Answering, Text Retrieval, Visual Question Answering, EN
jamescalam/youtube-transcriptions is an English-language (EN) dataset aimed at question answering, distributed in Parquet format under the afl-3.0 license. It falls in the 100K<n<1M size category and has been downloaded 982 times.
About jamescalam/youtube-transcriptions
The YouTube transcriptions dataset contains technical tutorials (currently from James Briggs, Daniel Bourke, and AI Coffee Break) transcribed using OpenAI's Whisper (large). Each row represents roughly a sentence-length chunk of text alongside the...
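Because each row is only about a sentence long, retrieval and question answering pipelines often work better on slightly longer passages. The following is a minimal sketch of one way to merge consecutive rows into overlapping windows. It does not load the dataset itself: the `text` key, the `merge_chunks` helper, and the `window` and `stride` parameters are all assumptions made for illustration, not names confirmed by the dataset description.

```python
from typing import Dict, List


def merge_chunks(rows: List[Dict], window: int = 4, stride: int = 2) -> List[Dict]:
    """Merge consecutive sentence-length rows into overlapping passages.

    `rows` is assumed to be an ordered list of dicts that each carry a
    "text" key (a hypothetical field name used only for this sketch).
    """
    if window < 1 or stride < 1:
        raise ValueError("window and stride must be positive")

    passages = []
    for start in range(0, len(rows), stride):
        group = rows[start:start + window]
        passages.append({
            "text": " ".join(r["text"] for r in group),
            "first_row": start,                   # index of the first merged row
            "last_row": start + len(group) - 1,   # index of the last merged row
        })
        # Stop once a window has reached the end of the input,
        # so no trailing window is a pure subset of the previous one.
        if start + window >= len(rows):
            break
    return passages


if __name__ == "__main__":
    # Toy rows standing in for sentence-length transcript chunks.
    demo = [{"text": s} for s in ["a", "b", "c", "d", "e"]]
    for passage in merge_chunks(demo, window=3, stride=2):
        print(passage)
```

A stride smaller than the window gives overlapping passages, which helps when an answer spans a boundary between two chunks; setting `stride` equal to `window` gives non-overlapping passages instead.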
Details
- Task: Question Answering, Text Retrieval, Visual Question Answering
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Size: 100K<n<1M
- Creator: jamescalam
- Year: 2022
- License: afl-3.0
- Downloads: 982
- Likes: 44