jamescalam/youtube-transcriptions

Question Answering, Text Retrieval, Visual Question Answering, EN

jamescalam/youtube-transcriptions is an English-language (EN) question answering dataset distributed in Parquet format under the afl-3.0 license. It falls in the 100K<n<1M size category and has been downloaded 982 times.

About jamescalam/youtube-transcriptions

The YouTube transcriptions dataset contains technical tutorials (currently from James Briggs, Daniel Bourke, and AI Coffee Break) transcribed using OpenAI's Whisper (large). Each row represents roughly a sentence-length chunk of text alongside the...
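
A minimal sketch of loading the dataset for inspection. It assumes the dataset is hosted on the Hugging Face Hub under the ID shown on this page, that the `datasets` library is installed, and that a `train` split exists; none of this is stated on the page itself. The helper name `load_transcriptions` is hypothetical.

```python
# Assumption: the dataset ID on this page is also its Hugging Face Hub ID.
DATASET_ID = "jamescalam/youtube-transcriptions"


def load_transcriptions(split="train"):
    """Download (or reuse the local cache of) the dataset and return one split.

    Assumption: a "train" split exists; pass a different name if it does not.
    """
    # Imported lazily so that defining this helper does not require the library.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)
```

Calling `load_transcriptions()` and printing the result should show the column names and row count, which is a quick way to check the actual schema before building a retrieval or question answering pipeline on top of it.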

Details

Task: Question Answering, Text Retrieval, Visual Question Answering
Language: EN
Format: Parquet
Rows / instances: N/A
Size: 100K<n<1M
Creator: jamescalam
Year: 2022
License: afl-3.0
Downloads: 982
Likes: 44

FAQ