Skip to content

omegalabsinc/omega-multimodal

Video Text To TextVideo ClassificationImage ClassificationImage To TextImage To VideoImage Feature ExtractionVisual Question AnsweringAudio ClassificationAudio To AudioText To AudioText To ImageText To SpeechText To VideoEnglish

Omegalabsinc/omega-multimodal is a video text to text-focused dataset in English distributed in Parquet format.

About omegalabsinc/omega-multimodal

OMEGA Labs Bittensor Subnet: Multimodal Dataset for AGI Research Introduction The OMEGA Labs Bittensor Subnet Dataset is a groundbreaking resource for accelerating Artificial General Intelligence (AGI) research and development. This...

Details

Task
Video Text To Text, Video Classification, Image Classification, Image To Text, Image To Video, Image Feature Extraction, Visual Question Answering, Audio Classification, Audio To Audio, Text To Audio, Text To Image, Text To Speech, Text To Video
Language
English
Format
Parquet
Rows / instances
N/A
Creator
omegalabsinc
Year
2024
Download

FAQ