Skip to content

HuggingFaceFV/finevideo

Visual Question AnsweringVideo Text To TextEN

HuggingFaceFV/finevideo is a visual question answering dataset in EN from HuggingFaceFV with 43,751 records in Parquet format. It is distributed under the cc license and falls in the 10K<n<100K size category, and has been downloaded 18.2K times.

About HuggingFaceFV/finevideo

FineVideo FineVideo Description Dataset Explorer Revisions Dataset Distribution How to download and use FineVideo Using datasets Using huggingface_hub Load a subset of the dataset Dataset StructureData Instances Data Fields Dataset...

Details

Task
Visual Question Answering, Video Text To Text
Language
EN
Format
Parquet
Rows / instances
43751
Size
10K<n<100K
Creator
HuggingFaceFV
Year
2024
License
cc
Downloads
18189
Likes
367
Download Homepage

Related Visual Question Answering, Video Text To Text datasets

FAQ