TVR Dataset

by Jie Lei
Research Only

TV show Retrieval (TVR) is a multimodal retrieval task in which, given a natural language query, a short video moment must be localized within a large corpus of videos with subtitles. The associated TVR dataset is a large-scale, high-quality dataset consisting of 108,965 queries on 21,793 videos from 6 TV shows of diverse genres (five queries per video on average). Each query is annotated with a tight temporal alignment to its moment.
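
To make the task description concrete, here is a minimal Python sketch of what a query annotation might look like and how a predicted moment could be compared with the annotated one. The record layout, the field names, and the helper `temporal_iou` are assumptions for illustration only and are not the dataset's official schema or evaluation code. Temporal intersection-over-union is a common way to score moment localization, and it is used here on that basis.

```python
from dataclasses import dataclass


@dataclass
class MomentQuery:
    """Hypothetical record for one query; field names are illustrative only."""
    query: str       # natural language description of the moment
    video_id: str    # identifier of the video containing the moment
    start: float     # annotated moment start, in seconds
    end: float       # annotated moment end, in seconds


def temporal_iou(pred: tuple[float, float], gt: tuple[float, float]) -> float:
    """Intersection-over-union of two (start, end) intervals in seconds."""
    intersection = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - intersection
    return intersection / union if union > 0 else 0.0


# Made-up example values, not taken from the dataset.
example = MomentQuery(
    query="A character hands another character a cup of coffee.",
    video_id="example_clip_00",
    start=15.0,
    end=25.0,
)
score = temporal_iou((10.0, 20.0), (example.start, example.end))
```

In a corpus-level setting, a prediction would usually also have to name the correct video before its interval is compared, so a retrieval system would check `video_id` first and apply `temporal_iou` only on a match.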

Dataset Attributes

Tasks: Video Retrieval
Categories: Pop Culture
Sensor: RGB Camera