PHD2

by Ana Garcia del Molino,Michael GygliUnknown

PHD2

The dataset contains information on what video segments a specific user considers a highlight. Having this kind of data allows for strong personalization models, as specific examples of what a user is interested in help models obtain a fine-grained understanding of that specific user. The data consists of YouTube videos, from which gifs.com users manually extracted their highlights, by creating GIFs from a segment of the full video. Thus, the dataset is similar to that of [1], with two major differences. Each selection is associated with a user, which is what allows personalization. [1] used visual matching to find the position in the video from which a GIF was selected. Instead, we directly use the timestamps, which we have internally available. Thus, the ground truth is free from any alignment errors. Training set The training set contains highlights from 12'972 users. Test set The test set contains highlights from 850 users.

Dataset Attributes

Label SVG
TasksHighlight Detection
Label SVG
CategoriesYoutube, Media Interestingness
Label SVG
SensorWeb Sampling