PHD2
by Ana Garcia del Molino, Michael Gygli
License: Unknown
The dataset records which video segments a specific user considers a highlight. This kind of data enables strong personalization models, because concrete examples of what a user is interested in give a model a fine-grained understanding of that user. The data consists of YouTube videos from which gifs.com users manually extracted their highlights by creating GIFs from a segment of the full video.

The dataset is thus similar to that of [1], with two major differences. First, each selection is associated with a user, which is what allows personalization. Second, [1] used visual matching to find the position in the video from which a GIF was selected, whereas we directly use the timestamps, which we have available internally. The ground truth is therefore free from alignment errors.

Training set
The training set contains highlights from 12,972 users.

Test set
The test set contains highlights from 850 users.
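As a rough illustration of the per-user structure described above, the sketch below models one highlight as a (user, video, start, end) record and groups records into a per-user history. The field names, the `Highlight` class, both helper functions, and the sample values are all hypothetical and chosen only for illustration; the dataset's actual file layout and column names may differ.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Highlight:
    # Hypothetical schema: the real dataset may use other names and units.
    user_id: str      # identifier of the user who created the GIF
    video_id: str     # identifier of the source YouTube video
    start_s: float    # highlight start time in seconds
    end_s: float      # highlight end time in seconds


def group_by_user(highlights):
    """Collect each user's highlights into a list keyed by user_id."""
    history = defaultdict(list)
    for h in highlights:
        history[h.user_id].append(h)
    return dict(history)


def overlap_seconds(h, seg_start, seg_end):
    """Seconds of overlap between a highlight and a candidate segment."""
    return max(0.0, min(h.end_s, seg_end) - max(h.start_s, seg_start))


# Made-up sample records, not taken from the dataset.
records = [
    Highlight("user_a", "vid_1", 12.0, 17.5),
    Highlight("user_a", "vid_2", 40.0, 44.0),
    Highlight("user_b", "vid_1", 13.0, 20.0),
]
history = group_by_user(records)
```

Because the timestamps are stored directly, a candidate segment can be compared against a user's selection with simple interval arithmetic, as `overlap_seconds` does, without any visual alignment step.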
Dataset Attributes
Tasks: Highlight Detection
Categories: YouTube, Media Interestingness
Sensor: Web Sampling