YFCC100M

by Bart Thomee,David A. Shamma,Gerald Friedland,Benjamin Elizalde,Karl Ni,Douglas Poland,Damian Borth,Li-Jia LiUnknown

YFCC100M

The YFCC100M is the largest publicly and freely usable multimedia collection, containing around 99.2 million photos and 0.8 million videos from Flickr, all of which were shared under one of the various Creative Commons licenses. This dataset, however, only includes the metadata of the photos and videos (e.g. the photographers that captured them, the cameras that were used, the locations where they were taken if available, etc.) and does not include their actual content (i.e. the image and video files). To make it easier for everyone, we therefore downloaded all of the photos and videos in the dataset from Flickr, processed their content to generate additional data (e.g. visual features, ground truth annotations) that researchers often find useful, and released utilities and tools to assist with using and visualizing the dataset. We have made all of this material available as part of the Multimedia Commons initiative.

Dataset Attributes

Label SVG
TasksVisual Reasoning
Label SVG
CategoriesMultimedia, Flickr, Photos
Label SVG
SensorRGB Camera