The Massively Multilingual Image Dataset

by Research Only

The Massively Multilingual Image Dataset

MMID is a large-scale, massively multilingual dataset of images paired with the words they represent collected at the University of Pennsylvania. The dataset is doubly parallel: for each language, words are stored parallel to images that represent the word, and parallel to the word’s translation into English (and corresponding images.) By far the largest dataset of its kind, it has 100 languages (including English) and up to 10,000 words per language! (and many more for English.)

Dataset Attributes