Cornell NLVR

by Alane Suhr, Stephanie Zhou, Ally Zhang, Iris Zhang, Huajun Bai, Yoav Artzi

Cornell Natural Language Visual Reasoning (NLVR) is a language grounding dataset. It contains 92,244 pairs, each consisting of a natural language statement and a synthetic image. The task is to determine whether the statement is true or false about the image. The data was collected through crowdsourcing, and solving the task requires reasoning about sets of objects, quantities, comparisons, and spatial relations.
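
Because the task is a binary true/false decision per pair, a simple way to score a system is accuracy over the pairs. The sketch below illustrates that framing only. The `Example` class, its field names, the `always_true` baseline, and the toy sentences are assumptions made for illustration; they are not the dataset's actual file format or contents.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Example:
    """One statement-image pair (hypothetical field names)."""
    statement: str
    image_path: str
    label: bool  # True if the statement holds for the image


def accuracy(examples: Sequence[Example],
             predict: Callable[[str, str], bool]) -> float:
    """Fraction of pairs whose predicted truth value matches the label."""
    if not examples:
        raise ValueError("no examples to evaluate")
    correct = sum(
        predict(ex.statement, ex.image_path) == ex.label for ex in examples
    )
    return correct / len(examples)


def always_true(statement: str, image_path: str) -> bool:
    """Trivial baseline that ignores both inputs."""
    return True


# Made-up toy pairs, not taken from the dataset.
toy = [
    Example("There is a box with two yellow items.", "img_0.png", True),
    Example("A black triangle touches the wall.", "img_1.png", False),
    Example("There are exactly three blue squares.", "img_2.png", True),
    Example("One tower has a yellow block at its base.", "img_3.png", True),
]
toy_accuracy = accuracy(toy, always_true)
```

A real system would replace `always_true` with a function that actually reads the image and the statement; the scoring function stays the same.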

Dataset Attributes

Tasks: Visual Reasoning
Categories: Pairs Of 2D Images