CAD 120 affordance

by No License

CAD 120 affordance

This is the CAD 120 Affordance Segmentation Dataset based on the Cornell Activity Dataset CAD 120 (see http://pr.cs.cornell.edu/humanactivities/data.php).Contentframes/*.png:RGB frames selected from Cornell Activity Dataset. To find out the location of the framein the original videos, see video_info.txt.object_crop_images/*.pngimage crops taken from the selected frames and resized to 321*321. Each crop is a paddedbounding box of an object the human interacts with in the video. Due to the padding,the crops may contain background and other objects.In each selected frame, each bounding box was processed. The bounding boxes are alreadygiven in the Cornell Activity Dataset.The 5-digit number gives the frame number, the second number gives the bounding box numberwithin the frame.segmentation_mat/*.mat321*321*6 segmentation masks for the image crops. Each channel corresponds to anaffordance (openabe, cuttable, pourable, containable, supportable, holdable, in this order).All pixels belonging to a particular affordance are labeled 1 in the respective channel,otherwise 0.segmentation_png/*.png321*321 png images, each containing the binary mask for one of the affordances.lists/*.txtLists containing the train and test sets for two splits. The actor split ensures thattrain and test images stem from different videos with different actors while the object split ensuresthat train and test data have no (central) object classes in common.The train sets are additionally subdivided into 3 subsets A,B and C. For the actor split,the subsets stem from different videos. For the object split, each subset containsevery third crop of the train set.crop_coordinate_info.txtMaps image crops to their coordinates in the frames.hpose_info.txtMaps frames to 2d human pose coordinates. Hand annotated by us.object_info.txtMaps image crops to the (central) object it contains.visible_affordance_info.txtMaps image crops to affordances visible in this cropThe crops contain the following object classes:1.table2.kettle3.plate4.bottle5.thermal cup6.knife7.medicine box8.can9.microwave10.paper box11.bowl12.mugAffordances in our set:1.openable2.cuttable3.pourable4.containable5.supportable6.holdableNote that our object affordance labeling differs from the Cornell Activity Dataset:E.g. the cap of a pizza box is considered to be supportable.Johann Sawatzky, Abhilash Srikantha, Juergen Gall.Weakly Supervised Affordance Detection.IEEE Conference on Computer Vision and Pattern Recognition (CVPR17)H. S. Koppula and A. Saxena.Physically grounded spatio-temporal object affordances.European Conference on Computer Vision (ECCV14)

Dataset Attributes

Label SVG
CategoriesAffordance, Action, Cad, Attribute, Human