D²-City

by DiDi ChuxingResearch Only

D²-City

Background D²-City is a large-scale driving video dataset that provides more than 10,000 dashcam videos recorded in 720p HD or 1080p FHD. Around 1000 of the videos come with detection and tracking annotation in each frame of all road objects, including bounding boxes and the tracking IDs of cars, vans, buses, trucks, pedestrians, motorcycles, bicycles, open- and closed-tricycles, forklifts, and large- and small-blocks. Some of the remainder of the videos come with road objects annotated in keyframes. Compared with existing datasets, D²-City benefits from its huge amount of diversity as data is collected from several cities throughout China and features varying weather, road, and traffic conditions. D²-City pays special attention to challenges in complex and various traffic scenarios. By bring more challenging cases to the community, we hope that this dataset will encourage and help new advances in the perception area of intelligent driving. Description The D²-City dataset is a comprehensive collection of dashcam videos collected by vehicles on DiDi’s platform in 5 Chinese cities. All videos are 30-second clips in 720p or 1080p resolution at a frame rate of 25fps. The dataset will be divided into training, validation, and testing subsets. The data files and statistics will then be released in stages. For around 1000 of the videos, we have annotated the bounding boxes and tracking IDs of road objects into 12 different categories, shown in the following table. For some of the remainder of the videos, we annotate the bounding boxes in key frames.

Dataset Attributes

Label SVG
TasksDomain Adaptation, Detection
Label SVG
CategoriesAutonomous, Self-Driving, Car, Dashcam, Traffic, Road
Label SVG
SensorRGB Dashcam

Class Labels

carvanbustruckpedestrianmotorcyclebicycleopen- and closed-tricyclesforkliftsand large- or small-block.