Biwi 3D Audiovisual Corpus of Affective Communication

by No License

The corpus comprises 1109 sentences uttered by 14 native English speakers (6 males and 8 females). A real-time 3D scanner and a professional microphone were used to capture the speakers' facial movements and speech. The dense dynamic face scans were acquired at 25 frames per second, and the RMS error of the 3D reconstruction is about 0.5 mm. To ease automatic speech segmentation, we carried out the recordings in an anechoic room, with walls covered by sound wave-absorbing materials, as shown in the picture.

Dataset Attributes

Categories: Dataset