TY - GEN
T1 - MDAD: A Multimodal and Multiview in-Vehicle Driver Action Dataset
AU - Jegham, Imen
AU - Khalifa, Anouar Ben
AU - Alouani, Ihsen
AU - Mahjoub, Mohamed Ali
PY - 2019/8/22
Y1 - 2019/8/22
N2 - “Driver’s distraction is deadly!”. Due to its crucial role in saving lives, driver action recognition is an important and trending topic in the field of computer vision. However, a very limited number of public datasets are available to validate proposed methods. This paper introduces a new public, well structured and extensive dataset, named Multiview and multimodal in-vehicle Driver Action Dataset (MDAD). MDAD consists of two temporally synchronised data modalities from side and frontal views. These modalities include RGB and depth data from different Kinect cameras. Many subjects with various body sizes, gender and ages are asked to perform 16 in-vehicle actions in several weather conditions. Each subject drives the vehicle on multiple trip routes in Sousse, Tunisia, at different times to describe a large range of head rotations, changes in lighting conditions and some occlusions. Our recorded dataset provides researchers with a testbed to develop new algorithms across multiple modalities and views under different illumination conditions. To demonstrate the utility of our dataset, we analyze driver action recognition results from each modality and every view independently, and then we combine modalities and views. This public dataset is of benefit to research activities for humans driver action analysis.
AB - “Driver’s distraction is deadly!”. Due to its crucial role in saving lives, driver action recognition is an important and trending topic in the field of computer vision. However, a very limited number of public datasets are available to validate proposed methods. This paper introduces a new public, well structured and extensive dataset, named Multiview and multimodal in-vehicle Driver Action Dataset (MDAD). MDAD consists of two temporally synchronised data modalities from side and frontal views. These modalities include RGB and depth data from different Kinect cameras. Many subjects with various body sizes, gender and ages are asked to perform 16 in-vehicle actions in several weather conditions. Each subject drives the vehicle on multiple trip routes in Sousse, Tunisia, at different times to describe a large range of head rotations, changes in lighting conditions and some occlusions. Our recorded dataset provides researchers with a testbed to develop new algorithms across multiple modalities and views under different illumination conditions. To demonstrate the utility of our dataset, we analyze driver action recognition results from each modality and every view independently, and then we combine modalities and views. This public dataset is of benefit to research activities for humans driver action analysis.
U2 - 10.1007/978-3-030-29888-3_42
DO - 10.1007/978-3-030-29888-3_42
M3 - Conference contribution
SN - 9783030298876
VL - Part 1
T3 - Lecture Notes in Computer Science
SP - 518
EP - 529
BT - International Conference on Computer Analysis of Images and Patterns: Computer Analysis of Images and Patterns: Proceedings
A2 - Vento, Mario
A2 - Percannella, Gennaro
PB - Springer
T2 - 18th International Conference, CAIP 2019
Y2 - 3 September 2019 through 5 September 2019
ER -