{"title":"Non-intrusive Automatic 3D Gaze Ground-truth System","authors":"Feng Hu","doi":"10.1145/3573942.3574068","DOIUrl":null,"url":null,"abstract":"Driver distraction has surfaced as a significant safety issue worldwide, and the capacity to track a driver's attention via monitoring its gaze direction is one of the most critical features in the modern Driver Monitoring System (DMS). Deep learning based gaze estimation has grown in popularity due to its robustness across operating conditions. Though appropriate network structure design and parameters tuning are important, accurate ground-truth estimation for millions of gaze training images to build the model also plays a critical role in achieving high-quality gaze estimation results. This paper proposes a non-intrusive automatic 3D ground-truth data collection system for large-scale on-bench and in-car data collection, using gamified camera calibration, occlusion invariant mirror-based camera localization, and noise-robust 3D reconstruction algorithms. Experimental results are provided to demonstrate the system's accuracy and robustness even in challenging conditions.","PeriodicalId":103293,"journal":{"name":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","volume":"206 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3573942.3574068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Driver distraction has surfaced as a significant safety issue worldwide, and the capacity to track a driver's attention via monitoring its gaze direction is one of the most critical features in the modern Driver Monitoring System (DMS). Deep learning based gaze estimation has grown in popularity due to its robustness across operating conditions. Though appropriate network structure design and parameters tuning are important, accurate ground-truth estimation for millions of gaze training images to build the model also plays a critical role in achieving high-quality gaze estimation results. This paper proposes a non-intrusive automatic 3D ground-truth data collection system for large-scale on-bench and in-car data collection, using gamified camera calibration, occlusion invariant mirror-based camera localization, and noise-robust 3D reconstruction algorithms. Experimental results are provided to demonstrate the system's accuracy and robustness even in challenging conditions.