{"title":"基于头部姿态估计的物体自动视觉跟踪定性分析","authors":"Ayeshka Abeysinghe, Isuri Devlini Arachchige, Pradeepa Samarasinghe, Vidushani Dhanawansa, Menan Velayuthan","doi":"10.1109/ICAC57685.2022.10025053","DOIUrl":null,"url":null,"abstract":"An automated approach for object tracking and gaze estimation via head pose estimation is crucial, to facilitate a range of applications in the domain of -human-computer interfacing, this includes the analysis of head movement with respect to a stimulus in assessing one’s level of attention. While varied approaches for gaze estimation and object tracking exist, their suitability within such applications have not been justified. In order to address this gap, this paper conducts a quantitative comparison of existing models for gaze estimation including Mediapipe and standalone models of Openface and custom head pose estimation with MTCNN face detection; and object detection including models from CSRT object tracker, YOLO object detector, and a custom object detector. The accuracy of the aforementioned models were compared against the annotations of the EYEDIAP dataset, to evaluate their accuracy both relative and non-relative to each other. The analysis revealed that the custom object detector and the Openface models are relatively more accurate than the others when comparing the number of annotations, absolute mean error, and the relationship between x displacement-yaw, and y displacement-pitch, and thereby can be used in combination for gaze tracking tasks.","PeriodicalId":292397,"journal":{"name":"2022 4th International Conference on Advancements in Computing (ICAC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Qualitative Analysis of Automated Visual Tracking of Objects Through Head Pose Estimation\",\"authors\":\"Ayeshka Abeysinghe, Isuri Devlini Arachchige, Pradeepa Samarasinghe, Vidushani Dhanawansa, Menan Velayuthan\",\"doi\":\"10.1109/ICAC57685.2022.10025053\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An automated approach for object tracking and gaze estimation via head pose estimation is crucial, to facilitate a range of applications in the domain of -human-computer interfacing, this includes the analysis of head movement with respect to a stimulus in assessing one’s level of attention. While varied approaches for gaze estimation and object tracking exist, their suitability within such applications have not been justified. In order to address this gap, this paper conducts a quantitative comparison of existing models for gaze estimation including Mediapipe and standalone models of Openface and custom head pose estimation with MTCNN face detection; and object detection including models from CSRT object tracker, YOLO object detector, and a custom object detector. The accuracy of the aforementioned models were compared against the annotations of the EYEDIAP dataset, to evaluate their accuracy both relative and non-relative to each other. 
The analysis revealed that the custom object detector and the Openface models are relatively more accurate than the others when comparing the number of annotations, absolute mean error, and the relationship between x displacement-yaw, and y displacement-pitch, and thereby can be used in combination for gaze tracking tasks.\",\"PeriodicalId\":292397,\"journal\":{\"name\":\"2022 4th International Conference on Advancements in Computing (ICAC)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 4th International Conference on Advancements in Computing (ICAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAC57685.2022.10025053\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Advancements in Computing (ICAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAC57685.2022.10025053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Qualitative Analysis of Automated Visual Tracking of Objects Through Head Pose Estimation
An automated approach to object tracking and gaze estimation via head pose estimation is crucial to facilitating a range of applications in the domain of human-computer interfacing, including the analysis of head movement with respect to a stimulus when assessing one's level of attention. While varied approaches to gaze estimation and object tracking exist, their suitability for such applications has not been established. To address this gap, this paper conducts a quantitative comparison of existing models for gaze estimation (MediaPipe, a standalone OpenFace model, and a custom head pose estimator with MTCNN face detection) and for object tracking (the CSRT object tracker, the YOLO object detector, and a custom object detector). The accuracy of these models was compared against the annotations of the EYEDIAP dataset, evaluating them both relative to one another and in absolute terms. The analysis revealed that the custom object detector and the OpenFace model are more accurate than the alternatives when comparing the number of annotations, the absolute mean error, and the relationships between x displacement and yaw and between y displacement and pitch; they can therefore be used in combination for gaze tracking tasks.
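To make the kind of pipeline being compared concrete, the following is a minimal sketch (not the paper's implementation) of head pose estimation: MediaPipe Face Mesh landmarks fed into OpenCV's solvePnP. The landmark indices, the generic 3D face model, and the pinhole-camera approximation are common community choices assumed here for illustration only.

```python
# A minimal head pose sketch: MediaPipe Face Mesh + OpenCV solvePnP.
# All constants below are illustrative assumptions, not values from the paper.
import cv2
import mediapipe as mp
import numpy as np

# Generic 3D reference points of an average face, in arbitrary model units.
MODEL_POINTS = np.array([
    (0.0, 0.0, 0.0),           # nose tip
    (0.0, -330.0, -65.0),      # chin
    (-225.0, 170.0, -135.0),   # left eye outer corner
    (225.0, 170.0, -135.0),    # right eye outer corner
    (-150.0, -150.0, -125.0),  # left mouth corner
    (150.0, -150.0, -125.0),   # right mouth corner
])
# Corresponding Face Mesh landmark indices (assumed; left/right may need
# swapping depending on the mirroring convention of the input frames).
LANDMARK_IDS = [1, 152, 33, 263, 61, 291]

def head_pose(frame_bgr, face_mesh):
    """Return (yaw, pitch) in degrees for the first detected face, else None."""
    h, w = frame_bgr.shape[:2]
    result = face_mesh.process(cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB))
    if not result.multi_face_landmarks:
        return None
    lm = result.multi_face_landmarks[0].landmark
    image_points = np.array(
        [(lm[i].x * w, lm[i].y * h) for i in LANDMARK_IDS], dtype=np.float64)
    # Approximate pinhole camera: focal length ~ image width, principal
    # point at the image centre, zero lens distortion.
    cam = np.array([[w, 0, w / 2], [0, w, h / 2], [0, 0, 1]], dtype=np.float64)
    ok, rvec, _ = cv2.solvePnP(MODEL_POINTS, image_points, cam, None)
    if not ok:
        return None
    rmat, _ = cv2.Rodrigues(rvec)
    angles, *_ = cv2.RQDecomp3x3(rmat)  # Euler angles in degrees
    return angles[1], angles[0]         # (yaw, pitch)

face_mesh = mp.solutions.face_mesh.FaceMesh(max_num_faces=1, refine_landmarks=True)
```

A second sketch, continuing from the one above, shows how tracked object displacement could be related to head rotation in the spirit of the x displacement-yaw relationship the abstract mentions: a CSRT tracker supplies the object's per-frame position, and a Pearson correlation links its horizontal displacement to changes in yaw. The video path, the manual ROI initialisation, and the choice of Pearson correlation are hypothetical.

```python
# Continues the sketch above: track an object with OpenCV's CSRT tracker and
# correlate its per-frame displacement with head rotation.
cap = cv2.VideoCapture("session.mp4")         # hypothetical input video
ok, frame = cap.read()
bbox = cv2.selectROI("select target", frame)  # initial object box, chosen by hand
tracker = cv2.TrackerCSRT_create()            # cv2.legacy.TrackerCSRT_create() on some builds
tracker.init(frame, bbox)

xs, yaws = [], []                             # per-frame object x-centre and head yaw
while True:
    ok, frame = cap.read()
    if not ok:
        break
    tracked, bbox = tracker.update(frame)
    pose = head_pose(frame, face_mesh)
    if tracked and pose is not None:
        xs.append(bbox[0] + bbox[2] / 2.0)    # horizontal centre of the tracked box
        yaws.append(pose[0])

# Pearson correlation between frame-to-frame x displacement and yaw change,
# mirroring the x displacement-yaw relationship examined in the paper
# (the y displacement-pitch case is analogous).
r = np.corrcoef(np.diff(xs), np.diff(yaws))[0, 1]
print(f"x-displacement vs yaw correlation: {r:.3f}")
```

Under these assumptions, a strong positive correlation would indicate that the head rotates to follow the moving target, which is the behaviour the paper's model comparison quantifies against the EYEDIAP annotations.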