Sarah Mroz, N. Baddour, Connor McGuirk, P. Juneau, Albert Tu, Kevin Cheung, E. Lemaire
{"title":"Comparing the Quality of Human Pose Estimation with BlazePose or OpenPose","authors":"Sarah Mroz, N. Baddour, Connor McGuirk, P. Juneau, Albert Tu, Kevin Cheung, E. Lemaire","doi":"10.1109/BioSMART54244.2021.9677850","DOIUrl":null,"url":null,"abstract":"Human pose estimation is a computer vision task that predicts the position of person's body landmarks within a given image or video. This technology could help provide virtual motion assessments by analyzing videos captured when the patient is outside a clinical setting. In this study, a newer pose estimation model that can run on a smartphone (BlazePose) was compared to a well-accepted solution (OpenPose) to determine if these models can provide clinically viable body keypoints for virtual motion assessment. Using ten videos of clinically relevant movements (recorded by physicians), keypoint coordinates were generated from each model. Using OpenPose as a baseline, Pearson correlation and root mean square error were calculated between the BlazePose and OpenPose keypoint trajectories. BlazePose had more instances where keypoints deviated from anatomical joint centres, compared to OpenPose, indicating the BlazePose was not yet viable for clinically relevant assessments. However, BlazePose runtime was much faster than OpenPose and returned metrics that could be incorporated into a smartphone solution. Future designs of a smartphone-based system for conducting virtual motion assessments should utilize OpenPose for pose estimation; however, BlazePose could be used for other design aspects such as movement pre-screening or activity classification.","PeriodicalId":286026,"journal":{"name":"2021 4th International Conference on Bio-Engineering for Smart Technologies (BioSMART)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Conference on Bio-Engineering for Smart Technologies (BioSMART)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BioSMART54244.2021.9677850","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
Human pose estimation is a computer vision task that predicts the position of person's body landmarks within a given image or video. This technology could help provide virtual motion assessments by analyzing videos captured when the patient is outside a clinical setting. In this study, a newer pose estimation model that can run on a smartphone (BlazePose) was compared to a well-accepted solution (OpenPose) to determine if these models can provide clinically viable body keypoints for virtual motion assessment. Using ten videos of clinically relevant movements (recorded by physicians), keypoint coordinates were generated from each model. Using OpenPose as a baseline, Pearson correlation and root mean square error were calculated between the BlazePose and OpenPose keypoint trajectories. BlazePose had more instances where keypoints deviated from anatomical joint centres, compared to OpenPose, indicating the BlazePose was not yet viable for clinically relevant assessments. However, BlazePose runtime was much faster than OpenPose and returned metrics that could be incorporated into a smartphone solution. Future designs of a smartphone-based system for conducting virtual motion assessments should utilize OpenPose for pose estimation; however, BlazePose could be used for other design aspects such as movement pre-screening or activity classification.