Sai Ma, Haibo Ge, Wenhao He, Chaofeng Huang, Yu An, Ting Zhou
{"title":"一种基于高分辨率网络的轻量级人体姿态估计算法","authors":"Sai Ma, Haibo Ge, Wenhao He, Chaofeng Huang, Yu An, Ting Zhou","doi":"10.1109/icnlp58431.2023.00020","DOIUrl":null,"url":null,"abstract":"Human pose estimation is an important research direction in the field of computer vision. At present, the mainstream human pose estimation algorithms have high complexity, large amount of calculation, and cannot be run on resource-constrained devices such as mobile terminals, which severely limits the popularization and application of this technology. Aiming at the problem of increased network model parameters and computational complexity, based on the High-Resolution Network (HRNet), a lightweight human pose estimation network incorporating Ghost module and attention mechanism is proposed. Replaced with Ghost convolution, and added the attention mechanism Concurrent Spatial and Channel Squeeze and Channel Excitation Net module on this basis to ensure the prediction accuracy of the network. Under the same image resolution and environment configuration, the experimental results on the COCO dataset show that the improved network model reduces the number of parameters by 98.3% compared to the high-resolution network model, and reduces the computational complexity by 67.6%. The experimental results show that the improved network model can effectively reduce the amount of network parameters and reduce the computational complexity while maintaining a certain prediction accuracy.","PeriodicalId":53637,"journal":{"name":"Icon","volume":"49 1","pages":"73-77"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Lightweight Human Pose Estimation Algorithm Based on High Resolution Network\",\"authors\":\"Sai Ma, Haibo Ge, Wenhao He, Chaofeng Huang, Yu An, Ting Zhou\",\"doi\":\"10.1109/icnlp58431.2023.00020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human pose estimation is an important research direction in the field of computer vision. At present, the mainstream human pose estimation algorithms have high complexity, large amount of calculation, and cannot be run on resource-constrained devices such as mobile terminals, which severely limits the popularization and application of this technology. Aiming at the problem of increased network model parameters and computational complexity, based on the High-Resolution Network (HRNet), a lightweight human pose estimation network incorporating Ghost module and attention mechanism is proposed. Replaced with Ghost convolution, and added the attention mechanism Concurrent Spatial and Channel Squeeze and Channel Excitation Net module on this basis to ensure the prediction accuracy of the network. Under the same image resolution and environment configuration, the experimental results on the COCO dataset show that the improved network model reduces the number of parameters by 98.3% compared to the high-resolution network model, and reduces the computational complexity by 67.6%. The experimental results show that the improved network model can effectively reduce the amount of network parameters and reduce the computational complexity while maintaining a certain prediction accuracy.\",\"PeriodicalId\":53637,\"journal\":{\"name\":\"Icon\",\"volume\":\"49 1\",\"pages\":\"73-77\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Icon\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icnlp58431.2023.00020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Arts and Humanities\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Icon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icnlp58431.2023.00020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Arts and Humanities","Score":null,"Total":0}
A Lightweight Human Pose Estimation Algorithm Based on High Resolution Network
Human pose estimation is an important research direction in the field of computer vision. At present, the mainstream human pose estimation algorithms have high complexity, large amount of calculation, and cannot be run on resource-constrained devices such as mobile terminals, which severely limits the popularization and application of this technology. Aiming at the problem of increased network model parameters and computational complexity, based on the High-Resolution Network (HRNet), a lightweight human pose estimation network incorporating Ghost module and attention mechanism is proposed. Replaced with Ghost convolution, and added the attention mechanism Concurrent Spatial and Channel Squeeze and Channel Excitation Net module on this basis to ensure the prediction accuracy of the network. Under the same image resolution and environment configuration, the experimental results on the COCO dataset show that the improved network model reduces the number of parameters by 98.3% compared to the high-resolution network model, and reduces the computational complexity by 67.6%. The experimental results show that the improved network model can effectively reduce the amount of network parameters and reduce the computational complexity while maintaining a certain prediction accuracy.