Tian-Fei Zhang, J. Ding, Rong-Qiang Zhou, Haiyan Long
{"title":"改进的HRNET及其在人群统计中的应用","authors":"Tian-Fei Zhang, J. Ding, Rong-Qiang Zhou, Haiyan Long","doi":"10.1109/ISPDS56360.2022.9874015","DOIUrl":null,"url":null,"abstract":"Aiming at the low accuracy of crowd counting caused by scale change and occlusion in dense scenes, this paper proposes to generate the truth map into non overlapping independent areas in HRNet to facilitate the crowd location statistics of network density map; Then the 3D attention mechanism is introduced to make the network focus on the useful information of the feature map; Finally, during the training, the mean square error loss (MSE loss), L1 loss and cross entropy loss are combined into the total loss function to optimize the generalization ability of the model; The combination of the above methods improves the accuracy of the model in crowd counting and crowd location. Compared with the main methods in recent years in the public datasets NWPU, Shanghai Tech, the experimental results show that the proposed model can effectively improve the accuracy and robustness of crowd location counting.","PeriodicalId":280244,"journal":{"name":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Improved HRNET and its application in crowd counting\",\"authors\":\"Tian-Fei Zhang, J. Ding, Rong-Qiang Zhou, Haiyan Long\",\"doi\":\"10.1109/ISPDS56360.2022.9874015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the low accuracy of crowd counting caused by scale change and occlusion in dense scenes, this paper proposes to generate the truth map into non overlapping independent areas in HRNet to facilitate the crowd location statistics of network density map; Then the 3D attention mechanism is introduced to make the network focus on the useful information of the feature map; Finally, during the training, the mean square error loss (MSE loss), L1 loss and cross entropy loss are combined into the total loss function to optimize the generalization ability of the model; The combination of the above methods improves the accuracy of the model in crowd counting and crowd location. Compared with the main methods in recent years in the public datasets NWPU, Shanghai Tech, the experimental results show that the proposed model can effectively improve the accuracy and robustness of crowd location counting.\",\"PeriodicalId\":280244,\"journal\":{\"name\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPDS56360.2022.9874015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Information Science, Parallel and Distributed Systems (ISPDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPDS56360.2022.9874015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Improved HRNET and its application in crowd counting
Aiming at the low accuracy of crowd counting caused by scale change and occlusion in dense scenes, this paper proposes to generate the truth map into non overlapping independent areas in HRNet to facilitate the crowd location statistics of network density map; Then the 3D attention mechanism is introduced to make the network focus on the useful information of the feature map; Finally, during the training, the mean square error loss (MSE loss), L1 loss and cross entropy loss are combined into the total loss function to optimize the generalization ability of the model; The combination of the above methods improves the accuracy of the model in crowd counting and crowd location. Compared with the main methods in recent years in the public datasets NWPU, Shanghai Tech, the experimental results show that the proposed model can effectively improve the accuracy and robustness of crowd location counting.