{"title":"用于智能城市人群计数和定位的规模聚合网络","authors":"Wenzhe Zhai, Mingliang Gao, Xiangyu Guo, Guofeng Zou, Qilei Li, Gwanggil Jeon","doi":"10.1145/3653454","DOIUrl":null,"url":null,"abstract":"<p>Recent years have witnessed a remarkable proliferation of applications in smart cities. Crowd analysis is a crucial subject, and it incorporates two subtasks in smart city systems, <i>i.e.</i>, crowd counting and crowd localization. Nevertheless, the presence of adverse intrinsic factors, <i>i.e.</i>, scale variation and background noise severely degrades the performance of counting and localization. Although great efforts have been made on separate research on counting and localization, few works are capable of performing both tasks at the same time. To this aim, the scale attentive aggregation network (SA<sup>2</sup>Net) is proposed to solve the problems of scale variation and background noise in crowd counting and localization tasks synchronously. Specifically, the SA<sup>2</sup>Net has two vital modules, namely multiscale feature aggregator (MFA) module and background noise suppressor (BNS) module. The MFA module is designed in a four-pathway structure, and it aggregates the multiscale feature so as to facilitate the correlation between different scales. The BNS module utilizes the contextual information between the input keys matrix and self-attention matrix to suppress the background noise. Furthermore, a global consistency loss combined with the Euclidean loss is utilized to optimize the network in counting and localization tasks. Extensive experimental results prove that the SA<sup>2</sup>Net outperforms the state-of-the-art competitors both subjectively and objectively.</p>","PeriodicalId":50910,"journal":{"name":"ACM Transactions on Sensor Networks","volume":"19 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Scale Attentive Aggregation Network for Crowd Counting and Localization in Smart City\",\"authors\":\"Wenzhe Zhai, Mingliang Gao, Xiangyu Guo, Guofeng Zou, Qilei Li, Gwanggil Jeon\",\"doi\":\"10.1145/3653454\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Recent years have witnessed a remarkable proliferation of applications in smart cities. Crowd analysis is a crucial subject, and it incorporates two subtasks in smart city systems, <i>i.e.</i>, crowd counting and crowd localization. Nevertheless, the presence of adverse intrinsic factors, <i>i.e.</i>, scale variation and background noise severely degrades the performance of counting and localization. Although great efforts have been made on separate research on counting and localization, few works are capable of performing both tasks at the same time. To this aim, the scale attentive aggregation network (SA<sup>2</sup>Net) is proposed to solve the problems of scale variation and background noise in crowd counting and localization tasks synchronously. Specifically, the SA<sup>2</sup>Net has two vital modules, namely multiscale feature aggregator (MFA) module and background noise suppressor (BNS) module. The MFA module is designed in a four-pathway structure, and it aggregates the multiscale feature so as to facilitate the correlation between different scales. The BNS module utilizes the contextual information between the input keys matrix and self-attention matrix to suppress the background noise. Furthermore, a global consistency loss combined with the Euclidean loss is utilized to optimize the network in counting and localization tasks. Extensive experimental results prove that the SA<sup>2</sup>Net outperforms the state-of-the-art competitors both subjectively and objectively.</p>\",\"PeriodicalId\":50910,\"journal\":{\"name\":\"ACM Transactions on Sensor Networks\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Transactions on Sensor Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1145/3653454\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Sensor Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3653454","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Scale Attentive Aggregation Network for Crowd Counting and Localization in Smart City
Recent years have witnessed a remarkable proliferation of applications in smart cities. Crowd analysis is a crucial subject, and it incorporates two subtasks in smart city systems, i.e., crowd counting and crowd localization. Nevertheless, the presence of adverse intrinsic factors, i.e., scale variation and background noise severely degrades the performance of counting and localization. Although great efforts have been made on separate research on counting and localization, few works are capable of performing both tasks at the same time. To this aim, the scale attentive aggregation network (SA2Net) is proposed to solve the problems of scale variation and background noise in crowd counting and localization tasks synchronously. Specifically, the SA2Net has two vital modules, namely multiscale feature aggregator (MFA) module and background noise suppressor (BNS) module. The MFA module is designed in a four-pathway structure, and it aggregates the multiscale feature so as to facilitate the correlation between different scales. The BNS module utilizes the contextual information between the input keys matrix and self-attention matrix to suppress the background noise. Furthermore, a global consistency loss combined with the Euclidean loss is utilized to optimize the network in counting and localization tasks. Extensive experimental results prove that the SA2Net outperforms the state-of-the-art competitors both subjectively and objectively.
期刊介绍:
ACM Transactions on Sensor Networks (TOSN) is a central publication by the ACM in the interdisciplinary area of sensor networks spanning a broad discipline from signal processing, networking and protocols, embedded systems, information management, to distributed algorithms. It covers research contributions that introduce new concepts, techniques, analyses, or architectures, as well as applied contributions that report on development of new tools and systems or experiences and experiments with high-impact, innovative applications. The Transactions places special attention on contributions to systemic approaches to sensor networks as well as fundamental contributions.