Renjie Shi, Liming Li, Shubin Zheng, Yizhou Mao, Xiaoxue An
{"title":"基于多尺度条带汇集关注机制的受电弓语义分割算法及应用研究","authors":"Renjie Shi, Liming Li, Shubin Zheng, Yizhou Mao, Xiaoxue An","doi":"10.1063/5.0230117","DOIUrl":null,"url":null,"abstract":"Detecting pantographs remains a challenging task due to complex scenes, variable weather conditions, and noise interference. Existing pantograph detection methods struggle to effectively segment the complete shape of the pantograph from intricate backgrounds and adverse weather, and they often exhibit inadequate real-time performance. To address these challenges, we propose a novel pantograph segmentation method that leverages a deep learning multi-scale strip pooling attention mechanism. Our approach utilizes the PidNet semantic segmentation network as the baseline architecture, while we introduce a newly designed multi-scale strip pooling attention mechanism specifically for the detail extraction branch. The multi-scale strip convolution branch effectively extracts the pantograph pixel-level detail features, while the pooling branch effectively extracts the macroscopic features of the pantograph. The unique linear interpolation method effectively mitigates the influence of weather, enhancing segmentation accuracy while maintaining a lightweight structure. In the context aggregation branch, a multi-scale context aggregation module utilizing gated convolution has been developed to replace the original network’s module, which possesses strong pantograph positioning capabilities. In comparison to existing pantograph detection methods, our model demonstrates the ability to accurately segment the pantograph with a clearly defined shape, effectively filter out extraneous background noise, and exhibit high robustness to variations in illumination and weather conditions. In addition, a rich pantograph dataset was created, including various scenarios and weather conditions, which also enhanced the robustness of the model. When the IOU and accuracy are 92.91% and 96.04%, respectively, the inference speed can still exceed 30 FPS on a single 2080Ti GPU.","PeriodicalId":7619,"journal":{"name":"AIP Advances","volume":"6 1","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic segmentation algorithm for pantograph based on multi-scale strip pooling attention mechanism and application research\",\"authors\":\"Renjie Shi, Liming Li, Shubin Zheng, Yizhou Mao, Xiaoxue An\",\"doi\":\"10.1063/5.0230117\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Detecting pantographs remains a challenging task due to complex scenes, variable weather conditions, and noise interference. Existing pantograph detection methods struggle to effectively segment the complete shape of the pantograph from intricate backgrounds and adverse weather, and they often exhibit inadequate real-time performance. To address these challenges, we propose a novel pantograph segmentation method that leverages a deep learning multi-scale strip pooling attention mechanism. Our approach utilizes the PidNet semantic segmentation network as the baseline architecture, while we introduce a newly designed multi-scale strip pooling attention mechanism specifically for the detail extraction branch. The multi-scale strip convolution branch effectively extracts the pantograph pixel-level detail features, while the pooling branch effectively extracts the macroscopic features of the pantograph. The unique linear interpolation method effectively mitigates the influence of weather, enhancing segmentation accuracy while maintaining a lightweight structure. In the context aggregation branch, a multi-scale context aggregation module utilizing gated convolution has been developed to replace the original network’s module, which possesses strong pantograph positioning capabilities. In comparison to existing pantograph detection methods, our model demonstrates the ability to accurately segment the pantograph with a clearly defined shape, effectively filter out extraneous background noise, and exhibit high robustness to variations in illumination and weather conditions. In addition, a rich pantograph dataset was created, including various scenarios and weather conditions, which also enhanced the robustness of the model. When the IOU and accuracy are 92.91% and 96.04%, respectively, the inference speed can still exceed 30 FPS on a single 2080Ti GPU.\",\"PeriodicalId\":7619,\"journal\":{\"name\":\"AIP Advances\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AIP Advances\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://doi.org/10.1063/5.0230117\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MATERIALS SCIENCE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AIP Advances","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1063/5.0230117","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
Semantic segmentation algorithm for pantograph based on multi-scale strip pooling attention mechanism and application research
Detecting pantographs remains a challenging task due to complex scenes, variable weather conditions, and noise interference. Existing pantograph detection methods struggle to effectively segment the complete shape of the pantograph from intricate backgrounds and adverse weather, and they often exhibit inadequate real-time performance. To address these challenges, we propose a novel pantograph segmentation method that leverages a deep learning multi-scale strip pooling attention mechanism. Our approach utilizes the PidNet semantic segmentation network as the baseline architecture, while we introduce a newly designed multi-scale strip pooling attention mechanism specifically for the detail extraction branch. The multi-scale strip convolution branch effectively extracts the pantograph pixel-level detail features, while the pooling branch effectively extracts the macroscopic features of the pantograph. The unique linear interpolation method effectively mitigates the influence of weather, enhancing segmentation accuracy while maintaining a lightweight structure. In the context aggregation branch, a multi-scale context aggregation module utilizing gated convolution has been developed to replace the original network’s module, which possesses strong pantograph positioning capabilities. In comparison to existing pantograph detection methods, our model demonstrates the ability to accurately segment the pantograph with a clearly defined shape, effectively filter out extraneous background noise, and exhibit high robustness to variations in illumination and weather conditions. In addition, a rich pantograph dataset was created, including various scenarios and weather conditions, which also enhanced the robustness of the model. When the IOU and accuracy are 92.91% and 96.04%, respectively, the inference speed can still exceed 30 FPS on a single 2080Ti GPU.
期刊介绍:
AIP Advances is an open access journal publishing in all areas of physical sciences—applied, theoretical, and experimental. All published articles are freely available to read, download, and share. The journal prides itself on the belief that all good science is important and relevant. Our inclusive scope and publication standards make it an essential outlet for scientists in the physical sciences.
AIP Advances is a community-based journal, with a fast production cycle. The quick publication process and open-access model allows us to quickly distribute new scientific concepts. Our Editors, assisted by peer review, determine whether a manuscript is technically correct and original. After publication, the readership evaluates whether a manuscript is timely, relevant, or significant.