Semantic segmentation algorithm for pantograph based on multi-scale strip pooling attention mechanism and application research

IF 1.4 4区 物理与天体物理 Q4 MATERIALS SCIENCE, MULTIDISCIPLINARY AIP Advances Pub Date : 2024-09-13 DOI:10.1063/5.0230117
Renjie Shi, Liming Li, Shubin Zheng, Yizhou Mao, Xiaoxue An
{"title":"Semantic segmentation algorithm for pantograph based on multi-scale strip pooling attention mechanism and application research","authors":"Renjie Shi, Liming Li, Shubin Zheng, Yizhou Mao, Xiaoxue An","doi":"10.1063/5.0230117","DOIUrl":null,"url":null,"abstract":"Detecting pantographs remains a challenging task due to complex scenes, variable weather conditions, and noise interference. Existing pantograph detection methods struggle to effectively segment the complete shape of the pantograph from intricate backgrounds and adverse weather, and they often exhibit inadequate real-time performance. To address these challenges, we propose a novel pantograph segmentation method that leverages a deep learning multi-scale strip pooling attention mechanism. Our approach utilizes the PidNet semantic segmentation network as the baseline architecture, while we introduce a newly designed multi-scale strip pooling attention mechanism specifically for the detail extraction branch. The multi-scale strip convolution branch effectively extracts the pantograph pixel-level detail features, while the pooling branch effectively extracts the macroscopic features of the pantograph. The unique linear interpolation method effectively mitigates the influence of weather, enhancing segmentation accuracy while maintaining a lightweight structure. In the context aggregation branch, a multi-scale context aggregation module utilizing gated convolution has been developed to replace the original network’s module, which possesses strong pantograph positioning capabilities. In comparison to existing pantograph detection methods, our model demonstrates the ability to accurately segment the pantograph with a clearly defined shape, effectively filter out extraneous background noise, and exhibit high robustness to variations in illumination and weather conditions. In addition, a rich pantograph dataset was created, including various scenarios and weather conditions, which also enhanced the robustness of the model. When the IOU and accuracy are 92.91% and 96.04%, respectively, the inference speed can still exceed 30 FPS on a single 2080Ti GPU.","PeriodicalId":7619,"journal":{"name":"AIP Advances","volume":"6 1","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AIP Advances","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1063/5.0230117","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Detecting pantographs remains a challenging task due to complex scenes, variable weather conditions, and noise interference. Existing pantograph detection methods struggle to effectively segment the complete shape of the pantograph from intricate backgrounds and adverse weather, and they often exhibit inadequate real-time performance. To address these challenges, we propose a novel pantograph segmentation method that leverages a deep learning multi-scale strip pooling attention mechanism. Our approach utilizes the PidNet semantic segmentation network as the baseline architecture, while we introduce a newly designed multi-scale strip pooling attention mechanism specifically for the detail extraction branch. The multi-scale strip convolution branch effectively extracts the pantograph pixel-level detail features, while the pooling branch effectively extracts the macroscopic features of the pantograph. The unique linear interpolation method effectively mitigates the influence of weather, enhancing segmentation accuracy while maintaining a lightweight structure. In the context aggregation branch, a multi-scale context aggregation module utilizing gated convolution has been developed to replace the original network’s module, which possesses strong pantograph positioning capabilities. In comparison to existing pantograph detection methods, our model demonstrates the ability to accurately segment the pantograph with a clearly defined shape, effectively filter out extraneous background noise, and exhibit high robustness to variations in illumination and weather conditions. In addition, a rich pantograph dataset was created, including various scenarios and weather conditions, which also enhanced the robustness of the model. When the IOU and accuracy are 92.91% and 96.04%, respectively, the inference speed can still exceed 30 FPS on a single 2080Ti GPU.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多尺度条带汇集关注机制的受电弓语义分割算法及应用研究
由于场景复杂、天气条件多变和噪声干扰,受电弓的检测仍然是一项具有挑战性的任务。现有的受电弓检测方法难以从错综复杂的背景和恶劣的天气中有效地分割出受电弓的完整形状,而且它们往往表现出不足的实时性。为了应对这些挑战,我们提出了一种利用深度学习多尺度条带汇集注意力机制的新型受电弓分割方法。我们的方法利用 PidNet 语义分割网络作为基线架构,同时专门为细节提取分支引入了新设计的多尺度条带汇集关注机制。多尺度条带卷积分支能有效提取受电弓像素级的细节特征,而池化分支则能有效提取受电弓的宏观特征。独特的线性插值方法有效减轻了天气的影响,在保持轻量级结构的同时提高了分割精度。在上下文聚合分支中,开发了一个利用门控卷积的多尺度上下文聚合模块,以取代原有网络的模块,该模块具有很强的受电弓定位能力。与现有的受电弓检测方法相比,我们的模型能够准确分割形状清晰的受电弓,有效过滤无关的背景噪声,并对光照和天气条件的变化表现出较高的鲁棒性。此外,我们还创建了一个丰富的受电弓数据集,其中包括各种场景和天气条件,这也增强了模型的鲁棒性。当 IOU 和准确率分别为 92.91% 和 96.04% 时,在单个 2080Ti GPU 上的推理速度仍可超过 30 FPS。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
AIP Advances
AIP Advances NANOSCIENCE & NANOTECHNOLOGY-MATERIALS SCIENCE, MULTIDISCIPLINARY
CiteScore
2.80
自引率
6.20%
发文量
1233
审稿时长
2-4 weeks
期刊介绍: AIP Advances is an open access journal publishing in all areas of physical sciences—applied, theoretical, and experimental. All published articles are freely available to read, download, and share. The journal prides itself on the belief that all good science is important and relevant. Our inclusive scope and publication standards make it an essential outlet for scientists in the physical sciences. AIP Advances is a community-based journal, with a fast production cycle. The quick publication process and open-access model allows us to quickly distribute new scientific concepts. Our Editors, assisted by peer review, determine whether a manuscript is technically correct and original. After publication, the readership evaluates whether a manuscript is timely, relevant, or significant.
期刊最新文献
Dimensional analysis of diffusive association rate equations. Mathematical analysis of the Wiener processes with time-delayed feedback Numerical simulation and experimental study of the dynamic characteristics of a gas turbine rotor system with beam sea and head sea excitation Design and simulation of a Ka-band frequency doubling gyroklystron amplifier Flexible and anisotropically conductive film by assembly of silicone rubber and cobalt-coated glass fiber composites
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1