Pedestrian Intrusion Detection in Railway Station Based on Mirror Translation Attention and Feature Pooling Enhancement

IF 3.2 2区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Signal Processing Letters Pub Date : 2024-09-30 DOI:10.1109/LSP.2024.3471180

Zhufeng Jiang;Hui Wang;Guoliang Luo;Zizhu Fan;Lu Xu

{"title":"Pedestrian Intrusion Detection in Railway Station Based on Mirror Translation Attention and Feature Pooling Enhancement","authors":"Zhufeng Jiang;Hui Wang;Guoliang Luo;Zizhu Fan;Lu Xu","doi":"10.1109/LSP.2024.3471180","DOIUrl":null,"url":null,"abstract":"Pedestrian intrusion detection is crucial to ensuring safe railway operation. Current pedestrian detection algorithms lack consideration for real-world railway scenarios, such as the reflective properties of screen doors and train windows, may mistakenly trigger pedestrian intrusion alerts. Scale variability and pedestrian overlap often lead to detection inaccuracy, making them inadequate for addressing the specific requirements of railway perimeter security. This letter introduces an innovative pedestrian detection algorithm that incorporates Mirror Translation Attention (MTA) and Feature Pooling Enhancement (FPE). MTA, including mirror flipping and offsetting the feature mapping, could significantly mitigate missed detection caused by reflective surfaces. Additionally, we introduce sparsity to the inputs of the self-attention, which significantly enhancing the model's inference speed. A multi-scale approach is adopted to accommodate the diversity in pedestrian sizes, while the FPE addresses occlusion issues across various scales. Compared to the advanced YOLOv8 model, the proposed method improves AP50 by 1.6% to 92.11% and reduces model parameters by 63.55% in our self-built railway pedestrian intrusion dataset.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10700649/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Pedestrian intrusion detection is crucial to ensuring safe railway operation. Current pedestrian detection algorithms lack consideration for real-world railway scenarios, such as the reflective properties of screen doors and train windows, may mistakenly trigger pedestrian intrusion alerts. Scale variability and pedestrian overlap often lead to detection inaccuracy, making them inadequate for addressing the specific requirements of railway perimeter security. This letter introduces an innovative pedestrian detection algorithm that incorporates Mirror Translation Attention (MTA) and Feature Pooling Enhancement (FPE). MTA, including mirror flipping and offsetting the feature mapping, could significantly mitigate missed detection caused by reflective surfaces. Additionally, we introduce sparsity to the inputs of the self-attention, which significantly enhancing the model's inference speed. A multi-scale approach is adopted to accommodate the diversity in pedestrian sizes, while the FPE addresses occlusion issues across various scales. Compared to the advanced YOLOv8 model, the proposed method improves AP50 by 1.6% to 92.11% and reduces model parameters by 63.55% in our self-built railway pedestrian intrusion dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于镜像平移注意和特征集合增强的火车站行人入侵检测

行人入侵检测对于确保铁路安全运行至关重要。目前的行人检测算法缺乏对现实世界铁路场景的考虑，例如屏蔽门和列车窗户的反射特性，可能会错误地触发行人入侵警报。尺度变化和行人重叠经常导致检测不准确，使其无法满足铁路周边安全的特殊要求。这封信介绍了一种创新的行人检测算法，该算法结合了镜像平移注意（MTA）和特征集合增强（FPE）。MTA 包括镜像翻转和偏移特征映射，可显著减少反射表面造成的漏检。此外，我们还为自我注意的输入引入了稀疏性，从而大大提高了模型的推理速度。我们采用了多尺度方法来适应行人大小的多样性，而 FPE 则解决了不同尺度上的遮挡问题。在我们自建的铁路行人入侵数据集中，与先进的 YOLOv8 模型相比，所提出的方法将 AP50 提高了 1.6% 至 92.11%，并将模型参数减少了 63.55%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Signal Processing Letters 工程技术-工程：电子与电气

CiteScore

7.40

自引率

12.80%

发文量

339

审稿时长

2.8 months

期刊介绍： The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.