Pedestrian Intrusion Detection in Railway Station Based on Mirror Translation Attention and Feature Pooling Enhancement
Authors: Zhufeng Jiang; Hui Wang; Guoliang Luo; Zizhu Fan; Lu Xu
Journal: IEEE Signal Processing Letters (impact factor 3.2; JCR Q2, Engineering, Electrical & Electronic)
Publication date: 2024-09-30
DOI: 10.1109/LSP.2024.3471180
URL: https://ieeexplore.ieee.org/document/10700649/
Citations: 0
Abstract
Pedestrian intrusion detection is crucial to ensuring safe railway operation. Current pedestrian detection algorithms give little consideration to real-world railway scenarios: for example, reflections from screen doors and train windows may mistakenly trigger pedestrian intrusion alerts. Scale variability and pedestrian overlap also often lead to inaccurate detection, leaving these algorithms inadequate for the specific requirements of railway perimeter security. This letter introduces an innovative pedestrian detection algorithm that incorporates Mirror Translation Attention (MTA) and Feature Pooling Enhancement (FPE). MTA, which mirror-flips and offsets the feature map, can significantly mitigate missed detections caused by reflective surfaces. Additionally, we introduce sparsity to the inputs of the self-attention, which significantly enhances the model's inference speed. A multi-scale approach is adopted to accommodate the diversity in pedestrian sizes, while FPE addresses occlusion issues across various scales. Compared to the advanced YOLOv8 model, the proposed method improves AP50 by 1.6% to 92.11% and reduces model parameters by 63.55% on our self-built railway pedestrian intrusion dataset.
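The abstract names two operations inside MTA, mirror flipping and offsetting the feature map, without giving their details. The following is only a minimal sketch of those two operations on a toy 2-D feature map, not the authors' implementation: the function name `mirror_translate`, the `shift` parameter, the horizontal flip axis, and the zero-fill policy for vacated cells are all assumptions made for illustration.

```python
from typing import List

Row = List[float]


def mirror_translate(feature_map: List[Row], shift: int) -> List[Row]:
    """Flip a 2-D (H x W) feature map left-to-right, then shift it along the width.

    Hypothetical helper: cells vacated by the shift are zero-filled and
    values pushed past the border are dropped (no wrap-around).
    """
    out = []
    for row in feature_map:
        flipped = row[::-1]           # mirror flip along the width axis
        width = len(flipped)
        shifted = [0.0] * width       # zero-filled target row
        for x, value in enumerate(flipped):
            target = x + shift        # offset each value by `shift` columns
            if 0 <= target < width:
                shifted[target] = value
        out.append(shifted)
    return out


# Toy 2 x 4 feature map (assumed values, for illustration only).
fmap = [[0.0, 1.0, 2.0, 3.0],
        [4.0, 5.0, 6.0, 7.0]]
mirrored = mirror_translate(fmap, shift=1)
```

In a real detector the same idea would apply per channel to a tensor, and how the mirrored, offset map is combined with the original inside the attention module is something only the full paper specifies.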
About the journal:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshops organized by the Signal Processing Society.