Daniele Berardini , Lucia Migliorelli , Alessandro Galdelli , Manuel J. Marín-Jiménez
{"title":"Edge artificial intelligence and super-resolution for enhanced weapon detection in video surveillance","authors":"Daniele Berardini , Lucia Migliorelli , Alessandro Galdelli , Manuel J. Marín-Jiménez","doi":"10.1016/j.engappai.2024.109684","DOIUrl":null,"url":null,"abstract":"<div><div>The prevalence of crimes involving handguns and knives underscores the importance of early weapon detection. This, along with the spread of video surveillance systems, boosted the development of automatic approaches for weapon detection from surveillance cameras. Despite the advancements from classical computer vision to Deep Learning (DL) techniques, accurately detecting weapons in real-time remains challenging due to their small size. Current DL methods, which attempt to mitigate this issue using complex detection architectures, are resource-intensive, resulting in high costs and energy usage, and hindering their deployment on efficient edge devices. This creates challenges in resource-limited environments, making these methods impractical for edge and real-time applications. To address these shortcomings, our work proposes YOLOSR, which integrates a You Only Look Once (YOLO) v8-small model with an Enhanced Deep Super Resolution (EDSR)-based network using a shared backbone. During training, the auxiliary Super Resolution (SR) helps in learning better features, which could benefit the weapon detection task. During inference, the SR branch is removed, keeping the detector’s computational complexity unchanged. The YOLOSR’s accuracy and efficiency were validated on our WeaponSense dataset and on a NVIDIA Jetson Nano, against other weapon detectors. The results exhibited that YOLOSR, compared to the state-of-the-art YOLOv8-small model, maintained the same computational complexity with 28.8 billion floating point operations and on-device latency of 101 ms per image, while increasing the Average Precision by 10.2 percentage points. Thus, the YOLOSR emerges as an effective solution for real-time weapon detection in resource-constrained environments, achieving an optimal trade-off between efficiency and accuracy.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"140 ","pages":"Article 109684"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624018426","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The prevalence of crimes involving handguns and knives underscores the importance of early weapon detection. This, along with the spread of video surveillance systems, boosted the development of automatic approaches for weapon detection from surveillance cameras. Despite the advancements from classical computer vision to Deep Learning (DL) techniques, accurately detecting weapons in real-time remains challenging due to their small size. Current DL methods, which attempt to mitigate this issue using complex detection architectures, are resource-intensive, resulting in high costs and energy usage, and hindering their deployment on efficient edge devices. This creates challenges in resource-limited environments, making these methods impractical for edge and real-time applications. To address these shortcomings, our work proposes YOLOSR, which integrates a You Only Look Once (YOLO) v8-small model with an Enhanced Deep Super Resolution (EDSR)-based network using a shared backbone. During training, the auxiliary Super Resolution (SR) helps in learning better features, which could benefit the weapon detection task. During inference, the SR branch is removed, keeping the detector’s computational complexity unchanged. The YOLOSR’s accuracy and efficiency were validated on our WeaponSense dataset and on a NVIDIA Jetson Nano, against other weapon detectors. The results exhibited that YOLOSR, compared to the state-of-the-art YOLOv8-small model, maintained the same computational complexity with 28.8 billion floating point operations and on-device latency of 101 ms per image, while increasing the Average Precision by 10.2 percentage points. Thus, the YOLOSR emerges as an effective solution for real-time weapon detection in resource-constrained environments, achieving an optimal trade-off between efficiency and accuracy.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.