{"title":"Real-time traffic object detection algorithm with deep stochastic configuration networks","authors":"Yongfu Wang , Yang Liu , Ran Yi , Yanchen Jiang","doi":"10.1016/j.ins.2024.121848","DOIUrl":null,"url":null,"abstract":"<div><div>In computer vision and intelligent transportation, object detection algorithms are a major research hotspot for improving the perception of autonomous vehicles. Although deep learning-based object identification algorithms perform well in terms of traffic object detection, they have low accuracy in complex road settings and poor real-time performance. In order to address these issues, this paper proposes a real-time traffic object recognition technique, namely the Toward Our Dream (TOD)-You Only Look Once version 7 (YOLOv7) method, that makes use of a lightweight network model with an improved Deep Stochastic Configuration Networks (DeepSCN). Firstly, the Dilatation Mingle (DM)-Spatial Pyramid Pooling Cross-Stage Partial Convolution (SPPCSPC) module boosts object identification performance for multi-scale objects, unifies semantic information from several size feature maps, and improves the network architecture of the YOLOv7 algorithm. Secondly, an improved DeepSCN is proposed, which improves the classification performance of the detecting head. Lastly, we carried out experiments on ablation, comparison, and visual validation. The experimental findings show that our lightweight technique is better than state-of-the-art methods in terms of accuracy and real-time performance for object detection.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"700 ","pages":"Article 121848"},"PeriodicalIF":8.1000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025524017626","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In computer vision and intelligent transportation, object detection algorithms are a major research hotspot for improving the perception of autonomous vehicles. Although deep learning-based object identification algorithms perform well in terms of traffic object detection, they have low accuracy in complex road settings and poor real-time performance. In order to address these issues, this paper proposes a real-time traffic object recognition technique, namely the Toward Our Dream (TOD)-You Only Look Once version 7 (YOLOv7) method, that makes use of a lightweight network model with an improved Deep Stochastic Configuration Networks (DeepSCN). Firstly, the Dilatation Mingle (DM)-Spatial Pyramid Pooling Cross-Stage Partial Convolution (SPPCSPC) module boosts object identification performance for multi-scale objects, unifies semantic information from several size feature maps, and improves the network architecture of the YOLOv7 algorithm. Secondly, an improved DeepSCN is proposed, which improves the classification performance of the detecting head. Lastly, we carried out experiments on ablation, comparison, and visual validation. The experimental findings show that our lightweight technique is better than state-of-the-art methods in terms of accuracy and real-time performance for object detection.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.