{"title":"Real-time traffic object detection algorithm with deep stochastic configuration networks","authors":"Yongfu Wang , Yang Liu , Ran Yi , Yanchen Jiang","doi":"10.1016/j.ins.2024.121848","DOIUrl":null,"url":null,"abstract":"<div><div>In computer vision and intelligent transportation, object detection algorithms are a major research hotspot for improving the perception of autonomous vehicles. Although deep learning-based object identification algorithms perform well in terms of traffic object detection, they have low accuracy in complex road settings and poor real-time performance. In order to address these issues, this paper proposes a real-time traffic object recognition technique, namely the Toward Our Dream (TOD)-You Only Look Once version 7 (YOLOv7) method, that makes use of a lightweight network model with an improved Deep Stochastic Configuration Networks (DeepSCN). Firstly, the Dilatation Mingle (DM)-Spatial Pyramid Pooling Cross-Stage Partial Convolution (SPPCSPC) module boosts object identification performance for multi-scale objects, unifies semantic information from several size feature maps, and improves the network architecture of the YOLOv7 algorithm. Secondly, an improved DeepSCN is proposed, which improves the classification performance of the detecting head. Lastly, we carried out experiments on ablation, comparison, and visual validation. The experimental findings show that our lightweight technique is better than state-of-the-art methods in terms of accuracy and real-time performance for object detection.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"700 ","pages":"Article 121848"},"PeriodicalIF":6.8000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025524017626","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/6 0:00:00","PubModel":"Epub","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In computer vision and intelligent transportation, object detection algorithms are a major research hotspot for improving the perception of autonomous vehicles. Although deep learning-based object identification algorithms perform well in terms of traffic object detection, they have low accuracy in complex road settings and poor real-time performance. In order to address these issues, this paper proposes a real-time traffic object recognition technique, namely the Toward Our Dream (TOD)-You Only Look Once version 7 (YOLOv7) method, that makes use of a lightweight network model with an improved Deep Stochastic Configuration Networks (DeepSCN). Firstly, the Dilatation Mingle (DM)-Spatial Pyramid Pooling Cross-Stage Partial Convolution (SPPCSPC) module boosts object identification performance for multi-scale objects, unifies semantic information from several size feature maps, and improves the network architecture of the YOLOv7 algorithm. Secondly, an improved DeepSCN is proposed, which improves the classification performance of the detecting head. Lastly, we carried out experiments on ablation, comparison, and visual validation. The experimental findings show that our lightweight technique is better than state-of-the-art methods in terms of accuracy and real-time performance for object detection.
在计算机视觉和智能交通领域,目标检测算法是提高自动驾驶车辆感知能力的一个重要研究热点。尽管基于深度学习的目标识别算法在交通目标检测方面表现良好,但在复杂道路环境下准确率较低,实时性较差。为了解决这些问题,本文提出了一种实时交通目标识别技术,即迈向我们的梦想(TOD)-You Only Look Once version 7 (YOLOv7)方法,该方法利用轻量级网络模型和改进的深度随机配置网络(DeepSCN)。首先,扩展混合(DM)-空间金字塔池跨阶段部分卷积(SPPCSPC)模块提高了多尺度目标的识别性能,统一了多个尺寸特征图的语义信息,改进了YOLOv7算法的网络结构;其次,提出了一种改进的深度神经网络,提高了检测头的分类性能。最后,我们进行了消融实验、对比实验和视觉验证实验。实验结果表明,我们的轻量化技术在目标检测的准确性和实时性方面优于目前最先进的方法。
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.