{"title":"基于 YOLOv7-tiny 的新型复杂道路场景实时物体检测方法","authors":"Yunfa Li, Hui Li","doi":"10.1007/s10586-024-04595-0","DOIUrl":null,"url":null,"abstract":"<p>Road object detection is a key technology in intelligent transportation systems, playing a crucial role in ensuring driving safety and enhancing driving experience. However, due to factors such as weather and visual occlusions, particularly in complex traffic scenes, the recognition rate and accuracy of object detection are often less than satisfactory, far from meeting the application demands of intelligent driving. In order to address the issues of weak generalization and low regression accuracy of image similarity evaluation metrics, we propose a new anchor box calculation algorithm. Building upon this, to tackle the problem of weak graphic attention and feature capture capabilities in the backbone network,We propose an improved CA attention mechanism. In addition, to address the issues of low detection accuracy and imprecise positioning of the model in complex traffic scenarios, we propose a new image enhancement module. we select the road traffic dataset BDD(Berkeley Deep Drive)100K as the benchmark evaluation dataset and divide the training and validation sets into six new categories. Through this series of strategies, a new real-time road object detection method suitable for complex traffic scenes is formed. To validate the effectiveness of this method, we conducted a series of experiments. The experimental results demonstrate that our proposed method achieves a mean average precision improvement of 3.61% compared to the YOLOv7-tiny method.</p>","PeriodicalId":501576,"journal":{"name":"Cluster Computing","volume":"24 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel real-time object detection method for complex road scenes based on YOLOv7-tiny\",\"authors\":\"Yunfa Li, Hui Li\",\"doi\":\"10.1007/s10586-024-04595-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Road object detection is a key technology in intelligent transportation systems, playing a crucial role in ensuring driving safety and enhancing driving experience. However, due to factors such as weather and visual occlusions, particularly in complex traffic scenes, the recognition rate and accuracy of object detection are often less than satisfactory, far from meeting the application demands of intelligent driving. In order to address the issues of weak generalization and low regression accuracy of image similarity evaluation metrics, we propose a new anchor box calculation algorithm. Building upon this, to tackle the problem of weak graphic attention and feature capture capabilities in the backbone network,We propose an improved CA attention mechanism. In addition, to address the issues of low detection accuracy and imprecise positioning of the model in complex traffic scenarios, we propose a new image enhancement module. we select the road traffic dataset BDD(Berkeley Deep Drive)100K as the benchmark evaluation dataset and divide the training and validation sets into six new categories. Through this series of strategies, a new real-time road object detection method suitable for complex traffic scenes is formed. To validate the effectiveness of this method, we conducted a series of experiments. The experimental results demonstrate that our proposed method achieves a mean average precision improvement of 3.61% compared to the YOLOv7-tiny method.</p>\",\"PeriodicalId\":501576,\"journal\":{\"name\":\"Cluster Computing\",\"volume\":\"24 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cluster Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s10586-024-04595-0\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cluster Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10586-024-04595-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
道路物体检测是智能交通系统中的一项关键技术,在确保驾驶安全和提升驾驶体验方面发挥着至关重要的作用。然而,由于天气、视觉遮挡等因素的影响,特别是在复杂的交通场景中,物体检测的识别率和准确率往往不尽如人意,远远不能满足智能驾驶的应用需求。针对图像相似性评价指标泛化能力弱、回归精度低的问题,我们提出了一种新的锚框计算算法。在此基础上,针对骨干网络图形注意力和特征捕捉能力较弱的问题,我们提出了一种改进的 CA 注意机制。此外,针对复杂交通场景下模型检测精度低、定位不精确等问题,我们提出了新的图像增强模块。我们选择道路交通数据集 BDD(Berkeley Deep Drive)100K 作为基准评估数据集,并将训练集和验证集划分为六个新类别。通过这一系列策略,形成了一种适用于复杂交通场景的新型实时道路物体检测方法。为了验证该方法的有效性,我们进行了一系列实验。实验结果表明,与 YOLOv7-tiny 方法相比,我们提出的方法平均精度提高了 3.61%。
A novel real-time object detection method for complex road scenes based on YOLOv7-tiny
Road object detection is a key technology in intelligent transportation systems, playing a crucial role in ensuring driving safety and enhancing driving experience. However, due to factors such as weather and visual occlusions, particularly in complex traffic scenes, the recognition rate and accuracy of object detection are often less than satisfactory, far from meeting the application demands of intelligent driving. In order to address the issues of weak generalization and low regression accuracy of image similarity evaluation metrics, we propose a new anchor box calculation algorithm. Building upon this, to tackle the problem of weak graphic attention and feature capture capabilities in the backbone network,We propose an improved CA attention mechanism. In addition, to address the issues of low detection accuracy and imprecise positioning of the model in complex traffic scenarios, we propose a new image enhancement module. we select the road traffic dataset BDD(Berkeley Deep Drive)100K as the benchmark evaluation dataset and divide the training and validation sets into six new categories. Through this series of strategies, a new real-time road object detection method suitable for complex traffic scenes is formed. To validate the effectiveness of this method, we conducted a series of experiments. The experimental results demonstrate that our proposed method achieves a mean average precision improvement of 3.61% compared to the YOLOv7-tiny method.