文献互助智能选刊最新文献

高级搜索发布求助登录注册

Transfer Learning-Based YOLOv3 Model for Road Dense Object Detection

IF 2.9 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Information (Switzerland) Pub Date : 2023-10-12 DOI:10.3390/info14100560

Chunhua Zhu, Jiarui Liang, Fei Zhou

{"title":"Transfer Learning-Based YOLOv3 Model for Road Dense Object Detection","authors":"Chunhua Zhu, Jiarui Liang, Fei Zhou","doi":"10.3390/info14100560","DOIUrl":null,"url":null,"abstract":"Stemming from the overlap of objects and undertraining due to few samples, road dense object detection is confronted with poor object identification performance and the inability to recognize edge objects. Based on this, one transfer learning-based YOLOv3 approach for identifying dense objects on the road has been proposed. Firstly, the Darknet-53 network structure is adopted to obtain a pre-trained YOLOv3 model. Then, the transfer training is introduced as the output layer for the special dataset of 2000 images containing vehicles. In the proposed model, one random function is adapted to initialize and optimize the weights of the transfer training model, which is separately designed from the pre-trained YOLOv3. The object detection classifier replaces the fully connected layer, which further improves the detection effect. The reduced size of the network model can further reduce the training and detection time. As a result, it can be better applied to actual scenarios. The experimental results demonstrate that the object detection accuracy of the presented approach is 87.75% for the Pascal VOC 2007 dataset, which is superior to the traditional YOLOv3 and the YOLOv5 by 4% and 0.59%, respectively. Additionally, the test was carried out using UA-DETRAC, a public road vehicle detection dataset. The object detection accuracy of the presented approach reaches 79.23% in detecting images, which is 4.13% better than the traditional YOLOv3, and compared with the existing relatively new object detection algorithm YOLOv5, the detection accuracy is 1.36% better. Moreover, the detection speed of the proposed YOLOv3 method reaches 31.2 Fps/s in detecting images, which is 7.6 Fps/s faster than the traditional YOLOv3, and compared with the existing new object detection algorithm YOLOv7, the speed is 1.5 Fps/s faster. The proposed YOLOv3 performs 67.36 Bn of floating point operations per second in detecting video, which is obviously less than the traditional YOLOv3 and the newer object detection algorithm YOLOv5.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"11 1","pages":"0"},"PeriodicalIF":2.9000,"publicationDate":"2023-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information (Switzerland)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/info14100560","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Stemming from the overlap of objects and undertraining due to few samples, road dense object detection is confronted with poor object identification performance and the inability to recognize edge objects. Based on this, one transfer learning-based YOLOv3 approach for identifying dense objects on the road has been proposed. Firstly, the Darknet-53 network structure is adopted to obtain a pre-trained YOLOv3 model. Then, the transfer training is introduced as the output layer for the special dataset of 2000 images containing vehicles. In the proposed model, one random function is adapted to initialize and optimize the weights of the transfer training model, which is separately designed from the pre-trained YOLOv3. The object detection classifier replaces the fully connected layer, which further improves the detection effect. The reduced size of the network model can further reduce the training and detection time. As a result, it can be better applied to actual scenarios. The experimental results demonstrate that the object detection accuracy of the presented approach is 87.75% for the Pascal VOC 2007 dataset, which is superior to the traditional YOLOv3 and the YOLOv5 by 4% and 0.59%, respectively. Additionally, the test was carried out using UA-DETRAC, a public road vehicle detection dataset. The object detection accuracy of the presented approach reaches 79.23% in detecting images, which is 4.13% better than the traditional YOLOv3, and compared with the existing relatively new object detection algorithm YOLOv5, the detection accuracy is 1.36% better. Moreover, the detection speed of the proposed YOLOv3 method reaches 31.2 Fps/s in detecting images, which is 7.6 Fps/s faster than the traditional YOLOv3, and compared with the existing new object detection algorithm YOLOv7, the speed is 1.5 Fps/s faster. The proposed YOLOv3 performs 67.36 Bn of floating point operations per second in detecting video, which is obviously less than the traditional YOLOv3 and the newer object detection algorithm YOLOv5.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于迁移学习的YOLOv3道路密集目标检测模型

道路密集目标检测由于目标重叠和样本少导致训练不足，存在目标识别性能差和无法识别边缘目标的问题。在此基础上，提出了一种基于迁移学习的YOLOv3道路密集物体识别方法。首先，采用Darknet-53网络结构，得到预训练好的YOLOv3模型;然后，对包含2000张车辆图像的特殊数据集引入迁移训练作为输出层。在该模型中，采用一个随机函数来初始化和优化迁移训练模型的权重，该模型与预训练的YOLOv3分开设计。目标检测分类器取代了全连通层，进一步提高了检测效果。网络模型的缩小可以进一步减少训练和检测时间。因此，它可以更好地应用于实际场景。实验结果表明，对于Pascal VOC 2007数据集，该方法的目标检测准确率为87.75%，比传统的YOLOv3和YOLOv5分别提高了4%和0.59%。此外，测试还使用了公共道路车辆检测数据集UA-DETRAC进行。在检测图像时，该方法的目标检测准确率达到79.23%，比传统的YOLOv3算法提高4.13%，与现有相对较新的目标检测算法YOLOv5算法相比，检测准确率提高1.36%。此外，所提出的YOLOv3方法在检测图像时的检测速度达到31.2 Fps/s，比传统的YOLOv3提高了7.6 Fps/s，与现有的新目标检测算法YOLOv7相比，速度提高了1.5 Fps/s。本文提出的YOLOv3在检测视频时每秒进行673.6亿次浮点运算，明显低于传统的YOLOv3和较新的目标检测算法YOLOv5。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Information (Switzerland)

Information (Switzerland) Computer Science-Information Systems

CiteScore

6.90

自引率

0.00%

发文量

515

审稿时长

11 weeks

期刊最新文献

Effect of Elevated Temperature on Physical Activity and Falls in Low-Income Older Adults Using Zero-Inflated Poisson and Graphical Models. AI-Based Detection of Optical Microscopic Images of Pseudomonas aeruginosa in Planktonic and Biofilm States. Multimodal Brain Growth Patterns: Insights from Canonical Correlation Analysis and Deep Canonical Correlation Analysis with Auto-Encoder. Multi-Modal Fusion of Routine Care Electronic Health Records (EHR): A Scoping Review. Weakly Supervised Learning Approach for Implicit Aspect Extraction

0

微信

客服QQ

Book学术公众号

扫码关注我们

反馈

Book学术官方微信

Book学术文献互助

Book学术文献互助群
群号：604180095

文献互助智能选刊最新文献互助须知联系我们：info@booksci.cn

Book学术提供免费学术资源搜索服务，方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。

Copyright © 2023 Book学术 All rights reserved.

京公网安备 11010802042870号京ICP备2023020795号-1