PDT-YOLO: A Roadside Object-Detection Algorithm for Multiscale and Occluded Targets

Ruoying Liu, Miaohua Huang, Liangzi Wang, Chengcheng Bi, Ye Tao
Sensors (Basel, Switzerland). Published 2024-04-01. DOI: 10.3390/s24072302 (https://doi.org/10.3390/s24072302)

Abstract

Intelligent roadside perception systems face three problems in detection tasks: weak sensing of multi-scale objects, high missed-detection rates for occluded targets, and models that are difficult to deploy. To address them, we propose PDT-YOLO, an algorithm based on YOLOv7-tiny. First, we introduce the intra-scale feature interaction module (AIFI) and reconstruct the feature pyramid structure to improve detection accuracy for multi-scale targets. Second, we use a lightweight convolution module (GSConv) to construct a multi-scale efficient layer aggregation network module (ETG), which strengthens feature extraction while keeping the network lightweight. Third, we integrate multiple attention mechanisms to improve the feature representation of occluded targets in complex scenes. Finally, we adopt Wise-IoU with a dynamic non-monotonic focusing mechanism to improve the accuracy and generalization of the model. Compared with YOLOv7-tiny, PDT-YOLO improves mAP50 and mAP50:95 by 4.6% and 12.8% on the DAIR-V2X-C dataset, with a parameter count of 6.1 million, and by 15.7% and 11.1% on the IVODC dataset. We deployed PDT-YOLO in a real traffic environment on the Robot Operating System (ROS), where it reached a detection frame rate of 90 FPS, which meets the needs of roadside object detection and edge deployment in complex traffic scenes.
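
The mAP50 and mAP50:95 figures above both rest on the intersection over union (IoU) between a predicted box and a ground-truth box. The sketch below is illustrative only and is not taken from the paper: it computes plain IoU for corner-format boxes, not the Wise-IoU loss, and the names `iou`, `is_match`, and `MAP50_95_THRESHOLDS` are hypothetical. It assumes the common COCO-style convention in which mAP50 counts a detection as correct at IoU >= 0.5 and mAP50:95 averages over thresholds from 0.50 to 0.95 in steps of 0.05.

```python
from typing import Tuple

# Assumed box layout: (x1, y1, x2, y2) with x1 <= x2 and y1 <= y2.
Box = Tuple[float, float, float, float]

# COCO-style IoU thresholds assumed for mAP50:95: 0.50, 0.55, ..., 0.95.
MAP50_95_THRESHOLDS = [round(0.5 + 0.05 * i, 2) for i in range(10)]


def iou(a: Box, b: Box) -> float:
    """Plain intersection over union of two axis-aligned boxes."""
    # Corners of the overlap rectangle (may be empty).
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(pred: Box, truth: Box, threshold: float = 0.5) -> bool:
    """True if the prediction overlaps the ground truth at or above the threshold."""
    return iou(pred, truth) >= threshold
```

For example, a prediction covering exactly half of a ground-truth box that contains it has an IoU of 0.5, so it counts as a match for mAP50 but not at the stricter thresholds that mAP50:95 also averages over. This is one reason the two metrics can move by different amounts.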