基于多阶语义融合的行人多目标跟踪网络研究

IF 2.6 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC World Electric Vehicle Journal Pub Date : 2023-10-01 DOI:10.3390/wevj14100272

Cong Liu, Chao Han

{"title":"基于多阶语义融合的行人多目标跟踪网络研究","authors":"Cong Liu, Chao Han","doi":"10.3390/wevj14100272","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of insufficient tracking accuracy caused by object occlusion in the process of multi-object tracking, this paper proposes a multi-order semantic fusion pedestrian multi-object tracking network. Firstly, the feature pyramid attention module is used in the backbone network to enlarge the receptive field and obtain more abundant feature information to improve the detection accuracy of different scale objects. Secondly, a size-aware module is integrated into the pedestrian re-identification branch network to fuse semantic features from different resolutions and extract more basic pedestrian features, thereby improving the tracking accuracy. Finally, the detection head is reconstructed and the small object detection layer is fused to make the proposed network adapt to objects of different sizes. Experiments on the MOT16 and MOT17 datasets show that the multi-object tracking accuracy of the proposed network reaches 75.4% (MOT16) and 74.3% (MOT17), which effectively deals with the problem of low tracking accuracy caused by occlusion in the field of autonomous driving, and achieves good tracking results. The network proposed in this paper improves the tracking accuracy of pedestrians and provides a basis for further practical applications.","PeriodicalId":38979,"journal":{"name":"World Electric Vehicle Journal","volume":"14 1","pages":"0"},"PeriodicalIF":2.6000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Pedestrian Multi-Object Tracking Network Based on Multi-Order Semantic Fusion\",\"authors\":\"Cong Liu, Chao Han\",\"doi\":\"10.3390/wevj14100272\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem of insufficient tracking accuracy caused by object occlusion in the process of multi-object tracking, this paper proposes a multi-order semantic fusion pedestrian multi-object tracking network. Firstly, the feature pyramid attention module is used in the backbone network to enlarge the receptive field and obtain more abundant feature information to improve the detection accuracy of different scale objects. Secondly, a size-aware module is integrated into the pedestrian re-identification branch network to fuse semantic features from different resolutions and extract more basic pedestrian features, thereby improving the tracking accuracy. Finally, the detection head is reconstructed and the small object detection layer is fused to make the proposed network adapt to objects of different sizes. Experiments on the MOT16 and MOT17 datasets show that the multi-object tracking accuracy of the proposed network reaches 75.4% (MOT16) and 74.3% (MOT17), which effectively deals with the problem of low tracking accuracy caused by occlusion in the field of autonomous driving, and achieves good tracking results. The network proposed in this paper improves the tracking accuracy of pedestrians and provides a basis for further practical applications.\",\"PeriodicalId\":38979,\"journal\":{\"name\":\"World Electric Vehicle Journal\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2023-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"World Electric Vehicle Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/wevj14100272\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Electric Vehicle Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/wevj14100272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

针对多目标跟踪过程中物体遮挡导致跟踪精度不足的问题，本文提出了一种多阶语义融合行人多目标跟踪网络。首先，在骨干网络中使用特征金字塔关注模块，扩大接收野，获得更丰富的特征信息，提高对不同尺度目标的检测精度;其次，在行人再识别分支网络中集成尺寸感知模块，融合不同分辨率的语义特征，提取更多行人基本特征，提高跟踪精度;最后，对检测头进行重构，并融合小目标检测层，使所提网络能够适应不同大小的目标。在MOT16和MOT17数据集上的实验表明，本文提出的网络的多目标跟踪精度达到75.4% (MOT16)和74.3% (MOT17)，有效地解决了自动驾驶领域由于遮挡导致的跟踪精度低的问题，取得了良好的跟踪效果。本文提出的网络提高了行人的跟踪精度，为进一步的实际应用提供了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Research on Pedestrian Multi-Object Tracking Network Based on Multi-Order Semantic Fusion

Aiming at the problem of insufficient tracking accuracy caused by object occlusion in the process of multi-object tracking, this paper proposes a multi-order semantic fusion pedestrian multi-object tracking network. Firstly, the feature pyramid attention module is used in the backbone network to enlarge the receptive field and obtain more abundant feature information to improve the detection accuracy of different scale objects. Secondly, a size-aware module is integrated into the pedestrian re-identification branch network to fuse semantic features from different resolutions and extract more basic pedestrian features, thereby improving the tracking accuracy. Finally, the detection head is reconstructed and the small object detection layer is fused to make the proposed network adapt to objects of different sizes. Experiments on the MOT16 and MOT17 datasets show that the multi-object tracking accuracy of the proposed network reaches 75.4% (MOT16) and 74.3% (MOT17), which effectively deals with the problem of low tracking accuracy caused by occlusion in the field of autonomous driving, and achieves good tracking results. The network proposed in this paper improves the tracking accuracy of pedestrians and provides a basis for further practical applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊