Cunshu Pan , Zhenhua Dai , Yuhao Zhang , Heshan Zhang , Mengwei Fan , Jin Xu
{"title":"基于计算机视觉从航空视频中精确提取车辆轨迹的方法","authors":"Cunshu Pan , Zhenhua Dai , Yuhao Zhang , Heshan Zhang , Mengwei Fan , Jin Xu","doi":"10.1016/j.measurement.2024.116212","DOIUrl":null,"url":null,"abstract":"<div><div>Vehicle trajectory data holds valuable information for advanced driving development and traffic analysis. While unmanned aerial vehicle (UAV) offer a broader perspective, the detection of small-scale vehicles in video frames still suffers from low accuracy or is even missed. This study proposes a comprehensive technical framework for accurate vehicle trajectory extraction, encompassing six main components: video stabilization, vehicle detection, vehicle tracking, lane marking detection, coordinate transformation, and data denoising. To mitigate video jitter, the SURF and FLANN stabilization algorithms are utilized. An enhanced detector based on You Only Look Once X (YOLOX) is employed for multi-target vehicle detection, incorporating a shallow feature extraction module within the detection head to improve the performance for low-level and small-scale features. Efficient Channel Attention (ECA) modules are integrated before the neck to further boost the expressiveness. Additionally, a sliding window inference method is applied at the input stage to prevent compression of high-resolution video frames. The Savitzky-Golay filter is used for trajectory noise reduction. Verification results demonstrate that the improved YOLOX achieves a mean average precision (mAP) of 88.7 %, an enhancement of 5.6 % over the original model. When compared to advanced YOLOv7 and YOLOv8 models, the proposed method increases mAP@50 by 7.63 % and 1.07 %, respectively. The Mostly Tracked (MT) trajectories metric reaches 98.9 %, and the root-mean-square error of one-sided localization is approximately 0.05 m. These results confirm that the proposed framework is an effective tool for high-accuracy vehicle trajectory data collection in traffic studies. Additionally, a vehicle trajectory dataset has been developed and is publicly accessible at <span><span>www.cqskyeyex.com</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":18349,"journal":{"name":"Measurement","volume":"242 ","pages":"Article 116212"},"PeriodicalIF":5.2000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An approach for accurately extracting vehicle trajectory from aerial videos based on computer vision\",\"authors\":\"Cunshu Pan , Zhenhua Dai , Yuhao Zhang , Heshan Zhang , Mengwei Fan , Jin Xu\",\"doi\":\"10.1016/j.measurement.2024.116212\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Vehicle trajectory data holds valuable information for advanced driving development and traffic analysis. While unmanned aerial vehicle (UAV) offer a broader perspective, the detection of small-scale vehicles in video frames still suffers from low accuracy or is even missed. This study proposes a comprehensive technical framework for accurate vehicle trajectory extraction, encompassing six main components: video stabilization, vehicle detection, vehicle tracking, lane marking detection, coordinate transformation, and data denoising. To mitigate video jitter, the SURF and FLANN stabilization algorithms are utilized. An enhanced detector based on You Only Look Once X (YOLOX) is employed for multi-target vehicle detection, incorporating a shallow feature extraction module within the detection head to improve the performance for low-level and small-scale features. Efficient Channel Attention (ECA) modules are integrated before the neck to further boost the expressiveness. Additionally, a sliding window inference method is applied at the input stage to prevent compression of high-resolution video frames. The Savitzky-Golay filter is used for trajectory noise reduction. Verification results demonstrate that the improved YOLOX achieves a mean average precision (mAP) of 88.7 %, an enhancement of 5.6 % over the original model. When compared to advanced YOLOv7 and YOLOv8 models, the proposed method increases mAP@50 by 7.63 % and 1.07 %, respectively. The Mostly Tracked (MT) trajectories metric reaches 98.9 %, and the root-mean-square error of one-sided localization is approximately 0.05 m. These results confirm that the proposed framework is an effective tool for high-accuracy vehicle trajectory data collection in traffic studies. Additionally, a vehicle trajectory dataset has been developed and is publicly accessible at <span><span>www.cqskyeyex.com</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":18349,\"journal\":{\"name\":\"Measurement\",\"volume\":\"242 \",\"pages\":\"Article 116212\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2024-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0263224124020979\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0263224124020979","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
车辆轨迹数据为先进的驾驶开发和交通分析提供了宝贵的信息。虽然无人驾驶飞行器(UAV)提供了更广阔的视角,但视频帧中小规模车辆的检测仍然存在准确率低甚至遗漏的问题。本研究提出了精确提取车辆轨迹的综合技术框架,包括六个主要部分:视频稳定、车辆检测、车辆跟踪、车道标记检测、坐标变换和数据去噪。为减少视频抖动,采用了 SURF 和 FLANN 稳定算法。多目标车辆检测采用了基于 You Only Look Once X (YOLOX) 的增强型检测器,并在检测头中加入了浅层特征提取模块,以提高低层次和小尺度特征的性能。在颈部之前集成了高效通道注意(ECA)模块,以进一步提高表现力。此外,在输入阶段还采用了滑动窗口推理方法,以防止压缩高分辨率视频帧。萨维茨基-戈莱滤波器用于轨迹降噪。验证结果表明,改进后的 YOLOX 平均精度 (mAP) 达到 88.7%,比原始模型提高了 5.6%。与先进的 YOLOv7 和 YOLOv8 模型相比,拟议方法的 mAP@50 分别提高了 7.63 % 和 1.07 %。这些结果证实了所提出的框架是交通研究中高精度车辆轨迹数据收集的有效工具。此外,还开发了一个车辆轨迹数据集,可在 www.cqskyeyex.com 上公开访问。
An approach for accurately extracting vehicle trajectory from aerial videos based on computer vision
Vehicle trajectory data holds valuable information for advanced driving development and traffic analysis. While unmanned aerial vehicle (UAV) offer a broader perspective, the detection of small-scale vehicles in video frames still suffers from low accuracy or is even missed. This study proposes a comprehensive technical framework for accurate vehicle trajectory extraction, encompassing six main components: video stabilization, vehicle detection, vehicle tracking, lane marking detection, coordinate transformation, and data denoising. To mitigate video jitter, the SURF and FLANN stabilization algorithms are utilized. An enhanced detector based on You Only Look Once X (YOLOX) is employed for multi-target vehicle detection, incorporating a shallow feature extraction module within the detection head to improve the performance for low-level and small-scale features. Efficient Channel Attention (ECA) modules are integrated before the neck to further boost the expressiveness. Additionally, a sliding window inference method is applied at the input stage to prevent compression of high-resolution video frames. The Savitzky-Golay filter is used for trajectory noise reduction. Verification results demonstrate that the improved YOLOX achieves a mean average precision (mAP) of 88.7 %, an enhancement of 5.6 % over the original model. When compared to advanced YOLOv7 and YOLOv8 models, the proposed method increases mAP@50 by 7.63 % and 1.07 %, respectively. The Mostly Tracked (MT) trajectories metric reaches 98.9 %, and the root-mean-square error of one-sided localization is approximately 0.05 m. These results confirm that the proposed framework is an effective tool for high-accuracy vehicle trajectory data collection in traffic studies. Additionally, a vehicle trajectory dataset has been developed and is publicly accessible at www.cqskyeyex.com.
期刊介绍:
Contributions are invited on novel achievements in all fields of measurement and instrumentation science and technology. Authors are encouraged to submit novel material, whose ultimate goal is an advancement in the state of the art of: measurement and metrology fundamentals, sensors, measurement instruments, measurement and estimation techniques, measurement data processing and fusion algorithms, evaluation procedures and methodologies for plants and industrial processes, performance analysis of systems, processes and algorithms, mathematical models for measurement-oriented purposes, distributed measurement systems in a connected world.