{"title":"利用单目特定物体距离估计和多模态真实世界位置计算的视障人士侧面碰撞检测模型","authors":"Wenqing Song, Yumeng Sun, Qixuan Huang, Junyang Cheok","doi":"10.47852/bonviewaia42022098","DOIUrl":null,"url":null,"abstract":"Targeting the potential risk of side-vehicle collisions when the visually impaired crosses roads, this study proposed a side collision detection model, including monocular distance estimation, multimodal real-world location estimation, future location prediction and collision warning strategies tailored for visually impaired pedestrians. The proposed model employs YOLOv8 and DeepSort for vehicle detection and tracking, utilizing shallow neural networks for distance estimation based on image information and vehicle position data. Predicted vehicle distances are combined with magnetic field sensor and GPS data to compute and store real-world vehicle locations, and these location data will be used for linear regression to forecast future locations. A warning strategy is then implemented to alert users. Experimental validation shows that the monocular distance estimation network has an Absolute Relative Error of 0.043 and an ALE (Average Localization Error) of 1.249m. In real-world location estimation, the view angle ALE is 0.019, and the location ALE is 1.778m. Regarding location prediction, the accuracy in distinguishing stationary and moving vehicles reaches 0.962, and the predicted curve, based on ground truth and predicted locations, exhibits good alignment. The proposed warning strategy, evaluated on Kitti Tracking Dataset and a self-created dataset, accurately detects the majority of potential collision risks.","PeriodicalId":518162,"journal":{"name":"Artificial Intelligence and Applications","volume":"16 9","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Side Collision Detection Model for Visually Impaired Using Monocular Object-specific Distance Estimation and Multimodal Real-World Location Calculation\",\"authors\":\"Wenqing Song, Yumeng Sun, Qixuan Huang, Junyang Cheok\",\"doi\":\"10.47852/bonviewaia42022098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Targeting the potential risk of side-vehicle collisions when the visually impaired crosses roads, this study proposed a side collision detection model, including monocular distance estimation, multimodal real-world location estimation, future location prediction and collision warning strategies tailored for visually impaired pedestrians. The proposed model employs YOLOv8 and DeepSort for vehicle detection and tracking, utilizing shallow neural networks for distance estimation based on image information and vehicle position data. Predicted vehicle distances are combined with magnetic field sensor and GPS data to compute and store real-world vehicle locations, and these location data will be used for linear regression to forecast future locations. A warning strategy is then implemented to alert users. Experimental validation shows that the monocular distance estimation network has an Absolute Relative Error of 0.043 and an ALE (Average Localization Error) of 1.249m. In real-world location estimation, the view angle ALE is 0.019, and the location ALE is 1.778m. Regarding location prediction, the accuracy in distinguishing stationary and moving vehicles reaches 0.962, and the predicted curve, based on ground truth and predicted locations, exhibits good alignment. The proposed warning strategy, evaluated on Kitti Tracking Dataset and a self-created dataset, accurately detects the majority of potential collision risks.\",\"PeriodicalId\":518162,\"journal\":{\"name\":\"Artificial Intelligence and Applications\",\"volume\":\"16 9\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47852/bonviewaia42022098\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47852/bonviewaia42022098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
针对视障人士横穿马路时与侧面车辆发生碰撞的潜在风险,本研究提出了一种侧面碰撞检测模型,包括单目距离估计、多模态真实世界位置估计、未来位置预测以及为视障行人量身定制的碰撞预警策略。建议的模型采用 YOLOv8 和 DeepSort 进行车辆检测和跟踪,利用浅层神经网络根据图像信息和车辆位置数据进行距离估计。预测的车辆距离与磁场传感器和 GPS 数据相结合,计算并存储真实世界中的车辆位置,这些位置数据将用于线性回归,以预测未来的位置。然后实施警告策略,提醒用户注意。实验验证表明,单目距离估计网络的绝对相对误差为 0.043,平均定位误差(ALE)为 1.249 米。在真实世界的位置估计中,视角 ALE 为 0.019,位置 ALE 为 1.778 米。在位置预测方面,区分静止和移动车辆的准确度达到 0.962,基于地面实况和预测位置的预测曲线显示出良好的一致性。在 Kitti 跟踪数据集和一个自建数据集上对所提出的预警策略进行了评估,结果表明该策略能准确检测到大多数潜在的碰撞风险。
Side Collision Detection Model for Visually Impaired Using Monocular Object-specific Distance Estimation and Multimodal Real-World Location Calculation
Targeting the potential risk of side-vehicle collisions when the visually impaired crosses roads, this study proposed a side collision detection model, including monocular distance estimation, multimodal real-world location estimation, future location prediction and collision warning strategies tailored for visually impaired pedestrians. The proposed model employs YOLOv8 and DeepSort for vehicle detection and tracking, utilizing shallow neural networks for distance estimation based on image information and vehicle position data. Predicted vehicle distances are combined with magnetic field sensor and GPS data to compute and store real-world vehicle locations, and these location data will be used for linear regression to forecast future locations. A warning strategy is then implemented to alert users. Experimental validation shows that the monocular distance estimation network has an Absolute Relative Error of 0.043 and an ALE (Average Localization Error) of 1.249m. In real-world location estimation, the view angle ALE is 0.019, and the location ALE is 1.778m. Regarding location prediction, the accuracy in distinguishing stationary and moving vehicles reaches 0.962, and the predicted curve, based on ground truth and predicted locations, exhibits good alignment. The proposed warning strategy, evaluated on Kitti Tracking Dataset and a self-created dataset, accurately detects the majority of potential collision risks.