Ivan V. Saetchnikov;Victor V. Skakun;Elina A. Tcherniavskaia
{"title":"基于深度神经网络的机载无人机动态目标识别与鲁棒多目标跟踪技术","authors":"Ivan V. Saetchnikov;Victor V. Skakun;Elina A. Tcherniavskaia","doi":"10.1109/JMASS.2023.3274929","DOIUrl":null,"url":null,"abstract":"Computer vision-based systems seem highly perspective for semantic analysis of the dynamical objects. However, considering dynamical object recognition and tracking from the unmanned aerial vehicle (UAV) the task to design a robust model for data association is highly challenging due to additional issues, e.g., image degradation, nonfixed object camera distance and shooting focus, and real-time issues. Thus, we propose an accurate deep neural network-based dynamical object recognition and robust multiobject tracking technique based on bidirectional LSTM with the optimized motion and appearance gates as a multiobject tracking backbone, supported by an advanced single-shot detector network improved with residual prediction model and implemented a DenseNet network as well as a YOLOv4eff network as feature extraction. The technique has been trained on VisDrone 2022 and UAVDT datasets with the side-shoot dynamical objects at a height of up to 50 m. The performance analysis on the test stage performed on seven metrics demonstrate that the proposed technique surpasses, by accuracy and robustness ability, other state-of-the-art techniques based on two cumulative MOTA and MOTP, as well as MT and IDsw. In particular, we have dramatically decreased the number of IDsw which implies a better capability to handle several occlusions, which is a desirable property in real-time multiple object tracking. We have pointed out the sensitivity of the tracking performance of our technique on the number of utilizing different sequence lengths and have defined an optimum. Finally, the applicability and reliability of the proposed technique for onboard UAV computer-based systems have been discussed.","PeriodicalId":100624,"journal":{"name":"IEEE Journal on Miniaturization for Air and Space Systems","volume":"4 3","pages":"250-256"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Neural Network-Based Dynamical Object Recognition and Robust Multiobject Tracking Technique for Onboard Unmanned Aerial Vehicle’s Computer Vision-Based Systems\",\"authors\":\"Ivan V. Saetchnikov;Victor V. Skakun;Elina A. Tcherniavskaia\",\"doi\":\"10.1109/JMASS.2023.3274929\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Computer vision-based systems seem highly perspective for semantic analysis of the dynamical objects. However, considering dynamical object recognition and tracking from the unmanned aerial vehicle (UAV) the task to design a robust model for data association is highly challenging due to additional issues, e.g., image degradation, nonfixed object camera distance and shooting focus, and real-time issues. Thus, we propose an accurate deep neural network-based dynamical object recognition and robust multiobject tracking technique based on bidirectional LSTM with the optimized motion and appearance gates as a multiobject tracking backbone, supported by an advanced single-shot detector network improved with residual prediction model and implemented a DenseNet network as well as a YOLOv4eff network as feature extraction. The technique has been trained on VisDrone 2022 and UAVDT datasets with the side-shoot dynamical objects at a height of up to 50 m. The performance analysis on the test stage performed on seven metrics demonstrate that the proposed technique surpasses, by accuracy and robustness ability, other state-of-the-art techniques based on two cumulative MOTA and MOTP, as well as MT and IDsw. In particular, we have dramatically decreased the number of IDsw which implies a better capability to handle several occlusions, which is a desirable property in real-time multiple object tracking. We have pointed out the sensitivity of the tracking performance of our technique on the number of utilizing different sequence lengths and have defined an optimum. Finally, the applicability and reliability of the proposed technique for onboard UAV computer-based systems have been discussed.\",\"PeriodicalId\":100624,\"journal\":{\"name\":\"IEEE Journal on Miniaturization for Air and Space Systems\",\"volume\":\"4 3\",\"pages\":\"250-256\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal on Miniaturization for Air and Space Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10122792/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal on Miniaturization for Air and Space Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10122792/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Neural Network-Based Dynamical Object Recognition and Robust Multiobject Tracking Technique for Onboard Unmanned Aerial Vehicle’s Computer Vision-Based Systems
Computer vision-based systems seem highly perspective for semantic analysis of the dynamical objects. However, considering dynamical object recognition and tracking from the unmanned aerial vehicle (UAV) the task to design a robust model for data association is highly challenging due to additional issues, e.g., image degradation, nonfixed object camera distance and shooting focus, and real-time issues. Thus, we propose an accurate deep neural network-based dynamical object recognition and robust multiobject tracking technique based on bidirectional LSTM with the optimized motion and appearance gates as a multiobject tracking backbone, supported by an advanced single-shot detector network improved with residual prediction model and implemented a DenseNet network as well as a YOLOv4eff network as feature extraction. The technique has been trained on VisDrone 2022 and UAVDT datasets with the side-shoot dynamical objects at a height of up to 50 m. The performance analysis on the test stage performed on seven metrics demonstrate that the proposed technique surpasses, by accuracy and robustness ability, other state-of-the-art techniques based on two cumulative MOTA and MOTP, as well as MT and IDsw. In particular, we have dramatically decreased the number of IDsw which implies a better capability to handle several occlusions, which is a desirable property in real-time multiple object tracking. We have pointed out the sensitivity of the tracking performance of our technique on the number of utilizing different sequence lengths and have defined an optimum. Finally, the applicability and reliability of the proposed technique for onboard UAV computer-based systems have been discussed.