{"title":"基于改进的 Yolov7 目标检测的遥感图像定位","authors":"Cui Li, Jiao Wang","doi":"10.1007/s10044-024-01276-x","DOIUrl":null,"url":null,"abstract":"<p>Target detection, as a core issue in the field of computer vision, is widely applied in many key areas such as face recognition, license plate recognition, security protection, and driverless driving. Although its detection speed and accuracy continue to break records, there are still many challenges and difficulties in target detection of remote sensing images, which require further in-depth research and exploration. Remote sensing images can be regarded as a \"three-dimensional data cube\", with more complex background information, dense and small object targets, and more severe weather interference factors. These factors lead to large positioning errors and low detection accuracy in the target detection process of remote sensing images. An improved YOLOv7 object detection model is proposed to address the problem of high false negative rate for dense and small objects in remote sensing images. Firstly, the GAM attention mechanism is introduced, and a global scheduling mechanism is proposed to improve the performance of deep neural networks by reducing information reduction and expanding global interaction representations, thus enhancing the network's sensitivity to targets. Secondly, the loss function CIoU in the original Yolov7 network model is replaced by SIoU, aiming to optimize the loss function, reduce losses, and improve the generalization of the network. Finally, the model is tested on the public available RSOD remote sensing dataset, and its generalization is verified on the Okahublot FloW-Img sub-dataset. The results showed that the accuracy (MAP@0.5) of detecting objects improved by 1.7 percentage points and 1.5 percentage points respectively for the improved Yolov7 network model compared to the original model, effectively improves the accuracy of detecting small targets in remote sensing images and solves the problem of leakage detection of small targets in remote sensing images.</p>","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"11 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Remote sensing image location based on improved Yolov7 target detection\",\"authors\":\"Cui Li, Jiao Wang\",\"doi\":\"10.1007/s10044-024-01276-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Target detection, as a core issue in the field of computer vision, is widely applied in many key areas such as face recognition, license plate recognition, security protection, and driverless driving. Although its detection speed and accuracy continue to break records, there are still many challenges and difficulties in target detection of remote sensing images, which require further in-depth research and exploration. Remote sensing images can be regarded as a \\\"three-dimensional data cube\\\", with more complex background information, dense and small object targets, and more severe weather interference factors. These factors lead to large positioning errors and low detection accuracy in the target detection process of remote sensing images. An improved YOLOv7 object detection model is proposed to address the problem of high false negative rate for dense and small objects in remote sensing images. Firstly, the GAM attention mechanism is introduced, and a global scheduling mechanism is proposed to improve the performance of deep neural networks by reducing information reduction and expanding global interaction representations, thus enhancing the network's sensitivity to targets. Secondly, the loss function CIoU in the original Yolov7 network model is replaced by SIoU, aiming to optimize the loss function, reduce losses, and improve the generalization of the network. Finally, the model is tested on the public available RSOD remote sensing dataset, and its generalization is verified on the Okahublot FloW-Img sub-dataset. The results showed that the accuracy (MAP@0.5) of detecting objects improved by 1.7 percentage points and 1.5 percentage points respectively for the improved Yolov7 network model compared to the original model, effectively improves the accuracy of detecting small targets in remote sensing images and solves the problem of leakage detection of small targets in remote sensing images.</p>\",\"PeriodicalId\":54639,\"journal\":{\"name\":\"Pattern Analysis and Applications\",\"volume\":\"11 1\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Analysis and Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s10044-024-01276-x\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Analysis and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10044-024-01276-x","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Remote sensing image location based on improved Yolov7 target detection
Target detection, as a core issue in the field of computer vision, is widely applied in many key areas such as face recognition, license plate recognition, security protection, and driverless driving. Although its detection speed and accuracy continue to break records, there are still many challenges and difficulties in target detection of remote sensing images, which require further in-depth research and exploration. Remote sensing images can be regarded as a "three-dimensional data cube", with more complex background information, dense and small object targets, and more severe weather interference factors. These factors lead to large positioning errors and low detection accuracy in the target detection process of remote sensing images. An improved YOLOv7 object detection model is proposed to address the problem of high false negative rate for dense and small objects in remote sensing images. Firstly, the GAM attention mechanism is introduced, and a global scheduling mechanism is proposed to improve the performance of deep neural networks by reducing information reduction and expanding global interaction representations, thus enhancing the network's sensitivity to targets. Secondly, the loss function CIoU in the original Yolov7 network model is replaced by SIoU, aiming to optimize the loss function, reduce losses, and improve the generalization of the network. Finally, the model is tested on the public available RSOD remote sensing dataset, and its generalization is verified on the Okahublot FloW-Img sub-dataset. The results showed that the accuracy (MAP@0.5) of detecting objects improved by 1.7 percentage points and 1.5 percentage points respectively for the improved Yolov7 network model compared to the original model, effectively improves the accuracy of detecting small targets in remote sensing images and solves the problem of leakage detection of small targets in remote sensing images.
期刊介绍:
The journal publishes high quality articles in areas of fundamental research in intelligent pattern analysis and applications in computer science and engineering. It aims to provide a forum for original research which describes novel pattern analysis techniques and industrial applications of the current technology. In addition, the journal will also publish articles on pattern analysis applications in medical imaging. The journal solicits articles that detail new technology and methods for pattern recognition and analysis in applied domains including, but not limited to, computer vision and image processing, speech analysis, robotics, multimedia, document analysis, character recognition, knowledge engineering for pattern recognition, fractal analysis, and intelligent control. The journal publishes articles on the use of advanced pattern recognition and analysis methods including statistical techniques, neural networks, genetic algorithms, fuzzy pattern recognition, machine learning, and hardware implementations which are either relevant to the development of pattern analysis as a research area or detail novel pattern analysis applications. Papers proposing new classifier systems or their development, pattern analysis systems for real-time applications, fuzzy and temporal pattern recognition and uncertainty management in applied pattern recognition are particularly solicited.