{"title":"基于遥感图像的浅层复用和多尺度扩张卷积组合注意力定向目标检测","authors":"Jiangtao Wang , Jiawei Shi","doi":"10.1016/j.dsp.2024.104865","DOIUrl":null,"url":null,"abstract":"<div><div>Remote sensing images are becoming increasingly important in many areas of life because of the valuable information they provide. However, detecting objects in these images remains a difficult task due to their complex and variable characteristics, such as size, scale, and orientation. Moreover, there is a growing demand for efficient and speedy detection methods in practical applications. Therefore, in this paper, we propose a framework for oriented object detection in remote sensing images based on shallow multiplexing and multiscale dilation convolution combined attention. To achieve a lightweight network structure, we utilize ResNet18 as the backbone network. First, a shallow multiplexing module (SM) is designed to improve the utilization of detailed information in the shallow layer of the network. It enhances the interaction between the shallow and deep layers, resulting in a richer representation of network features. Second, a multiscale dilation convolution combined attention module (MDCA) is proposed to prioritize contextual information by using convolution with different dilation rates. This guides the network to focus more on the object information in remote sensing images. Then, the dilated encoder (DE) is employed at the feature fusion stage to enhance the semantic information of the context and produce a feature map with multiple receptive fields. Finally, the log<sub>2</sub> loss function is applied to improve the training results. The experiments are being conducted on three publicly available remote sensing image datasets, and the results demonstrate that the proposed algorithm outperforms other algorithms in terms of detection performance on these datasets. Code is available at <span><span>https://github.com/sbsfsum/SM-and-MDCA</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104865"},"PeriodicalIF":2.9000,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Shallow multiplexing and multiscale dilation convolution combined attention based oriented object detection in remote sensing images\",\"authors\":\"Jiangtao Wang , Jiawei Shi\",\"doi\":\"10.1016/j.dsp.2024.104865\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Remote sensing images are becoming increasingly important in many areas of life because of the valuable information they provide. However, detecting objects in these images remains a difficult task due to their complex and variable characteristics, such as size, scale, and orientation. Moreover, there is a growing demand for efficient and speedy detection methods in practical applications. Therefore, in this paper, we propose a framework for oriented object detection in remote sensing images based on shallow multiplexing and multiscale dilation convolution combined attention. To achieve a lightweight network structure, we utilize ResNet18 as the backbone network. First, a shallow multiplexing module (SM) is designed to improve the utilization of detailed information in the shallow layer of the network. It enhances the interaction between the shallow and deep layers, resulting in a richer representation of network features. Second, a multiscale dilation convolution combined attention module (MDCA) is proposed to prioritize contextual information by using convolution with different dilation rates. This guides the network to focus more on the object information in remote sensing images. Then, the dilated encoder (DE) is employed at the feature fusion stage to enhance the semantic information of the context and produce a feature map with multiple receptive fields. Finally, the log<sub>2</sub> loss function is applied to improve the training results. The experiments are being conducted on three publicly available remote sensing image datasets, and the results demonstrate that the proposed algorithm outperforms other algorithms in terms of detection performance on these datasets. Code is available at <span><span>https://github.com/sbsfsum/SM-and-MDCA</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"156 \",\"pages\":\"Article 104865\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200424004895\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200424004895","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Shallow multiplexing and multiscale dilation convolution combined attention based oriented object detection in remote sensing images
Remote sensing images are becoming increasingly important in many areas of life because of the valuable information they provide. However, detecting objects in these images remains a difficult task due to their complex and variable characteristics, such as size, scale, and orientation. Moreover, there is a growing demand for efficient and speedy detection methods in practical applications. Therefore, in this paper, we propose a framework for oriented object detection in remote sensing images based on shallow multiplexing and multiscale dilation convolution combined attention. To achieve a lightweight network structure, we utilize ResNet18 as the backbone network. First, a shallow multiplexing module (SM) is designed to improve the utilization of detailed information in the shallow layer of the network. It enhances the interaction between the shallow and deep layers, resulting in a richer representation of network features. Second, a multiscale dilation convolution combined attention module (MDCA) is proposed to prioritize contextual information by using convolution with different dilation rates. This guides the network to focus more on the object information in remote sensing images. Then, the dilated encoder (DE) is employed at the feature fusion stage to enhance the semantic information of the context and produce a feature map with multiple receptive fields. Finally, the log2 loss function is applied to improve the training results. The experiments are being conducted on three publicly available remote sensing image datasets, and the results demonstrate that the proposed algorithm outperforms other algorithms in terms of detection performance on these datasets. Code is available at https://github.com/sbsfsum/SM-and-MDCA.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,