基于遥感图像的浅层复用和多尺度扩张卷积组合注意力定向目标检测

IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Digital Signal Processing Pub Date : 2024-11-10 DOI:10.1016/j.dsp.2024.104865
Jiangtao Wang , Jiawei Shi
{"title":"基于遥感图像的浅层复用和多尺度扩张卷积组合注意力定向目标检测","authors":"Jiangtao Wang ,&nbsp;Jiawei Shi","doi":"10.1016/j.dsp.2024.104865","DOIUrl":null,"url":null,"abstract":"<div><div>Remote sensing images are becoming increasingly important in many areas of life because of the valuable information they provide. However, detecting objects in these images remains a difficult task due to their complex and variable characteristics, such as size, scale, and orientation. Moreover, there is a growing demand for efficient and speedy detection methods in practical applications. Therefore, in this paper, we propose a framework for oriented object detection in remote sensing images based on shallow multiplexing and multiscale dilation convolution combined attention. To achieve a lightweight network structure, we utilize ResNet18 as the backbone network. First, a shallow multiplexing module (SM) is designed to improve the utilization of detailed information in the shallow layer of the network. It enhances the interaction between the shallow and deep layers, resulting in a richer representation of network features. Second, a multiscale dilation convolution combined attention module (MDCA) is proposed to prioritize contextual information by using convolution with different dilation rates. This guides the network to focus more on the object information in remote sensing images. Then, the dilated encoder (DE) is employed at the feature fusion stage to enhance the semantic information of the context and produce a feature map with multiple receptive fields. Finally, the log<sub>2</sub> loss function is applied to improve the training results. The experiments are being conducted on three publicly available remote sensing image datasets, and the results demonstrate that the proposed algorithm outperforms other algorithms in terms of detection performance on these datasets. Code is available at <span><span>https://github.com/sbsfsum/SM-and-MDCA</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104865"},"PeriodicalIF":2.9000,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Shallow multiplexing and multiscale dilation convolution combined attention based oriented object detection in remote sensing images\",\"authors\":\"Jiangtao Wang ,&nbsp;Jiawei Shi\",\"doi\":\"10.1016/j.dsp.2024.104865\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Remote sensing images are becoming increasingly important in many areas of life because of the valuable information they provide. However, detecting objects in these images remains a difficult task due to their complex and variable characteristics, such as size, scale, and orientation. Moreover, there is a growing demand for efficient and speedy detection methods in practical applications. Therefore, in this paper, we propose a framework for oriented object detection in remote sensing images based on shallow multiplexing and multiscale dilation convolution combined attention. To achieve a lightweight network structure, we utilize ResNet18 as the backbone network. First, a shallow multiplexing module (SM) is designed to improve the utilization of detailed information in the shallow layer of the network. It enhances the interaction between the shallow and deep layers, resulting in a richer representation of network features. Second, a multiscale dilation convolution combined attention module (MDCA) is proposed to prioritize contextual information by using convolution with different dilation rates. This guides the network to focus more on the object information in remote sensing images. Then, the dilated encoder (DE) is employed at the feature fusion stage to enhance the semantic information of the context and produce a feature map with multiple receptive fields. Finally, the log<sub>2</sub> loss function is applied to improve the training results. The experiments are being conducted on three publicly available remote sensing image datasets, and the results demonstrate that the proposed algorithm outperforms other algorithms in terms of detection performance on these datasets. Code is available at <span><span>https://github.com/sbsfsum/SM-and-MDCA</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"156 \",\"pages\":\"Article 104865\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200424004895\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200424004895","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

遥感图像提供了宝贵的信息,因此在许多生活领域变得越来越重要。然而,由于这些图像具有复杂多变的特征,如大小、比例和方向,因此检测这些图像中的物体仍然是一项艰巨的任务。此外,在实际应用中,对高效、快速检测方法的需求也在不断增长。因此,在本文中,我们提出了一种基于浅层复用和多尺度扩张卷积组合关注的遥感图像定向物体检测框架。为了实现轻量级网络结构,我们使用 ResNet18 作为骨干网络。首先,我们设计了浅层复用模块(SM),以提高网络浅层详细信息的利用率。它增强了浅层和深层之间的互动,从而更丰富地呈现网络特征。其次,提出了多尺度扩张卷积组合注意力模块(MDCA),通过使用不同扩张率的卷积来优先处理上下文信息。这将引导网络更加关注遥感图像中的物体信息。然后,在特征融合阶段采用扩张编码器(DE)来增强上下文的语义信息,并生成具有多个感受野的特征图。最后,应用 log2 损失函数来改善训练结果。实验在三个公开的遥感图像数据集上进行,结果表明,在这些数据集上,拟议算法的检测性能优于其他算法。代码见 https://github.com/sbsfsum/SM-and-MDCA。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Shallow multiplexing and multiscale dilation convolution combined attention based oriented object detection in remote sensing images
Remote sensing images are becoming increasingly important in many areas of life because of the valuable information they provide. However, detecting objects in these images remains a difficult task due to their complex and variable characteristics, such as size, scale, and orientation. Moreover, there is a growing demand for efficient and speedy detection methods in practical applications. Therefore, in this paper, we propose a framework for oriented object detection in remote sensing images based on shallow multiplexing and multiscale dilation convolution combined attention. To achieve a lightweight network structure, we utilize ResNet18 as the backbone network. First, a shallow multiplexing module (SM) is designed to improve the utilization of detailed information in the shallow layer of the network. It enhances the interaction between the shallow and deep layers, resulting in a richer representation of network features. Second, a multiscale dilation convolution combined attention module (MDCA) is proposed to prioritize contextual information by using convolution with different dilation rates. This guides the network to focus more on the object information in remote sensing images. Then, the dilated encoder (DE) is employed at the feature fusion stage to enhance the semantic information of the context and produce a feature map with multiple receptive fields. Finally, the log2 loss function is applied to improve the training results. The experiments are being conducted on three publicly available remote sensing image datasets, and the results demonstrate that the proposed algorithm outperforms other algorithms in terms of detection performance on these datasets. Code is available at https://github.com/sbsfsum/SM-and-MDCA.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Digital Signal Processing
Digital Signal Processing 工程技术-工程:电子与电气
CiteScore
5.30
自引率
17.20%
发文量
435
审稿时长
66 days
期刊介绍: Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,
期刊最新文献
Adaptive polarimetric persymmetric detection for distributed subspace targets in lognormal texture clutter MFFR-net: Multi-scale feature fusion and attentive recalibration network for deep neural speech enhancement PV-YOLO: A lightweight pedestrian and vehicle detection model based on improved YOLOv8 Efficient recurrent real video restoration IGGCN: Individual-guided graph convolution network for pedestrian trajectory prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1