CABDet:基于上下文和注意力的探测器,用于遥感图像中的小物体探测

IF 1.4 4区 地球科学 Q4 ENVIRONMENTAL SCIENCES Journal of Applied Remote Sensing Pub Date : 2023-10-01 DOI:10.1117/1.JRS.17.044515
Mingzhi Zhang, Xiaohai He, Qizhi Teng, Tong Niu, Honggang Chen
{"title":"CABDet:基于上下文和注意力的探测器,用于遥感图像中的小物体探测","authors":"Mingzhi Zhang, Xiaohai He, Qizhi Teng, Tong Niu, Honggang Chen","doi":"10.1117/1.JRS.17.044515","DOIUrl":null,"url":null,"abstract":"Abstract. Detecting small objects in remote sensing images is a challenging task. Existing object detectors for remote sensing images suffer from two issues: (1) insufficient feature extraction for small objects in the backbone network and (2) feature misalignment and information loss for small objects in the neck network, leading to poor detection performance on small objects. To address these challenges, a small object detector named CABDet for remote sensing images that combines context and attention mechanisms is proposed. Specifically, an enhanced ResNet50 is designed as a novel backbone network that adaptively adjusts the size of receptive fields to fully extract feature information of small objects. Additionally, an adaptive multiscale feature pyramid network (AM-FPN) is proposed. To alleviate the problem of feature misalignment for small objects, AM-FPN leverages self-attention mechanisms to establish semantic and spatial dependencies between adjacent feature layers. Then to mitigate the issue of information loss for small objects, AM-FPN captures semantic dependencies between subregions of current layer features through self-attention mechanisms to preserve channel information. Extensive experiments were conducted on two demanding remote sensing datasets, namely dataset for object detection in aerial images and UCAS-high resolution aerial object detection dataset, to demonstrate the effectiveness of the proposed methodology in achieving superior detection performance when compared with contemporary state-of-the-art approaches.","PeriodicalId":54879,"journal":{"name":"Journal of Applied Remote Sensing","volume":"168-169 1","pages":"044515 - 044515"},"PeriodicalIF":1.4000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CABDet: context-and-attention-based detector for small object detection in remote sensing images\",\"authors\":\"Mingzhi Zhang, Xiaohai He, Qizhi Teng, Tong Niu, Honggang Chen\",\"doi\":\"10.1117/1.JRS.17.044515\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract. Detecting small objects in remote sensing images is a challenging task. Existing object detectors for remote sensing images suffer from two issues: (1) insufficient feature extraction for small objects in the backbone network and (2) feature misalignment and information loss for small objects in the neck network, leading to poor detection performance on small objects. To address these challenges, a small object detector named CABDet for remote sensing images that combines context and attention mechanisms is proposed. Specifically, an enhanced ResNet50 is designed as a novel backbone network that adaptively adjusts the size of receptive fields to fully extract feature information of small objects. Additionally, an adaptive multiscale feature pyramid network (AM-FPN) is proposed. To alleviate the problem of feature misalignment for small objects, AM-FPN leverages self-attention mechanisms to establish semantic and spatial dependencies between adjacent feature layers. Then to mitigate the issue of information loss for small objects, AM-FPN captures semantic dependencies between subregions of current layer features through self-attention mechanisms to preserve channel information. Extensive experiments were conducted on two demanding remote sensing datasets, namely dataset for object detection in aerial images and UCAS-high resolution aerial object detection dataset, to demonstrate the effectiveness of the proposed methodology in achieving superior detection performance when compared with contemporary state-of-the-art approaches.\",\"PeriodicalId\":54879,\"journal\":{\"name\":\"Journal of Applied Remote Sensing\",\"volume\":\"168-169 1\",\"pages\":\"044515 - 044515\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2023-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Applied Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1117/1.JRS.17.044515\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1117/1.JRS.17.044515","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

摘要检测遥感图像中的小物体是一项具有挑战性的任务。现有的遥感图像小物体检测器存在两个问题:(1) 主干网络中的小物体特征提取不足;(2) 颈部网络中的小物体特征错位和信息丢失,导致小物体检测性能低下。为了应对这些挑战,我们提出了一种名为 CABDet 的遥感图像小物体检测器,它结合了上下文和注意力机制。具体来说,设计了一个增强型 ResNet50 作为新型骨干网络,它能自适应地调整感受野的大小,以充分提取小物体的特征信息。此外,还提出了自适应多尺度特征金字塔网络(AM-FPN)。为了缓解小物体的特征错位问题,AM-FPN 利用自注意机制在相邻特征层之间建立语义和空间依赖关系。然后,为了减轻小物体的信息丢失问题,AM-FPN 通过自注意机制捕捉当前层特征子区域之间的语义依赖关系,以保留信道信息。我们在两个要求较高的遥感数据集(即航空图像中的物体检测数据集和 UCAS 高分辨率航空物体检测数据集)上进行了广泛的实验,以证明与当代最先进的方法相比,所提出的方法在实现卓越检测性能方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CABDet: context-and-attention-based detector for small object detection in remote sensing images
Abstract. Detecting small objects in remote sensing images is a challenging task. Existing object detectors for remote sensing images suffer from two issues: (1) insufficient feature extraction for small objects in the backbone network and (2) feature misalignment and information loss for small objects in the neck network, leading to poor detection performance on small objects. To address these challenges, a small object detector named CABDet for remote sensing images that combines context and attention mechanisms is proposed. Specifically, an enhanced ResNet50 is designed as a novel backbone network that adaptively adjusts the size of receptive fields to fully extract feature information of small objects. Additionally, an adaptive multiscale feature pyramid network (AM-FPN) is proposed. To alleviate the problem of feature misalignment for small objects, AM-FPN leverages self-attention mechanisms to establish semantic and spatial dependencies between adjacent feature layers. Then to mitigate the issue of information loss for small objects, AM-FPN captures semantic dependencies between subregions of current layer features through self-attention mechanisms to preserve channel information. Extensive experiments were conducted on two demanding remote sensing datasets, namely dataset for object detection in aerial images and UCAS-high resolution aerial object detection dataset, to demonstrate the effectiveness of the proposed methodology in achieving superior detection performance when compared with contemporary state-of-the-art approaches.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Applied Remote Sensing
Journal of Applied Remote Sensing 环境科学-成像科学与照相技术
CiteScore
3.40
自引率
11.80%
发文量
194
审稿时长
3 months
期刊介绍: The Journal of Applied Remote Sensing is a peer-reviewed journal that optimizes the communication of concepts, information, and progress among the remote sensing community.
期刊最新文献
Monitoring soil moisture in cotton fields with synthetic aperture radar and optical data in arid and semi-arid regions Cascaded CNN and global–local attention transformer network-based semantic segmentation for high-resolution remote sensing image Coastal chlorophyll-a concentration estimation by fusion of Sentinel-2 multispectral instrument and in-situ hyperspectral data Spectral index for estimating leaf water content across diverse plant species using multiple viewing angles Optimal band selection using explainable artificial intelligence for machine learning-based hyperspectral image classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1