OptiSAR-Net:针对多源遥感数据的跨域船舶探测方法

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2024-11-19 DOI:10.1109/TGRS.2024.3502447
Jun Dong;Jiewen Feng;Xiaoyu Tang
{"title":"OptiSAR-Net:针对多源遥感数据的跨域船舶探测方法","authors":"Jun Dong;Jiewen Feng;Xiaoyu Tang","doi":"10.1109/TGRS.2024.3502447","DOIUrl":null,"url":null,"abstract":"Optical and synthetic aperture radar (SAR) remote sensing are crucial for ship detection. Integrating SAR’s all-weather imaging with optical data’s shape recognition enhances downstream applications. However, current cross-domain methods often use unsupervised or semi-supervised techniques for single-source detection, limiting their practical use in cross-domain ship detection. Inspired by human visual cortex mechanisms, this article proposes OptiSAR-Net, an end-to-end cross-domain multisource ship detection network. Specifically, OptiSAR-Net features dual adaptive attention (DAA) for extracting standard features from SAR and optical images, and bilevel routing deformable spatial pyramid pooling-fast (BSPPF) for adapting to multiscale changes. To mitigate SAR noise, we employ VoV-GSCSP with spatial shuffling attention (VSSA) in the neck. OptiSAR-Net achieved state-of-the-art average precisions (APs) of 88.6% and 91.3% on the optical datasets DOTA and HRSC2016, respectively, and showed strong performance on the SAR datasets HRSID and SSDD. On the cross-domain heterogeneous dataset (CDHD), OptiSAR-Net differentiated ship targets effectively with only 2.7 million parameters and 11.7 GFLOPs, achieving an inference speed of 89 FPS on an NVIDIA RTX 3090. These results demonstrate that cross-domain multisource detection significantly enhances performance and application potential compared to single-source detection. Code is available at \n<uri>https://github.com/SCNU-RISLAB/OptiSAR-Net</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"62 ","pages":"1-11"},"PeriodicalIF":8.6000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"OptiSAR-Net: A Cross-Domain Ship Detection Method for Multisource Remote Sensing Data\",\"authors\":\"Jun Dong;Jiewen Feng;Xiaoyu Tang\",\"doi\":\"10.1109/TGRS.2024.3502447\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Optical and synthetic aperture radar (SAR) remote sensing are crucial for ship detection. Integrating SAR’s all-weather imaging with optical data’s shape recognition enhances downstream applications. However, current cross-domain methods often use unsupervised or semi-supervised techniques for single-source detection, limiting their practical use in cross-domain ship detection. Inspired by human visual cortex mechanisms, this article proposes OptiSAR-Net, an end-to-end cross-domain multisource ship detection network. Specifically, OptiSAR-Net features dual adaptive attention (DAA) for extracting standard features from SAR and optical images, and bilevel routing deformable spatial pyramid pooling-fast (BSPPF) for adapting to multiscale changes. To mitigate SAR noise, we employ VoV-GSCSP with spatial shuffling attention (VSSA) in the neck. OptiSAR-Net achieved state-of-the-art average precisions (APs) of 88.6% and 91.3% on the optical datasets DOTA and HRSC2016, respectively, and showed strong performance on the SAR datasets HRSID and SSDD. On the cross-domain heterogeneous dataset (CDHD), OptiSAR-Net differentiated ship targets effectively with only 2.7 million parameters and 11.7 GFLOPs, achieving an inference speed of 89 FPS on an NVIDIA RTX 3090. These results demonstrate that cross-domain multisource detection significantly enhances performance and application potential compared to single-source detection. Code is available at \\n<uri>https://github.com/SCNU-RISLAB/OptiSAR-Net</uri>\\n.\",\"PeriodicalId\":13213,\"journal\":{\"name\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"volume\":\"62 \",\"pages\":\"1-11\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10757443/\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10757443/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

光学和合成孔径雷达(SAR)遥感是舰船探测的重要手段。将SAR的全天候成像与光学数据的形状识别相结合,增强了下游应用。然而,目前的跨域方法通常使用无监督或半监督技术进行单源检测,限制了它们在跨域船舶检测中的实际应用。受人类视觉皮层机制的启发,本文提出了端到端跨域多源船舶检测网络OptiSAR-Net。具体而言,OptiSAR-Net采用双自适应关注(DAA)技术从SAR和光学图像中提取标准特征,采用双层路由可变形空间金字塔池快速(BSPPF)技术适应多尺度变化。为了减轻SAR噪声,我们在颈部采用了带有空间变换注意(VSSA)的VoV-GSCSP。OptiSAR-Net在光学数据集DOTA和HRSC2016上分别达到了最先进的平均精度(ap) 88.6%和91.3%,在SAR数据集HRSID和SSDD上表现出色。在跨域异构数据集(CDHD)上,OptiSAR-Net仅使用270万个参数和11.7 GFLOPs就能有效地区分船舶目标,在NVIDIA RTX 3090上实现了89 FPS的推理速度。这些结果表明,与单源检测相比,跨域多源检测显著提高了性能和应用潜力。代码可从https://github.com/SCNU-RISLAB/OptiSAR-Net获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
OptiSAR-Net: A Cross-Domain Ship Detection Method for Multisource Remote Sensing Data
Optical and synthetic aperture radar (SAR) remote sensing are crucial for ship detection. Integrating SAR’s all-weather imaging with optical data’s shape recognition enhances downstream applications. However, current cross-domain methods often use unsupervised or semi-supervised techniques for single-source detection, limiting their practical use in cross-domain ship detection. Inspired by human visual cortex mechanisms, this article proposes OptiSAR-Net, an end-to-end cross-domain multisource ship detection network. Specifically, OptiSAR-Net features dual adaptive attention (DAA) for extracting standard features from SAR and optical images, and bilevel routing deformable spatial pyramid pooling-fast (BSPPF) for adapting to multiscale changes. To mitigate SAR noise, we employ VoV-GSCSP with spatial shuffling attention (VSSA) in the neck. OptiSAR-Net achieved state-of-the-art average precisions (APs) of 88.6% and 91.3% on the optical datasets DOTA and HRSC2016, respectively, and showed strong performance on the SAR datasets HRSID and SSDD. On the cross-domain heterogeneous dataset (CDHD), OptiSAR-Net differentiated ship targets effectively with only 2.7 million parameters and 11.7 GFLOPs, achieving an inference speed of 89 FPS on an NVIDIA RTX 3090. These results demonstrate that cross-domain multisource detection significantly enhances performance and application potential compared to single-source detection. Code is available at https://github.com/SCNU-RISLAB/OptiSAR-Net .
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Fine-Scale Structure Reconstruction of Weather Radar Echoes via Blind Super-Resolution Generalized Iterative Sparse Maximum Likelihood Algorithm for the Detection of Buried Targets Unsupervised Snowy-Weather Point Cloud Denoising via Two-Stage Filter-Network Collaboration Noise2Map: End-to-End Diffusion Model for Semantic Segmentation and Change Detection OAADet: One-stage Anchor-free Arbitrary Oriented Object Detector via Center-ness Shift Correction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1