Syn2Real Domain Generalization for Underwater Mine-Like Object Detection Using Side-Scan Sonar

Aayush Agrawal;Aniruddh Sikdar;Rajini Makam;Suresh Sundaram;Suresh Kumar Besai;Mahesh Gopi
{"title":"Syn2Real Domain Generalization for Underwater Mine-Like Object Detection Using Side-Scan Sonar","authors":"Aayush Agrawal;Aniruddh Sikdar;Rajini Makam;Suresh Sundaram;Suresh Kumar Besai;Mahesh Gopi","doi":"10.1109/LGRS.2025.3550037","DOIUrl":null,"url":null,"abstract":"Underwater mine-like object (MLO) detection with deep learning suffers from limitations due to the scarcity of real-world side-scan sonar (SSS) data. This scarcity leads to overfitting, where models perform well on training data but poorly on unseen data. In this letter, we propose a synthetic to real (Syn2Real) domain generalization approach using diffusion models to address this challenge. Synthetic data generated by DDPM and DDIM models effectively enhances the training dataset. The residual noise in the final sampled images improves the model’s ability to generalize to real-world data with inherent noise and high variation. The baseline mask-region-based convolutional neural network (RCNN) model when trained on a combination of synthetic and original SSS training datasets, exhibited approximately a 35% increase in average precision (AP) compared to being trained solely on the original training data. This significant improvement highlights the potential of Syn2Real domain generalization for underwater mine detection.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10919029/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Underwater mine-like object (MLO) detection with deep learning suffers from limitations due to the scarcity of real-world side-scan sonar (SSS) data. This scarcity leads to overfitting, where models perform well on training data but poorly on unseen data. In this letter, we propose a synthetic to real (Syn2Real) domain generalization approach using diffusion models to address this challenge. Synthetic data generated by DDPM and DDIM models effectively enhances the training dataset. The residual noise in the final sampled images improves the model’s ability to generalize to real-world data with inherent noise and high variation. The baseline mask-region-based convolutional neural network (RCNN) model when trained on a combination of synthetic and original SSS training datasets, exhibited approximately a 35% increase in average precision (AP) compared to being trained solely on the original training data. This significant improvement highlights the potential of Syn2Real domain generalization for underwater mine detection.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Syn2Real域的侧扫声纳类水雷目标探测
基于深度学习的水下类地雷目标(MLO)探测受到现实世界侧扫声纳(SSS)数据稀缺的限制。这种稀缺性导致过拟合,即模型在训练数据上表现良好,但在未见数据上表现不佳。在这封信中,我们提出了一种使用扩散模型的合成到实(Syn2Real)域泛化方法来解决这一挑战。DDPM和DDIM模型生成的合成数据有效地增强了训练数据集。最终采样图像中的残余噪声提高了模型对具有固有噪声和高变化的真实数据的泛化能力。基线基于掩模区域的卷积神经网络(RCNN)模型在合成和原始SSS训练数据集的组合上训练时,与仅在原始训练数据上训练相比,平均精度(AP)提高了约35%。这一重大改进突出了Syn2Real域泛化用于水下水雷探测的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Multiclass Training Dataset and Hybrid Neural Network for Simultaneous Karst and Channel Detection An Improved Ground-Based GNSS-R Soil Moisture Retrieval Algorithm Incorporating Precipitation Effects MCD-YOLO: An Improved YOLOv11 Framework for Manhole Cover Detection in UAV Imagery Robust Recognition of Anomalous Distribution From Electrical Resistivity Tomography Dip-Guided Poststack Inversion via Structure-Tensor Regularization
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1