A diffusion-based super resolution model for enhancing sonar images.

IF 2.3 | Zone 2 (Physics and Astrophysics) | JCR Q2 (Acoustics) | Journal of the Acoustical Society of America | Pub Date: 2025-01-01 | DOI: 10.1121/10.0034882
Oscar Bryan, Thibaud Berthomier, Benoit D'Ales, Thomas Furfaro, Tom S F Haines, Yan Pailhas, Alan Hunter
Journal of the Acoustical Society of America, vol. 157, no. 1, pp. 509-518. Publication type: Journal Article.
Citation count: 0

Abstract

Improved hardware and processing techniques such as synthetic aperture sonar have led to imaging sonar with centimeter resolution. However, practical limitations and old systems limit the resolution in modern and legacy datasets. This study proposes using single image super resolution based on a conditioned diffusion model to map between images at different resolutions. This approach focuses on upscaling legacy, low-resolution sonar datasets to enable backward compatibility with newer, high-resolution datasets, thus creating a unified dataset for machine learning applications. The study demonstrates improved performance for classifying upscaled images without increasing the probability of false detection. The increased probability of detection was 7% compared to bicubic interpolation, 6% compared to convolutional neural networks, and 2% compared to generative adversarial networks. The study also proposes two sonar specific evaluation metrics based on acoustic physics and utility to automatic target recognition.
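
The abstract does not describe the network architecture or training setup, so the following is only a minimal sketch of how a conditioned diffusion model for single image super resolution is commonly set up, not the authors' implementation. It assumes a standard linear noise schedule, a random array standing in for a high-resolution sonar tile, and nearest-neighbour upsampling of the low-resolution image as the conditioning signal. All function names and parameter values here are hypothetical.

```python
import numpy as np

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=2e-2):
    """Linear noise schedule; the start/end values are illustrative assumptions."""
    return np.linspace(beta_start, beta_end, num_steps)

def forward_noise(x0, t, alpha_bar, rng):
    """Sample x_t from q(x_t | x_0) = N(sqrt(a_bar_t) * x_0, (1 - a_bar_t) * I)."""
    eps = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return x_t, eps

def upsample_nearest(low_res, factor):
    """Nearest-neighbour upsampling, a stand-in for any interpolation method."""
    return np.kron(low_res, np.ones((factor, factor)))

def denoiser_input(x_t, low_res, factor):
    """Stack the noisy high-res image with the upsampled low-res condition."""
    cond = upsample_nearest(low_res, factor)
    return np.stack([x_t, cond], axis=0)  # shape (2, H, W)

rng = np.random.default_rng(0)
betas = linear_beta_schedule(1000)
alpha_bar = np.cumprod(1.0 - betas)  # cumulative signal-retention factor

# Placeholder data: a random "high-resolution" tile and a 4x block-averaged copy.
high_res = rng.random((64, 64))
low_res = high_res.reshape(16, 4, 16, 4).mean(axis=(1, 3))

# Noise the high-res image to step t; a denoiser would be trained to predict eps
# from net_in, which carries the low-res image as a second channel.
x_t, eps = forward_noise(high_res, t=500, alpha_bar=alpha_bar, rng=rng)
net_in = denoiser_input(x_t, low_res, factor=4)
print(net_in.shape)
```

In a setup like this, the denoising network sees the low-resolution image at every step, which is what makes the model "conditioned": sampling starts from pure noise and is steered toward a high-resolution image consistent with the low-resolution input.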

Source journal
CiteScore: 4.60
Self-citation rate: 16.70%
Articles published: 1433
Review time: 4.7 months
About the journal: Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.
Latest articles from this journal
Erratum: Effect of ambisonic order on spatial release from masking [J. Acoust. Soc. Am. 156(4), 2169-2176 (2024)].
Is pitch a smooth function of frequency? Evidence from octave adjustments.
Multiple ultrasound image generation based on tuned alignment of amplitude hologram over spatially non-uniform ultrasound source.
The image model applied to concert halls.
Propeller self-noise suppression algorithm for unmanned underwater vehicles based on a two-stage denoising-inpainting framework.