A diffusion-based super resolution model for enhancing sonar images
Oscar Bryan, Thibaud Berthomier, Benoit D'Ales, Thomas Furfaro, Tom S F Haines, Yan Pailhas, Alan Hunter
Journal of the Acoustical Society of America, 157(1), 509-518. Published 2025-01-01. DOI: 10.1121/10.0034882 (https://doi.org/10.1121/10.0034882)
Abstract
Improved hardware and processing techniques such as synthetic aperture sonar have led to imaging sonar with centimeter resolution. However, practical limitations and older systems limit the resolution in modern and legacy datasets. This study proposes using single-image super-resolution based on a conditioned diffusion model to map between images at different resolutions. This approach focuses on upscaling legacy, low-resolution sonar datasets to enable backward compatibility with newer, high-resolution datasets, thus creating a unified dataset for machine learning applications. The study demonstrates improved performance for classifying upscaled images without increasing the probability of false detection. The probability of detection increased by 7% compared to bicubic interpolation, 6% compared to convolutional neural networks, and 2% compared to generative adversarial networks. The study also proposes two sonar-specific evaluation metrics based on acoustic physics and utility to automatic target recognition.
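The classification results in the abstract are reported as a probability of detection alongside a probability of false detection. As a rough illustration of how such rates might be computed from labelled classifier outputs, the sketch below assumes the common definitions (detections divided by true targets, false detections divided by non-targets); the study may define them differently. The function name `detection_rates` and the toy labels are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple


def detection_rates(truth: Sequence[bool], predicted: Sequence[bool]) -> Tuple[float, float]:
    """Return (probability of detection, probability of false detection).

    Assumed definitions, not confirmed by the source:
      Pd  = true positives  / number of real targets
      Pfa = false positives / number of non-targets
    """
    if len(truth) != len(predicted):
        raise ValueError("truth and predicted must have the same length")
    true_pos = sum(1 for t, p in zip(truth, predicted) if t and p)
    false_pos = sum(1 for t, p in zip(truth, predicted) if not t and p)
    n_targets = sum(1 for t in truth if t)
    n_non_targets = len(truth) - n_targets
    if n_targets == 0 or n_non_targets == 0:
        raise ValueError("need at least one target and one non-target sample")
    return true_pos / n_targets, false_pos / n_non_targets


# Toy labels invented purely for illustration (not data from the study).
truth = [True, True, True, True, False, False, False, False]
baseline_pred = [True, True, False, False, True, False, False, False]
upscaled_pred = [True, True, True, False, True, False, False, False]

pd_base, pfa_base = detection_rates(truth, baseline_pred)
pd_up, pfa_up = detection_rates(truth, upscaled_pred)
print(f"baseline: Pd={pd_base:.2f}, Pfa={pfa_base:.2f}")
print(f"upscaled: Pd={pd_up:.2f}, Pfa={pfa_up:.2f}")
```

In this toy case the second classifier input raises the detection rate while the false detection rate stays the same, which is the kind of comparison the abstract describes between upscaling methods.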
About the journal:
Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.