Underwater acoustic classification using wavelet scattering transform and convolutional neural network with limited dataset

IF 3.4 2区 物理与天体物理 Q1 ACOUSTICS Applied Acoustics Pub Date : 2025-03-15 Epub Date: 2025-01-31 DOI:10.1016/j.apacoust.2025.110564
Yongxiang Liu , Biqi Zhang , Fantong Kong , Biao Wang , Chengming Luo , Lin Ma
{"title":"Underwater acoustic classification using wavelet scattering transform and convolutional neural network with limited dataset","authors":"Yongxiang Liu ,&nbsp;Biqi Zhang ,&nbsp;Fantong Kong ,&nbsp;Biao Wang ,&nbsp;Chengming Luo ,&nbsp;Lin Ma","doi":"10.1016/j.apacoust.2025.110564","DOIUrl":null,"url":null,"abstract":"<div><div>Underwater acoustic signal classification plays a pivotal role in maritime applications, requiring accurately identifying various acoustic sources in complex underwater environments. While deep learning has substantially enhanced performance in this domain, its success is often contingent on hand-crafted input features and intricate network architectures. The paper presents a novel method for classifying underwater acoustic signals by integrating the Wavelet Scattering Transform (WST) with Attention-augmented Convolutional Neural Networks (CNNs). The WST, based on wavelet analysis, effectively extracts multiscale features while retaining crucial time-frequency information, offering translation invariance and reducing the dependency on large training datasets. Furthermore, incorporating ResNet-18 with an attention mechanism improves extracted features by capturing richer semantic information, even from limited training data. The method was evaluated on the ShipsEar dataset, utilizing only 8.5% of samples for training, 1.5% for validation, and the remaining 90% for testing. Our approach achieved a classification accuracy of 0.93, surpassing the traditional Mel spectrogram with ResNet-18 by 9.8%. These results underscore the effectiveness of the proposed method in handling challenging underwater acoustic environments with limited training data.</div></div>","PeriodicalId":55506,"journal":{"name":"Applied Acoustics","volume":"232 ","pages":"Article 110564"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Acoustics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003682X25000362","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/31 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Underwater acoustic signal classification plays a pivotal role in maritime applications, requiring accurately identifying various acoustic sources in complex underwater environments. While deep learning has substantially enhanced performance in this domain, its success is often contingent on hand-crafted input features and intricate network architectures. The paper presents a novel method for classifying underwater acoustic signals by integrating the Wavelet Scattering Transform (WST) with Attention-augmented Convolutional Neural Networks (CNNs). The WST, based on wavelet analysis, effectively extracts multiscale features while retaining crucial time-frequency information, offering translation invariance and reducing the dependency on large training datasets. Furthermore, incorporating ResNet-18 with an attention mechanism improves extracted features by capturing richer semantic information, even from limited training data. The method was evaluated on the ShipsEar dataset, utilizing only 8.5% of samples for training, 1.5% for validation, and the remaining 90% for testing. Our approach achieved a classification accuracy of 0.93, surpassing the traditional Mel spectrogram with ResNet-18 by 9.8%. These results underscore the effectiveness of the proposed method in handling challenging underwater acoustic environments with limited training data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于小波散射变换和卷积神经网络的有限数据集水声分类
水声信号分类在海事应用中起着举足轻重的作用,需要在复杂的水下环境中准确识别各种声源。虽然深度学习大大提高了该领域的性能,但它的成功往往取决于手工制作的输入特征和复杂的网络架构。提出了一种将小波散射变换(WST)与注意增强卷积神经网络(cnn)相结合的水声信号分类新方法。WST基于小波分析,有效提取多尺度特征,同时保留关键时频信息,提供平移不变性,减少对大型训练数据集的依赖。此外,将ResNet-18与注意力机制相结合,即使从有限的训练数据中也能捕获更丰富的语义信息,从而改进提取的特征。该方法在ShipsEar数据集上进行了评估,仅使用8.5%的样本进行训练,1.5%用于验证,其余90%用于测试。我们的方法实现了0.93的分类精度,比传统的基于ResNet-18的Mel谱图高出9.8%。这些结果强调了该方法在训练数据有限的情况下处理具有挑战性的水声环境的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Acoustics
Applied Acoustics 物理-声学
CiteScore
7.40
自引率
11.80%
发文量
618
审稿时长
7.5 months
期刊介绍: Since its launch in 1968, Applied Acoustics has been publishing high quality research papers providing state-of-the-art coverage of research findings for engineers and scientists involved in applications of acoustics in the widest sense. Applied Acoustics looks not only at recent developments in the understanding of acoustics but also at ways of exploiting that understanding. The Journal aims to encourage the exchange of practical experience through publication and in so doing creates a fund of technological information that can be used for solving related problems. The presentation of information in graphical or tabular form is especially encouraged. If a report of a mathematical development is a necessary part of a paper it is important to ensure that it is there only as an integral part of a practical solution to a problem and is supported by data. Applied Acoustics encourages the exchange of practical experience in the following ways: • Complete Papers • Short Technical Notes • Review Articles; and thereby provides a wealth of technological information that can be used to solve related problems. Manuscripts that address all fields of applications of acoustics ranging from medicine and NDT to the environment and buildings are welcome.
期刊最新文献
Competitive noise generation behaviors and mechanisms of variable-displacement multi-row radial piston hydraulic motors Perceptual characterisation as a key approach for identifying acoustic factors relevant to community annoyance from helicopter noise A multi-sensing method using low‑acoustic‑impedance 1–3 piezoelectric composites for ultrasonic and electrical impedance measurements with single device Tuning the sound characteristics of thermal evaporation humidifier with broadband acoustic metamaterial absorbers Investigation into the performance of ultrasonic transducers under thermo-mechanically coupled loads
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1