首页 > 最新文献

Journal of the Acoustical Society of America最新文献

英文 中文
Subcortical encoding of harmonicity and frequency in autism: Insights from frequency-following responses. 自闭症中和声和频率的皮层下编码:来自频率跟随反应的见解。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042534
Tatiana Izmaylova, Jaime A Undurraga, Paul F Sowman

Autistic individuals often exhibit atypical sound perception, yet the specific acoustic properties involved-beyond sound intensity-remain unclear. This study examined whether subcortical processing of harmonicity and frequency differs in autistic children, potentially contributing to altered auditory experiences. Frequency-following responses (FFRs) were recorded from 15 autistic boys (ages 7-14) and nine age-matched neurotypical (NT) boys. In Experiment 1, participants heard three complex tones varying in their degree of harmonicity. No group differences emerged in FFRs to either the envelope or fine structure, suggesting comparable subcortical encoding of harmonicity. In Experiment 2, participants were presented with complex tones differing in the frequency of partials and/or fundamental frequency. Autistic children showed significantly weaker envelope responses across all stimuli and reduced fine structure responses around 500 Hz, while responses at lower and higher frequencies matched NT peers. These findings suggest that atypical subcortical encoding of frequency-specific information may contribute to altered sound perception in autism. Larger-scale studies are needed to confirm these results and connect them to behavioral measures.

自闭症患者通常表现出非典型的声音感知,但除了声强之外,具体的声学特性仍不清楚。这项研究调查了自闭症儿童的皮层下和声和频率处理是否不同,这可能导致听觉体验的改变。记录了15名自闭症男孩(7-14岁)和9名年龄匹配的神经正常男孩(NT)的频率跟随反应(FFRs)。在实验1中,参与者听到了三个复杂的音调,它们的和谐程度不同。无论是包膜还是精细结构,在ffr上都没有组间差异,这表明皮质下和声编码具有可比性。在实验2中,向参与者呈现了在偏频和/或基频频率上不同的复杂音调。自闭症儿童在所有刺激中表现出明显较弱的包络反应,在500 Hz左右的精细结构反应减少,而在较低和较高频率下的反应与NT同龄人一致。这些发现表明,非典型的皮层下频率特异性信息编码可能有助于自闭症患者声音感知的改变。需要更大规模的研究来证实这些结果,并将它们与行为测量联系起来。
{"title":"Subcortical encoding of harmonicity and frequency in autism: Insights from frequency-following responses.","authors":"Tatiana Izmaylova, Jaime A Undurraga, Paul F Sowman","doi":"10.1121/10.0042534","DOIUrl":"https://doi.org/10.1121/10.0042534","url":null,"abstract":"<p><p>Autistic individuals often exhibit atypical sound perception, yet the specific acoustic properties involved-beyond sound intensity-remain unclear. This study examined whether subcortical processing of harmonicity and frequency differs in autistic children, potentially contributing to altered auditory experiences. Frequency-following responses (FFRs) were recorded from 15 autistic boys (ages 7-14) and nine age-matched neurotypical (NT) boys. In Experiment 1, participants heard three complex tones varying in their degree of harmonicity. No group differences emerged in FFRs to either the envelope or fine structure, suggesting comparable subcortical encoding of harmonicity. In Experiment 2, participants were presented with complex tones differing in the frequency of partials and/or fundamental frequency. Autistic children showed significantly weaker envelope responses across all stimuli and reduced fine structure responses around 500 Hz, while responses at lower and higher frequencies matched NT peers. These findings suggest that atypical subcortical encoding of frequency-specific information may contribute to altered sound perception in autism. Larger-scale studies are needed to confirm these results and connect them to behavioral measures.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1630-1646"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146258466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The launch of statistical energy analysis simulation of double panels. 启动双面板统计能量分析仿真。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042351
Alexander Peiffer

The Reflections series takes a look back on historical articles from The Journal of the Acoustical Society of America that have had a significant impact on the science and practice of acoustics.

《反思》系列回顾了《美国声学学会杂志》上对声学科学和实践产生重大影响的历史文章。
{"title":"The launch of statistical energy analysis simulation of double panels.","authors":"Alexander Peiffer","doi":"10.1121/10.0042351","DOIUrl":"https://doi.org/10.1121/10.0042351","url":null,"abstract":"<p><p>The Reflections series takes a look back on historical articles from The Journal of the Acoustical Society of America that have had a significant impact on the science and practice of acoustics.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"R3-R4"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146105971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transarctic acoustic transmissions during the coordinated Arctic acoustic thermometry experiment in 2019-2020a). 2019-2020a年协调北极声测温实验期间的跨北极声传输
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042233
Matthew A Dzieciuch, Hanne Sagen, Peter F Worcester, Espen Storheim, F Hunter Akins, Stein Sandven, John A Colosi, John N Kemp, Geir Martin Leinebø

Acoustic signals with a center frequency of 35 Hz and a full bandwidth of about 4 Hz were transmitted over various ranges along a path extending from north of Svalbard to north of Alaska during the 2019-2020 US-Norwegian Coordinated Arctic Acoustic Thermometry Experiment (CAATEX). Three moorings were installed in the Canada Basin and three in the Nansen Basin, with one mooring in each basin hosting a source. All moorings had vertical receiving arrays, enabling spatial separation of the low-order acoustic normal modes. The modal group delays varied significantly over the year but were roughly consistent with predictions for the decade 2015-2022 based on the World Ocean Atlas 2023. The CAATEX signals traversed nearly the same trans-Arctic acoustic path as the 19.6-Hz signals in the 1994 Transarctic Acoustic Propagation (TAP) experiment. The TAP and CAATEX group delays cannot be directly compared because of the differing carrier frequencies. Thus, an indirect method using the group delays computed using WOA 2023 as a convenient standard was employed, but the large TAP mode-2 travel-time uncertainty precluded definitive comparisons. Nonetheless, CAATEX demonstrated that long-range acoustic transmissions provide precise, year-round measurements of large-scale ocean sound-speed (temperature) variability under the ice.

在2019-2020年美国-挪威协调北极声测温实验(CAATEX)期间,沿着从斯瓦尔巴群岛北部到阿拉斯加北部的路径在不同范围内传输了中心频率为35 Hz,全带宽约为4 Hz的声信号。在加拿大盆地安装了三个系泊,在南森盆地安装了三个系泊,每个盆地都有一个系泊。所有系泊都有垂直接收阵列,实现了低阶声正常模态的空间分离。模态组延迟在一年中变化很大,但与基于《2023年世界海洋地图集》的2015-2022年十年预测大致一致。CAATEX信号与1994年跨北极声学传播(TAP)实验中19.6 hz信号的跨北极声学路径几乎相同。由于载波频率不同,不能直接比较TAP和CAATEX组延迟。因此,采用了使用WOA 2023作为方便标准计算的群体延迟的间接方法,但较大的TAP模式2旅行时间不确定性妨碍了明确的比较。尽管如此,CAATEX证明了远程声波传输提供了精确的、全年的大规模冰下海洋声速(温度)变化测量。
{"title":"Transarctic acoustic transmissions during the coordinated Arctic acoustic thermometry experiment in 2019-2020a).","authors":"Matthew A Dzieciuch, Hanne Sagen, Peter F Worcester, Espen Storheim, F Hunter Akins, Stein Sandven, John A Colosi, John N Kemp, Geir Martin Leinebø","doi":"10.1121/10.0042233","DOIUrl":"https://doi.org/10.1121/10.0042233","url":null,"abstract":"<p><p>Acoustic signals with a center frequency of 35 Hz and a full bandwidth of about 4 Hz were transmitted over various ranges along a path extending from north of Svalbard to north of Alaska during the 2019-2020 US-Norwegian Coordinated Arctic Acoustic Thermometry Experiment (CAATEX). Three moorings were installed in the Canada Basin and three in the Nansen Basin, with one mooring in each basin hosting a source. All moorings had vertical receiving arrays, enabling spatial separation of the low-order acoustic normal modes. The modal group delays varied significantly over the year but were roughly consistent with predictions for the decade 2015-2022 based on the World Ocean Atlas 2023. The CAATEX signals traversed nearly the same trans-Arctic acoustic path as the 19.6-Hz signals in the 1994 Transarctic Acoustic Propagation (TAP) experiment. The TAP and CAATEX group delays cannot be directly compared because of the differing carrier frequencies. Thus, an indirect method using the group delays computed using WOA 2023 as a convenient standard was employed, but the large TAP mode-2 travel-time uncertainty precluded definitive comparisons. Nonetheless, CAATEX demonstrated that long-range acoustic transmissions provide precise, year-round measurements of large-scale ocean sound-speed (temperature) variability under the ice.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1071-1085"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146105996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classifying accuracy of fin whale range estimates from single seismic sensors. 单地震传感器估算长须鲸距离的分类精度。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042399
Andreia Pereira, Carolina Marques, Rose Hilmo, David K Mellinger, William S D Wilcock, Tiago A Marques, Danielle V Harris, Luis Matias

Ocean-bottom seismometers (OBSs) are used increasingly often to track baleen whale signals, employing single-station ranging techniques such as the three-component (3C) method. By using the orientation of ground motion from OBS components, the 3C method provides robust range estimates of direct-path signals within a validity range that relates to instrument depth. Consequently, the method requires a classification process to determine whether a signal falls within the validity range. Fin whale tracks, composed of 20-Hz notes from six locations, were used to develop and evaluate three classification models: decision trees (DTs), generalized additive models, and neural networks. Models were trained using different data combinations and incorporated a comprehensive set of variables related to channel amplitude, signal quality, polarization, and estimated signal angles. The DT achieved the highest performance, reaching an accuracy of 0.94 on the test data. Key variables for predicting the validity of the 3C ranges included the difference between observed horizontal-to-vertical amplitude ratios and its theoretical value, polarization metrics, and the amplitude of one horizontally oriented OBS component (Y-channel). The resulting framework contributes to improving the utility of seismic data for biological studies, which are critical for marine mammal monitoring and conservation strategies.

海底地震仪(OBSs)越来越多地用于跟踪须鲸的信号,采用单站测距技术,如三分量(3C)方法。通过使用OBS组件的地面运动方向,3C方法在与仪器深度相关的有效范围内提供了直接路径信号的鲁棒范围估计。因此,该方法需要一个分类过程来确定信号是否在有效范围内。由来自6个地点的20赫兹音符组成的长须鲸轨迹被用来开发和评估三种分类模型:决策树(dt)、广义加性模型和神经网络。模型使用不同的数据组合进行训练,并纳入了一系列与信道幅度、信号质量、极化和估计信号角度相关的变量。DT取得了最高的性能,在测试数据上达到了0.94的精度。预测3C范围有效性的关键变量包括观测到的水平与垂直振幅比与其理论值之差、极化指标和一个水平方向OBS分量(y通道)的振幅。由此产生的框架有助于提高地震数据在生物学研究中的效用,这对海洋哺乳动物的监测和保护策略至关重要。
{"title":"Classifying accuracy of fin whale range estimates from single seismic sensors.","authors":"Andreia Pereira, Carolina Marques, Rose Hilmo, David K Mellinger, William S D Wilcock, Tiago A Marques, Danielle V Harris, Luis Matias","doi":"10.1121/10.0042399","DOIUrl":"https://doi.org/10.1121/10.0042399","url":null,"abstract":"<p><p>Ocean-bottom seismometers (OBSs) are used increasingly often to track baleen whale signals, employing single-station ranging techniques such as the three-component (3C) method. By using the orientation of ground motion from OBS components, the 3C method provides robust range estimates of direct-path signals within a validity range that relates to instrument depth. Consequently, the method requires a classification process to determine whether a signal falls within the validity range. Fin whale tracks, composed of 20-Hz notes from six locations, were used to develop and evaluate three classification models: decision trees (DTs), generalized additive models, and neural networks. Models were trained using different data combinations and incorporated a comprehensive set of variables related to channel amplitude, signal quality, polarization, and estimated signal angles. The DT achieved the highest performance, reaching an accuracy of 0.94 on the test data. Key variables for predicting the validity of the 3C ranges included the difference between observed horizontal-to-vertical amplitude ratios and its theoretical value, polarization metrics, and the amplitude of one horizontally oriented OBS component (Y-channel). The resulting framework contributes to improving the utility of seismic data for biological studies, which are critical for marine mammal monitoring and conservation strategies.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1430-1445"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146180750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bulk absorber acoustic characterization via the two-cavity impedance tube methoda). 通过双腔阻抗管方法的体吸收体声学特性分析)。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042384
Anthony Ciletti, Janith Godakawela, Martha Brown, Bhisham Sharma

The primary goal of this study is to investigate and refine the two-cavity impedance tube method for acoustic characterization of bulk porous materials, specifically addressing previously unexplained inaccuracies in the prediction of surface impedance and absorption coefficients. Unlike the conventional two-thickness approach, the two-cavity method requires only one sample thickness and involves conducting measurements at various air cavity depths behind the sample. The initial analyses revealed previously unidentified numerical instabilities, resulting in anomalous predictions of sound absorption at specific frequencies. Through systematic investigation and use of calculated data, the numerical origins of these anomalies are uncovered and a practical solution, involving the careful selection of cavity depths, is presented. This approach significantly improves predictive accuracy, validating the two-cavity impedance tube method as a robust and effective tool for the acoustic characterization of a wide variety of porous materials, including metallic and nonmetallic open-cell foams and additively manufactured lattice structures.

本研究的主要目标是研究和完善双腔阻抗管方法,用于块状多孔材料的声学表征,特别是解决先前无法解释的表面阻抗和吸收系数预测中的不准确性。与传统的双厚度方法不同,双腔方法只需要一个样品厚度,并涉及在样品后面的不同空腔深度进行测量。最初的分析揭示了先前未确定的数值不稳定性,导致在特定频率下对声吸收的异常预测。通过系统的调查和使用计算数据,揭示了这些异常的数值根源,并提出了一个实用的解决方案,包括仔细选择空腔深度。该方法显著提高了预测精度,验证了双腔阻抗管方法是一种稳健有效的工具,可用于各种多孔材料的声学表征,包括金属和非金属开孔泡沫和增材制造的晶格结构。
{"title":"Bulk absorber acoustic characterization via the two-cavity impedance tube methoda).","authors":"Anthony Ciletti, Janith Godakawela, Martha Brown, Bhisham Sharma","doi":"10.1121/10.0042384","DOIUrl":"https://doi.org/10.1121/10.0042384","url":null,"abstract":"<p><p>The primary goal of this study is to investigate and refine the two-cavity impedance tube method for acoustic characterization of bulk porous materials, specifically addressing previously unexplained inaccuracies in the prediction of surface impedance and absorption coefficients. Unlike the conventional two-thickness approach, the two-cavity method requires only one sample thickness and involves conducting measurements at various air cavity depths behind the sample. The initial analyses revealed previously unidentified numerical instabilities, resulting in anomalous predictions of sound absorption at specific frequencies. Through systematic investigation and use of calculated data, the numerical origins of these anomalies are uncovered and a practical solution, involving the careful selection of cavity depths, is presented. This approach significantly improves predictive accuracy, validating the two-cavity impedance tube method as a robust and effective tool for the acoustic characterization of a wide variety of porous materials, including metallic and nonmetallic open-cell foams and additively manufactured lattice structures.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1138-1150"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146125434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the role of speech similarity in the detection of room acoustic differences. 语音相似度在室内声学差异检测中的作用。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042354
Thomas McKenzie, Nils Meyer-Kahlen, Sebastian J Schlecht

Spatial audio systems are typically evaluated in comparative listening tests using the same source signal for each condition {such as ABX: ITU-R BS.1116-3 [(2015a) Methods for the Subjective Assessment of Small Impairments in Audio Systems (International Telecommunication Union, Geneva, Switzerland)] and multiple stimulus with hidden reference and anchor ITU-R BS.1534-3 [(2015b) Methods for the Subjective Assessment of Intermediate Quality Level of Audio Systems (International Telecommunication Union, Geneva, Switzerland)]}. However, in augmented reality (AR) scenarios, it is infeasible that the same sound source would exist at the same position in space, both real and virtual; instead, each sound source will emit a different signal. To investigate this discrepancy, a perceptual study is conducted on the effect of source signal similarity when distinguishing different room acoustics conditions. Specifically, these conditions are binaural room impulse responses measured at different distances from the source, modified to all use the same direct sound. Three classes of source signal are investigated in a three-alternative forced choice paradigm: the same speech signal for all conditions, the same speaker but a different sentence for each condition, and a different speaker and a different sentence for each condition. Results show that using different speech recordings significantly reduces the ability to identify differences in room acoustics. This suggests that spatial audio system fidelity requirements could vary depending on the source signals used in the target application; AR audio evaluation should use different signals for comparisons.

空间音频系统通常在比较听力测试中进行评估,在每种条件下使用相同的源信号{例如ABX:ITU-R BS.1116-3 [(2015a)音频系统小损伤主观评估方法(国际电信联盟,日内瓦,瑞士)]和隐藏参考和锚定多重刺激[(2015b)音频系统中间质量水平主观评估方法(国际电信联盟,日内瓦,瑞士)]}。然而,在增强现实(AR)场景中,同一声源存在于空间的同一位置是不可能的,无论是真实的还是虚拟的;相反,每个声源会发出不同的信号。为了研究这种差异,在区分不同房间声学条件时,对源信号相似性的影响进行了感性研究。具体来说,这些条件是在距离声源不同距离处测量的双耳房间脉冲响应,修改后都使用相同的直接声音。在三选项强迫选择范式中研究了三类源信号:所有条件下的相同语音信号,相同的说话者但每种条件下的不同句子,以及每种条件下的不同说话者和不同句子。结果表明,使用不同的语音记录会显著降低识别室内声学差异的能力。这表明空间音频系统保真度要求可能会根据目标应用中使用的源信号而变化;AR音频评估应使用不同的信号进行比较。
{"title":"On the role of speech similarity in the detection of room acoustic differences.","authors":"Thomas McKenzie, Nils Meyer-Kahlen, Sebastian J Schlecht","doi":"10.1121/10.0042354","DOIUrl":"https://doi.org/10.1121/10.0042354","url":null,"abstract":"<p><p>Spatial audio systems are typically evaluated in comparative listening tests using the same source signal for each condition {such as ABX: ITU-R BS.1116-3 [(2015a) Methods for the Subjective Assessment of Small Impairments in Audio Systems (International Telecommunication Union, Geneva, Switzerland)] and multiple stimulus with hidden reference and anchor ITU-R BS.1534-3 [(2015b) Methods for the Subjective Assessment of Intermediate Quality Level of Audio Systems (International Telecommunication Union, Geneva, Switzerland)]}. However, in augmented reality (AR) scenarios, it is infeasible that the same sound source would exist at the same position in space, both real and virtual; instead, each sound source will emit a different signal. To investigate this discrepancy, a perceptual study is conducted on the effect of source signal similarity when distinguishing different room acoustics conditions. Specifically, these conditions are binaural room impulse responses measured at different distances from the source, modified to all use the same direct sound. Three classes of source signal are investigated in a three-alternative forced choice paradigm: the same speech signal for all conditions, the same speaker but a different sentence for each condition, and a different speaker and a different sentence for each condition. Results show that using different speech recordings significantly reduces the ability to identify differences in room acoustics. This suggests that spatial audio system fidelity requirements could vary depending on the source signals used in the target application; AR audio evaluation should use different signals for comparisons.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1373-1384"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146165359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep learning framework for four-dimensional ocean sound speed field prediction. 四维海洋声速场预测的深度学习框架。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042423
Yingjie Li, Jixing Qin, Shuanglin Wu, Kang Zheng, Haiqiang Niu, Zhaohui Peng

The prediction of ocean sound speed fields (SSFs) is critical for underwater communication, marine resource exploration, and environmental monitoring. Due to the powerful generalization ability, deep learning technology has demonstrated its advantages in SSF prediction. However, limited by the processing capabilities of high-dimensional data, current research can only realize the three-dimensional characteristic extraction, without capturing the complete spatiotemporal information of SSF. In this work, we propose the Swin Transformer-UNet model (ST-UNet), which combines the convolutional networks U-Net and Swin Transformer networks, to approach the four-dimensional prediction of SSF. In this model, Swin Transformer network is applied to extract spatiotemporal characteristics through the multi-head self-attention mechanism, while U-Net enhances spatial details via the convolutional feature recovery. The availability and accuracy of the model are demonstrated by the real-life dataset from the South China Sea. It achieves a root mean square error of 0.783 m/s for 24-h SSF prediction based on 7-day historical data, outperforming baseline architectures by 33%-72%.

海洋声速场的预测对水下通信、海洋资源勘探和环境监测具有重要意义。由于强大的泛化能力,深度学习技术在SSF预测中已经显示出优势。然而,受限于高维数据的处理能力,目前的研究只能实现三维特征提取,无法捕捉到完整的SSF时空信息。在这项工作中,我们提出了Swin变压器- unet模型(ST-UNet),它结合了卷积网络U-Net和Swin变压器网络,来接近SSF的四维预测。在该模型中,Swin Transformer网络通过多头自注意机制提取时空特征,U-Net通过卷积特征恢复增强空间细节。通过南海实测数据验证了该模型的有效性和准确性。基于7天历史数据的24小时SSF预测的均方根误差为0.783 m/s,优于基准架构33%-72%。
{"title":"A deep learning framework for four-dimensional ocean sound speed field prediction.","authors":"Yingjie Li, Jixing Qin, Shuanglin Wu, Kang Zheng, Haiqiang Niu, Zhaohui Peng","doi":"10.1121/10.0042423","DOIUrl":"https://doi.org/10.1121/10.0042423","url":null,"abstract":"<p><p>The prediction of ocean sound speed fields (SSFs) is critical for underwater communication, marine resource exploration, and environmental monitoring. Due to the powerful generalization ability, deep learning technology has demonstrated its advantages in SSF prediction. However, limited by the processing capabilities of high-dimensional data, current research can only realize the three-dimensional characteristic extraction, without capturing the complete spatiotemporal information of SSF. In this work, we propose the Swin Transformer-UNet model (ST-UNet), which combines the convolutional networks U-Net and Swin Transformer networks, to approach the four-dimensional prediction of SSF. In this model, Swin Transformer network is applied to extract spatiotemporal characteristics through the multi-head self-attention mechanism, while U-Net enhances spatial details via the convolutional feature recovery. The availability and accuracy of the model are demonstrated by the real-life dataset from the South China Sea. It achieves a root mean square error of 0.783 m/s for 24-h SSF prediction based on 7-day historical data, outperforming baseline architectures by 33%-72%.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1400-1415"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146165863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-complexity iterative receiver based on variational Bayesian inference for multiple-input multiple-output underwater acoustic communication. 基于变分贝叶斯推理的多输入多输出水声通信低复杂度迭代接收机。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042465
Wei-Zhe Li, Xiao Han, Yi-Zhen Jia, Zheng Wu, Jing-Wei Yin

The design of high-performance receivers with manageable complexity is crucial for underwater acoustic communication, especially in multiple-input multiple-output (MIMO) scenarios. In this paper, a low-complexity MIMO receiver, which is based on variational Bayesian inference (VBI), is proposed. First, an iterative channel estimation model is constructed, which is based on VBI, the high-dimensional channel vector in the MIMO system is decomposed into a parallel connection of multiple low-dimensional channel vectors with different sparsity, and the prior distribution of the low-dimensional channel vectors is jointly modeled using temporal correlation (TC) and sparsity. Next, the low-complexity vector approximate message passing (VAMP) technique is integrated into the VBI framework and a channel estimation method is derived, based on TC-VAMP-VBI. Finally, to reduce the complexity of MIMO channel equalization, a serial iterative equalization algorithm is proposed, which incorporates passive time reversal under the VBI framework. The proposed algorithm was validated using simulations and MIMO communication data collected from two field experiments. The results show that the proposed algorithm can significantly reduce the computational complexity of the MIMO system while maintaining robustness of channel estimation in short data block scenarios.

设计可管理的高性能接收机是水声通信的关键,特别是在多输入多输出(MIMO)场景下。本文提出了一种基于变分贝叶斯推理(VBI)的低复杂度MIMO接收机。首先,构建基于VBI的迭代信道估计模型,将MIMO系统中的高维信道向量分解为多个不同稀疏度的低维信道向量的并行连接,并利用时间相关和稀疏度联合建模低维信道向量的先验分布;然后,将低复杂度的向量近似消息传递(VAMP)技术集成到VBI框架中,推导出基于TC-VAMP-VBI的信道估计方法。最后,为了降低MIMO信道均衡的复杂度,提出了一种在VBI框架下结合被动时间反转的串行迭代均衡算法。通过仿真和MIMO通信数据验证了该算法的有效性。结果表明,该算法在保持短数据块信道估计鲁棒性的同时,显著降低了MIMO系统的计算复杂度。
{"title":"Low-complexity iterative receiver based on variational Bayesian inference for multiple-input multiple-output underwater acoustic communication.","authors":"Wei-Zhe Li, Xiao Han, Yi-Zhen Jia, Zheng Wu, Jing-Wei Yin","doi":"10.1121/10.0042465","DOIUrl":"https://doi.org/10.1121/10.0042465","url":null,"abstract":"<p><p>The design of high-performance receivers with manageable complexity is crucial for underwater acoustic communication, especially in multiple-input multiple-output (MIMO) scenarios. In this paper, a low-complexity MIMO receiver, which is based on variational Bayesian inference (VBI), is proposed. First, an iterative channel estimation model is constructed, which is based on VBI, the high-dimensional channel vector in the MIMO system is decomposed into a parallel connection of multiple low-dimensional channel vectors with different sparsity, and the prior distribution of the low-dimensional channel vectors is jointly modeled using temporal correlation (TC) and sparsity. Next, the low-complexity vector approximate message passing (VAMP) technique is integrated into the VBI framework and a channel estimation method is derived, based on TC-VAMP-VBI. Finally, to reduce the complexity of MIMO channel equalization, a serial iterative equalization algorithm is proposed, which incorporates passive time reversal under the VBI framework. The proposed algorithm was validated using simulations and MIMO communication data collected from two field experiments. The results show that the proposed algorithm can significantly reduce the computational complexity of the MIMO system while maintaining robustness of channel estimation in short data block scenarios.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1512-1528"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146180725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic multi-task neural network for end-to-end prediction of deep-sea acoustic transmission loss. 基于动态多任务神经网络的深海声传输损耗端到端预测。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042428
Yongxian Wang, Zhao Sun, Houwang Tu

Accurate and efficient prediction of deep-sea acoustic transmission loss is essential for underwater applications, such as sonar design and underwater communication. In practical acoustic field computation tasks, it is often necessary to analyze the spatial distribution characteristics of the acoustic field in different regions. However, traditional numerical models require large-scale simulations on dense grids to predict acoustic fields over multiple spatial ranges, resulting in high computational cost. To address these limitations, this study proposes a dynamic multi-task U-Net (DMT-UNet) neural network model. Built on a multi-task learning framework, the model can dynamically adjust its network structure for end-to-end joint modeling of acoustic transmission loss at different computation ranges. Incorporating source information and deep-sea sound speed profiles as inputs enhances adaptability to model complex environments. DMT-UNet achieves an average root mean square error of approximately 1.6 dB on simulated deep-sea acoustic field datasets, with computational efficiency improved by more than 98.8% compared to traditional numerical models. Gradient-based visualization reveals how the model reconstructs acoustic field distributions during decoding, demonstrating interpretability. Experimental results show that DMT-UNet yields high prediction accuracy and computational efficiency, while maintaining consistency with physical laws and generalization capability. Thus, DMT-UNet enables real-time, multi-task acoustic modeling in complex deep-sea environments.

准确、高效地预测深海声传输损耗对声纳设计和水下通信等水下应用至关重要。在实际声场计算任务中,往往需要分析不同区域声场的空间分布特征。然而,传统的数值模型需要在密集网格上进行大规模模拟,以预测多个空间范围的声场,计算成本高。为了解决这些限制,本研究提出了一个动态多任务U-Net (DMT-UNet)神经网络模型。该模型建立在多任务学习框架上,可以动态调整网络结构,实现不同计算范围声传输损失的端到端联合建模。将声源信息和深海声速剖面作为输入增强了模型对复杂环境的适应性。DMT-UNet在模拟深海声场数据集上的平均均方根误差约为1.6 dB,计算效率比传统数值模型提高了98.8%以上。基于梯度的可视化揭示了模型在解码过程中如何重建声场分布,并展示了可解释性。实验结果表明,DMT-UNet具有较高的预测精度和计算效率,同时保持了与物理规律的一致性和泛化能力。因此,DMT-UNet能够在复杂的深海环境中实现实时、多任务声学建模。
{"title":"Dynamic multi-task neural network for end-to-end prediction of deep-sea acoustic transmission loss.","authors":"Yongxian Wang, Zhao Sun, Houwang Tu","doi":"10.1121/10.0042428","DOIUrl":"https://doi.org/10.1121/10.0042428","url":null,"abstract":"<p><p>Accurate and efficient prediction of deep-sea acoustic transmission loss is essential for underwater applications, such as sonar design and underwater communication. In practical acoustic field computation tasks, it is often necessary to analyze the spatial distribution characteristics of the acoustic field in different regions. However, traditional numerical models require large-scale simulations on dense grids to predict acoustic fields over multiple spatial ranges, resulting in high computational cost. To address these limitations, this study proposes a dynamic multi-task U-Net (DMT-UNet) neural network model. Built on a multi-task learning framework, the model can dynamically adjust its network structure for end-to-end joint modeling of acoustic transmission loss at different computation ranges. Incorporating source information and deep-sea sound speed profiles as inputs enhances adaptability to model complex environments. DMT-UNet achieves an average root mean square error of approximately 1.6 dB on simulated deep-sea acoustic field datasets, with computational efficiency improved by more than 98.8% compared to traditional numerical models. Gradient-based visualization reveals how the model reconstructs acoustic field distributions during decoding, demonstrating interpretability. Experimental results show that DMT-UNet yields high prediction accuracy and computational efficiency, while maintaining consistency with physical laws and generalization capability. Thus, DMT-UNet enables real-time, multi-task acoustic modeling in complex deep-sea environments.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1555-1569"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146213172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wave propagation in an elastic lattice with nonreciprocal stiffness and engineered dampinga). 具有非倒易刚度和工程阻尼的弹性晶格中的波传播a)。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042349
Harshit Kumar Sandhu, Saurav Dutta, Rajesh Chaunsali

Nonreciprocal wave propagation allows for directional energy transport. In this work, wave dynamics is systematically investigated in an elastic lattice that combines nonreciprocal stiffness with viscous damping. After establishing how conventional damping counteracts the system's gain, a non-dissipative form of nonreciprocal damping in the form of gyroscopic damping is introduced. It is found that the coexistence of nonreciprocal stiffness and nonreciprocal damping results in a decoupled control mechanism. The nonreciprocal stiffness is shown to govern the temporal amplification rate, whereas the nonreciprocal damper independently tunes the wave's group velocity and oscillation frequency. This decoupling gives rise to phenomena such as the enhancement of net amplification for slower-propagating waves and also boundary-induced wave interference arising from divergent and convergent reflected wave trajectories with varying growth rates. These findings provide a theoretical framework for designing active metamaterials with more versatile control over their wave propagation characteristics.

非互反波传播允许定向能量传输。在这项工作中,波浪动力学系统地研究了一个弹性晶格,结合了非倒易刚度和粘性阻尼。在确定了传统阻尼如何抵消系统增益后,介绍了一种非耗散形式的非倒易阻尼,即陀螺阻尼。研究发现,非倒易刚度和非倒易阻尼的共存导致了一种解耦控制机制。非倒易刚度控制时间放大率,而非倒易阻尼器独立调节波的群速度和振荡频率。这种解耦产生了一些现象,如慢传播波的净放大增强,以及由不同增长率的发散和收敛反射波轨迹引起的边界诱导波干涉。这些发现为设计具有更多功能控制其波传播特性的活性超材料提供了理论框架。
{"title":"Wave propagation in an elastic lattice with nonreciprocal stiffness and engineered dampinga).","authors":"Harshit Kumar Sandhu, Saurav Dutta, Rajesh Chaunsali","doi":"10.1121/10.0042349","DOIUrl":"https://doi.org/10.1121/10.0042349","url":null,"abstract":"<p><p>Nonreciprocal wave propagation allows for directional energy transport. In this work, wave dynamics is systematically investigated in an elastic lattice that combines nonreciprocal stiffness with viscous damping. After establishing how conventional damping counteracts the system's gain, a non-dissipative form of nonreciprocal damping in the form of gyroscopic damping is introduced. It is found that the coexistence of nonreciprocal stiffness and nonreciprocal damping results in a decoupled control mechanism. The nonreciprocal stiffness is shown to govern the temporal amplification rate, whereas the nonreciprocal damper independently tunes the wave's group velocity and oscillation frequency. This decoupling gives rise to phenomena such as the enhancement of net amplification for slower-propagating waves and also boundary-induced wave interference arising from divergent and convergent reflected wave trajectories with varying growth rates. These findings provide a theoretical framework for designing active metamaterials with more versatile control over their wave propagation characteristics.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"978-993"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146106012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Acoustical Society of America
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1