首页 > 最新文献

Journal of the Acoustical Society of America最新文献

英文 中文
Classifying accuracy of fin whale range estimates from single seismic sensors. 单地震传感器估算长须鲸距离的分类精度。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042399
Andreia Pereira, Carolina Marques, Rose Hilmo, David K Mellinger, William S D Wilcock, Tiago A Marques, Danielle V Harris, Luis Matias

Ocean-bottom seismometers (OBSs) are used increasingly often to track baleen whale signals, employing single-station ranging techniques such as the three-component (3C) method. By using the orientation of ground motion from OBS components, the 3C method provides robust range estimates of direct-path signals within a validity range that relates to instrument depth. Consequently, the method requires a classification process to determine whether a signal falls within the validity range. Fin whale tracks, composed of 20-Hz notes from six locations, were used to develop and evaluate three classification models: decision trees (DTs), generalized additive models, and neural networks. Models were trained using different data combinations and incorporated a comprehensive set of variables related to channel amplitude, signal quality, polarization, and estimated signal angles. The DT achieved the highest performance, reaching an accuracy of 0.94 on the test data. Key variables for predicting the validity of the 3C ranges included the difference between observed horizontal-to-vertical amplitude ratios and its theoretical value, polarization metrics, and the amplitude of one horizontally oriented OBS component (Y-channel). The resulting framework contributes to improving the utility of seismic data for biological studies, which are critical for marine mammal monitoring and conservation strategies.

海底地震仪(OBSs)越来越多地用于跟踪须鲸的信号,采用单站测距技术,如三分量(3C)方法。通过使用OBS组件的地面运动方向,3C方法在与仪器深度相关的有效范围内提供了直接路径信号的鲁棒范围估计。因此,该方法需要一个分类过程来确定信号是否在有效范围内。由来自6个地点的20赫兹音符组成的长须鲸轨迹被用来开发和评估三种分类模型:决策树(dt)、广义加性模型和神经网络。模型使用不同的数据组合进行训练,并纳入了一系列与信道幅度、信号质量、极化和估计信号角度相关的变量。DT取得了最高的性能,在测试数据上达到了0.94的精度。预测3C范围有效性的关键变量包括观测到的水平与垂直振幅比与其理论值之差、极化指标和一个水平方向OBS分量(y通道)的振幅。由此产生的框架有助于提高地震数据在生物学研究中的效用,这对海洋哺乳动物的监测和保护策略至关重要。
{"title":"Classifying accuracy of fin whale range estimates from single seismic sensors.","authors":"Andreia Pereira, Carolina Marques, Rose Hilmo, David K Mellinger, William S D Wilcock, Tiago A Marques, Danielle V Harris, Luis Matias","doi":"10.1121/10.0042399","DOIUrl":"https://doi.org/10.1121/10.0042399","url":null,"abstract":"<p><p>Ocean-bottom seismometers (OBSs) are used increasingly often to track baleen whale signals, employing single-station ranging techniques such as the three-component (3C) method. By using the orientation of ground motion from OBS components, the 3C method provides robust range estimates of direct-path signals within a validity range that relates to instrument depth. Consequently, the method requires a classification process to determine whether a signal falls within the validity range. Fin whale tracks, composed of 20-Hz notes from six locations, were used to develop and evaluate three classification models: decision trees (DTs), generalized additive models, and neural networks. Models were trained using different data combinations and incorporated a comprehensive set of variables related to channel amplitude, signal quality, polarization, and estimated signal angles. The DT achieved the highest performance, reaching an accuracy of 0.94 on the test data. Key variables for predicting the validity of the 3C ranges included the difference between observed horizontal-to-vertical amplitude ratios and its theoretical value, polarization metrics, and the amplitude of one horizontally oriented OBS component (Y-channel). The resulting framework contributes to improving the utility of seismic data for biological studies, which are critical for marine mammal monitoring and conservation strategies.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1430-1445"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146180750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A physics-based low-order filter approximation for scattering from a rigid sphere. 刚性球散射的一种基于物理的低阶滤波近似。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042226
Yuqing Li, Stephan D Ewert

Efficient simulation of object scattering is crucial in virtual and room acoustics, particularly for interactive scenarios where time-varying diffraction must be evaluated repeatedly in real-time. In this context, the rigid sphere is commonly used as a simple geometric representation of more complex three-dimensional objects. Although the analytical solution for rigid sphere scattering is well known, its computation is too involved for interactive audio rendering. In this paper, we propose a physically informed, low-order digital filter approximation of rigid sphere scattering for arbitrary source and receiver positions. For the low-frequency range, the first three terms of the spherical harmonics analytical solution are directly expressed by combined first-order low- and high-pass filters. For high-frequencies, the basic properties of rigid sphere scattering are approximated by modelling the shortest and longest paths reflected or bent around the sphere. Both approximations are combined using a blending function to obtain the wideband result. The magnitude of the low-frequency approximation matches the analytical solution well, yielding mean root mean square errors below 0.5 dB for source and receiver distances greater than twice the sphere radius and about 1.5 dB at smaller distances. For the high-frequency range, the mean error in magnitude is overall larger and is about 2 dB.

物体散射的有效模拟在虚拟声学和室内声学中是至关重要的,特别是对于必须实时重复评估时变衍射的交互式场景。在这种情况下,刚性球体通常被用作更复杂的三维物体的简单几何表示。虽然刚性球散射的解析解是众所周知的,但它的计算对于交互式音频渲染来说过于复杂。在本文中,我们提出了一个物理通知,低阶数字滤波器近似刚性球散射任意源和接收机的位置。在低频范围内,球面谐波解析解的前三项由一阶低通和高通滤波器组合直接表示。对于高频,刚性球散射的基本特性是通过模拟在球周围反射或弯曲的最短和最长路径来近似的。使用混合函数将两个近似组合起来以获得宽带结果。低频近似的幅度与解析解吻合得很好,当源端和接收端距离大于球半径的两倍时,平均均方根误差小于0.5 dB,当距离较小时,平均均方根误差约为1.5 dB。对于高频范围,平均误差幅度总体较大,约为2db。
{"title":"A physics-based low-order filter approximation for scattering from a rigid sphere.","authors":"Yuqing Li, Stephan D Ewert","doi":"10.1121/10.0042226","DOIUrl":"https://doi.org/10.1121/10.0042226","url":null,"abstract":"<p><p>Efficient simulation of object scattering is crucial in virtual and room acoustics, particularly for interactive scenarios where time-varying diffraction must be evaluated repeatedly in real-time. In this context, the rigid sphere is commonly used as a simple geometric representation of more complex three-dimensional objects. Although the analytical solution for rigid sphere scattering is well known, its computation is too involved for interactive audio rendering. In this paper, we propose a physically informed, low-order digital filter approximation of rigid sphere scattering for arbitrary source and receiver positions. For the low-frequency range, the first three terms of the spherical harmonics analytical solution are directly expressed by combined first-order low- and high-pass filters. For high-frequencies, the basic properties of rigid sphere scattering are approximated by modelling the shortest and longest paths reflected or bent around the sphere. Both approximations are combined using a blending function to obtain the wideband result. The magnitude of the low-frequency approximation matches the analytical solution well, yielding mean root mean square errors below 0.5 dB for source and receiver distances greater than twice the sphere radius and about 1.5 dB at smaller distances. For the high-frequency range, the mean error in magnitude is overall larger and is about 2 dB.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1173-1189"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146125391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A model-based approach for B/A estimation using the finite amplitude insertion-substitution method. 基于模型的有限振幅插入-替换法B/A估计方法。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042448
Andreas Schrempf, Lisa Stefely, Thomas Thurner, Harald Weichenberger

An enhanced, model-based framework is presented for estimating the acoustic nonlinearity parameter B/A using the finite amplitude insertion-substitution (FAIS) method in transmission mode. The proposed approach advances harmonic amplitude modeling by eliminating the restrictive plane wave assumption and introducing shape functions that accurately account for transducer diffraction and wave superposition. Additionally, a correction term for second-harmonic attenuation is incorporated, enabling precise characterization across a wide range of attenuation exponents. The framework supports both iterative and direct B/A estimation methods, each validated through comprehensive simulations and experimental measurements. Compared to conventional FAIS methods, this method significantly improves accuracy and robustness in B/A estimation, reduces sensitivity to experimental setup and transducer geometry, and provides practical guidance for optimizing experimental parameters such as sample thickness, distance between source, sample, and receiver, and attenuation measurement.

提出了一种改进的基于模型的框架,用于在传输模式下使用有限振幅插入-替换(FAIS)方法估计声学非线性参数B/A。该方法通过消除限制平面波假设和引入精确考虑换能器衍射和波叠加的形状函数来推进谐波振幅建模。此外,还纳入了二次谐波衰减的校正项,从而可以在广泛的衰减指数范围内进行精确表征。该框架支持迭代和直接B/A估计方法,每种方法都通过综合模拟和实验测量进行验证。与传统的FAIS方法相比,该方法显著提高了B/A估计的准确性和鲁棒性,降低了对实验设置和传感器几何形状的敏感性,并为优化实验参数(如样品厚度、源、样品和接收器之间的距离以及衰减测量)提供了实用的指导。
{"title":"A model-based approach for B/A estimation using the finite amplitude insertion-substitution method.","authors":"Andreas Schrempf, Lisa Stefely, Thomas Thurner, Harald Weichenberger","doi":"10.1121/10.0042448","DOIUrl":"https://doi.org/10.1121/10.0042448","url":null,"abstract":"<p><p>An enhanced, model-based framework is presented for estimating the acoustic nonlinearity parameter B/A using the finite amplitude insertion-substitution (FAIS) method in transmission mode. The proposed approach advances harmonic amplitude modeling by eliminating the restrictive plane wave assumption and introducing shape functions that accurately account for transducer diffraction and wave superposition. Additionally, a correction term for second-harmonic attenuation is incorporated, enabling precise characterization across a wide range of attenuation exponents. The framework supports both iterative and direct B/A estimation methods, each validated through comprehensive simulations and experimental measurements. Compared to conventional FAIS methods, this method significantly improves accuracy and robustness in B/A estimation, reduces sensitivity to experimental setup and transducer geometry, and provides practical guidance for optimizing experimental parameters such as sample thickness, distance between source, sample, and receiver, and attenuation measurement.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1416-1429"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146165806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Range-dependent shallow water sound source localization via digital twin model incorporating spectral element full-wave numerical simulation. 基于谱元全波数值模拟的距离相关浅水声源定位方法。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042404
Yihua Xing, Shahram Khazaie, Xun Wang

Sound source localization in complex range-dependent shallow water environments is a challenging problem. The challenges arise from factors such as irregular seabed topography and sound speed profiles, which cannot be quantified by the classical underwater sound propagation models. Although full-wave numerical simulation can accurately simulate the acoustic field propagation in such complex range-dependent environments, applying it directly to the sound source localization problem is difficult because of its high computational cost. To address this problem, this paper proposes an acoustic digital twin model that integrates full-wave numerical simulations in an efficient way to perform sound source localization both accurately and rapidly. Specifically, the spectral element method-based full-wave numerical simulations are used to model the sound propagation in a range-dependent environment for a small number of well-designed representations of sound source locations. The Kriging method is then employed to build a digital twin model based on these numerical results to form a model of the sound field varying continuously with source parameters. Subsequently, as long as the sound field is measured by a hydrophone array, it can be matched with the output acoustic field of the digital twin model based on the matched-field processing principle to realize rapid and accurate source localization in complex underwater environments. Numerical experiments demonstrate that the proposed method offers accurate and low-cost real-time sound source localization in complex marine scenarios.

在复杂距离相关的浅水环境中,声源定位是一个具有挑战性的问题。传统的水声传播模型无法对不规则的海底地形和声速分布等因素进行量化,这是研究的难点。虽然全波数值模拟可以准确模拟这种复杂距离依赖环境下的声场传播,但由于计算成本高,将其直接应用于声源定位问题比较困难。为了解决这一问题,本文提出了一种声学数字孪生模型,该模型集成了全波数值模拟,有效地实现了准确快速的声源定位。具体而言,基于谱元方法的全波数值模拟用于模拟声源位置的少量良好设计表示在距离相关环境中的声音传播。在此基础上,利用Kriging方法建立数字孪生模型,形成声场随声源参数连续变化的模型。随后,只要通过水听器阵列测量声场,就可以根据匹配场处理原理将其与数字孪生模型的输出声场进行匹配,实现复杂水下环境下快速准确的声源定位。数值实验表明,该方法能够在复杂的海洋环境中提供准确、低成本的实时声源定位。
{"title":"Range-dependent shallow water sound source localization via digital twin model incorporating spectral element full-wave numerical simulation.","authors":"Yihua Xing, Shahram Khazaie, Xun Wang","doi":"10.1121/10.0042404","DOIUrl":"https://doi.org/10.1121/10.0042404","url":null,"abstract":"<p><p>Sound source localization in complex range-dependent shallow water environments is a challenging problem. The challenges arise from factors such as irregular seabed topography and sound speed profiles, which cannot be quantified by the classical underwater sound propagation models. Although full-wave numerical simulation can accurately simulate the acoustic field propagation in such complex range-dependent environments, applying it directly to the sound source localization problem is difficult because of its high computational cost. To address this problem, this paper proposes an acoustic digital twin model that integrates full-wave numerical simulations in an efficient way to perform sound source localization both accurately and rapidly. Specifically, the spectral element method-based full-wave numerical simulations are used to model the sound propagation in a range-dependent environment for a small number of well-designed representations of sound source locations. The Kriging method is then employed to build a digital twin model based on these numerical results to form a model of the sound field varying continuously with source parameters. Subsequently, as long as the sound field is measured by a hydrophone array, it can be matched with the output acoustic field of the digital twin model based on the matched-field processing principle to realize rapid and accurate source localization in complex underwater environments. Numerical experiments demonstrate that the proposed method offers accurate and low-cost real-time sound source localization in complex marine scenarios.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1225-1234"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146142436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wave propagation in an elastic lattice with nonreciprocal stiffness and engineered dampinga). 具有非倒易刚度和工程阻尼的弹性晶格中的波传播a)。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042349
Harshit Kumar Sandhu, Saurav Dutta, Rajesh Chaunsali

Nonreciprocal wave propagation allows for directional energy transport. In this work, wave dynamics is systematically investigated in an elastic lattice that combines nonreciprocal stiffness with viscous damping. After establishing how conventional damping counteracts the system's gain, a non-dissipative form of nonreciprocal damping in the form of gyroscopic damping is introduced. It is found that the coexistence of nonreciprocal stiffness and nonreciprocal damping results in a decoupled control mechanism. The nonreciprocal stiffness is shown to govern the temporal amplification rate, whereas the nonreciprocal damper independently tunes the wave's group velocity and oscillation frequency. This decoupling gives rise to phenomena such as the enhancement of net amplification for slower-propagating waves and also boundary-induced wave interference arising from divergent and convergent reflected wave trajectories with varying growth rates. These findings provide a theoretical framework for designing active metamaterials with more versatile control over their wave propagation characteristics.

非互反波传播允许定向能量传输。在这项工作中,波浪动力学系统地研究了一个弹性晶格,结合了非倒易刚度和粘性阻尼。在确定了传统阻尼如何抵消系统增益后,介绍了一种非耗散形式的非倒易阻尼,即陀螺阻尼。研究发现,非倒易刚度和非倒易阻尼的共存导致了一种解耦控制机制。非倒易刚度控制时间放大率,而非倒易阻尼器独立调节波的群速度和振荡频率。这种解耦产生了一些现象,如慢传播波的净放大增强,以及由不同增长率的发散和收敛反射波轨迹引起的边界诱导波干涉。这些发现为设计具有更多功能控制其波传播特性的活性超材料提供了理论框架。
{"title":"Wave propagation in an elastic lattice with nonreciprocal stiffness and engineered dampinga).","authors":"Harshit Kumar Sandhu, Saurav Dutta, Rajesh Chaunsali","doi":"10.1121/10.0042349","DOIUrl":"https://doi.org/10.1121/10.0042349","url":null,"abstract":"<p><p>Nonreciprocal wave propagation allows for directional energy transport. In this work, wave dynamics is systematically investigated in an elastic lattice that combines nonreciprocal stiffness with viscous damping. After establishing how conventional damping counteracts the system's gain, a non-dissipative form of nonreciprocal damping in the form of gyroscopic damping is introduced. It is found that the coexistence of nonreciprocal stiffness and nonreciprocal damping results in a decoupled control mechanism. The nonreciprocal stiffness is shown to govern the temporal amplification rate, whereas the nonreciprocal damper independently tunes the wave's group velocity and oscillation frequency. This decoupling gives rise to phenomena such as the enhancement of net amplification for slower-propagating waves and also boundary-induced wave interference arising from divergent and convergent reflected wave trajectories with varying growth rates. These findings provide a theoretical framework for designing active metamaterials with more versatile control over their wave propagation characteristics.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"978-993"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146106012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-complexity iterative receiver based on variational Bayesian inference for multiple-input multiple-output underwater acoustic communication. 基于变分贝叶斯推理的多输入多输出水声通信低复杂度迭代接收机。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042465
Wei-Zhe Li, Xiao Han, Yi-Zhen Jia, Zheng Wu, Jing-Wei Yin

The design of high-performance receivers with manageable complexity is crucial for underwater acoustic communication, especially in multiple-input multiple-output (MIMO) scenarios. In this paper, a low-complexity MIMO receiver, which is based on variational Bayesian inference (VBI), is proposed. First, an iterative channel estimation model is constructed, which is based on VBI, the high-dimensional channel vector in the MIMO system is decomposed into a parallel connection of multiple low-dimensional channel vectors with different sparsity, and the prior distribution of the low-dimensional channel vectors is jointly modeled using temporal correlation (TC) and sparsity. Next, the low-complexity vector approximate message passing (VAMP) technique is integrated into the VBI framework and a channel estimation method is derived, based on TC-VAMP-VBI. Finally, to reduce the complexity of MIMO channel equalization, a serial iterative equalization algorithm is proposed, which incorporates passive time reversal under the VBI framework. The proposed algorithm was validated using simulations and MIMO communication data collected from two field experiments. The results show that the proposed algorithm can significantly reduce the computational complexity of the MIMO system while maintaining robustness of channel estimation in short data block scenarios.

设计可管理的高性能接收机是水声通信的关键,特别是在多输入多输出(MIMO)场景下。本文提出了一种基于变分贝叶斯推理(VBI)的低复杂度MIMO接收机。首先,构建基于VBI的迭代信道估计模型,将MIMO系统中的高维信道向量分解为多个不同稀疏度的低维信道向量的并行连接,并利用时间相关和稀疏度联合建模低维信道向量的先验分布;然后,将低复杂度的向量近似消息传递(VAMP)技术集成到VBI框架中,推导出基于TC-VAMP-VBI的信道估计方法。最后,为了降低MIMO信道均衡的复杂度,提出了一种在VBI框架下结合被动时间反转的串行迭代均衡算法。通过仿真和MIMO通信数据验证了该算法的有效性。结果表明,该算法在保持短数据块信道估计鲁棒性的同时,显著降低了MIMO系统的计算复杂度。
{"title":"Low-complexity iterative receiver based on variational Bayesian inference for multiple-input multiple-output underwater acoustic communication.","authors":"Wei-Zhe Li, Xiao Han, Yi-Zhen Jia, Zheng Wu, Jing-Wei Yin","doi":"10.1121/10.0042465","DOIUrl":"https://doi.org/10.1121/10.0042465","url":null,"abstract":"<p><p>The design of high-performance receivers with manageable complexity is crucial for underwater acoustic communication, especially in multiple-input multiple-output (MIMO) scenarios. In this paper, a low-complexity MIMO receiver, which is based on variational Bayesian inference (VBI), is proposed. First, an iterative channel estimation model is constructed, which is based on VBI, the high-dimensional channel vector in the MIMO system is decomposed into a parallel connection of multiple low-dimensional channel vectors with different sparsity, and the prior distribution of the low-dimensional channel vectors is jointly modeled using temporal correlation (TC) and sparsity. Next, the low-complexity vector approximate message passing (VAMP) technique is integrated into the VBI framework and a channel estimation method is derived, based on TC-VAMP-VBI. Finally, to reduce the complexity of MIMO channel equalization, a serial iterative equalization algorithm is proposed, which incorporates passive time reversal under the VBI framework. The proposed algorithm was validated using simulations and MIMO communication data collected from two field experiments. The results show that the proposed algorithm can significantly reduce the computational complexity of the MIMO system while maintaining robustness of channel estimation in short data block scenarios.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1512-1528"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146180725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empirical correction method for spatial averaging effect in ultrasonic device calibration: Enhancing the precision-sensitivity trade-off. 超声装置标定中空间平均效应的经验校正方法:提高精度灵敏度的权衡。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042356
Francisco Alves, Mário Santos, André Alvarenga, Lorena Petrella

The calibration of ultrasound diagnostic equipment is essential to ensure their effectiveness and safety. Calibrating acoustic fields using hydrophones involves the measurement of the maximum pressure point. Since the signal at the hydrophone output is proportional to the average pressure incident on its surface, when the active area of the hydrophone is larger than the area of the ultrasonic beam at the focus, the lower acoustic pressures surrounding the point of maximum pressure will cause an underestimation of this value. This phenomenon is referred as the spatial averaging effect. The main limitation in the use of smaller hydrophones is the inherent reduction in sensitivity. The International Electrotechnical Commission standards provide methods to correct for the spatial averaging effect when the ratio of the - 6 dB beam width to the hydrophone diameter (Rbh) is higher than 1.5. In this study, a novel method is presented for spatial averaging correction, developed using computational simulation. It consists of an empirical correction factor and allows extending corrections for Rbh values as low as 0.35, with errors below 3%, addressing the compromise between precision and sensitivity of the hydrophone. This method also generalizes to ultrasonic probes with varying characteristics.

超声诊断设备的标定是保证其有效性和安全性的关键。利用水听器校准声场涉及到最大压力点的测量。由于水听器输出处的信号与入射到其表面的平均压力成正比,当水听器的活动面积大于焦点处超声波束的面积时,最大压力点周围较低的声压会导致该值被低估。这种现象被称为空间平均效应。使用较小的水听器的主要限制是其固有的灵敏度降低。国际电工委员会标准提供了校正- 6db波束宽度与水听器直径之比(Rbh)大于1.5时的空间平均效应的方法。本文提出了一种基于计算模拟的空间平均校正方法。它包括一个经验校正因子,允许扩展校正Rbh值低至0.35,误差低于3%,解决了水听器精度和灵敏度之间的折衷问题。这种方法也适用于具有不同特性的超声波探头。
{"title":"Empirical correction method for spatial averaging effect in ultrasonic device calibration: Enhancing the precision-sensitivity trade-off.","authors":"Francisco Alves, Mário Santos, André Alvarenga, Lorena Petrella","doi":"10.1121/10.0042356","DOIUrl":"https://doi.org/10.1121/10.0042356","url":null,"abstract":"<p><p>The calibration of ultrasound diagnostic equipment is essential to ensure their effectiveness and safety. Calibrating acoustic fields using hydrophones involves the measurement of the maximum pressure point. Since the signal at the hydrophone output is proportional to the average pressure incident on its surface, when the active area of the hydrophone is larger than the area of the ultrasonic beam at the focus, the lower acoustic pressures surrounding the point of maximum pressure will cause an underestimation of this value. This phenomenon is referred as the spatial averaging effect. The main limitation in the use of smaller hydrophones is the inherent reduction in sensitivity. The International Electrotechnical Commission standards provide methods to correct for the spatial averaging effect when the ratio of the - 6 dB beam width to the hydrophone diameter (Rbh) is higher than 1.5. In this study, a novel method is presented for spatial averaging correction, developed using computational simulation. It consists of an empirical correction factor and allows extending corrections for Rbh values as low as 0.35, with errors below 3%, addressing the compromise between precision and sensitivity of the hydrophone. This method also generalizes to ultrasonic probes with varying characteristics.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1027-1035"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146105957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validation of Count-the-Dots audiogram approaches to calculating speech intelligibility indices. 计算语音可理解度指标的点阵听力图方法的验证。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042425
Koenraad S Rhebergen, Chaslav V Pavlovic

This study evaluates Count-the-Dots Audiogram approaches as a simplified clinically viable method to closely estimate the American National Standards Institute [ANSI (1997). S3.5-1997) Speech Intelligibility Index (SII)] standard in quiet environments. We compared audibility calculations and predicted intelligibility scores between Count-the-Dots methods and multiple ANSI [(1997). S3.5-1997)] SII variants, using eight frequency Band Importance Functions (BIF) for 14 776 audiograms from the National Health and Nutrition Examination Survey dataset. Results showed that Count-the-Dots methods closely approximate the ANSI [(1997). S3.5-1997)] SII model as long as the speech levels and the BIF used for calculations were equivalent between the two methods. This was true for audibility calculations and speech intelligibility predictions. However, deviations occurred at higher speech levels [≥65 dB sound pressure level (SPL)] because of differences in how masking is modeled. Count-the-Dots audiogram approaches offer a clinically viable, intuitive alternative for counseling purposes in quiet settings, particularly at natural speech levels (about 55 dB SPL). However, for speech-in-noise conditions, high-level speech, or aided speech inputs, the ANSI [(1997). S3.5-1997)] SII remains the preferred model because of its more detailed acoustic modeling.

本研究评估了点数听力图方法作为一种简化的临床可行方法来密切评估美国国家标准协会[ANSI(1997)]。S3.5-1997)安静环境下的语音清晰度指数(SII)]标准。我们比较了“点计数”方法和多个ANSI[(1997)]之间的可听性计算和预测可理解性分数。S3.5-1997)] SII变体,使用8个频带重要性函数(BIF)对来自国家健康和营养检查调查数据集的14776个听力图进行分析。结果表明,Count-the-Dots方法非常接近ANSI[(1997)]。(S3.5-1997)] SII模型,只要两种方法计算的语音电平和BIF相等。对于可听性计算和语音可理解性预测来说,这是正确的。然而,在较高的语音水平[≥65 dB声压级(SPL)]下,由于掩蔽建模方式的差异,出现了偏差。在安静环境下,特别是在自然语音水平(约55 dB SPL)下,点数听力学方法为咨询目的提供了临床可行的、直观的替代方案。然而,对于噪声中的语音条件,高级语音或辅助语音输入,ANSI[(1997)]。S3.5-1997)] SII仍然是首选模型,因为它更详细的声学建模。
{"title":"Validation of Count-the-Dots audiogram approaches to calculating speech intelligibility indices.","authors":"Koenraad S Rhebergen, Chaslav V Pavlovic","doi":"10.1121/10.0042425","DOIUrl":"https://doi.org/10.1121/10.0042425","url":null,"abstract":"<p><p>This study evaluates Count-the-Dots Audiogram approaches as a simplified clinically viable method to closely estimate the American National Standards Institute [ANSI (1997). S3.5-1997) Speech Intelligibility Index (SII)] standard in quiet environments. We compared audibility calculations and predicted intelligibility scores between Count-the-Dots methods and multiple ANSI [(1997). S3.5-1997)] SII variants, using eight frequency Band Importance Functions (BIF) for 14 776 audiograms from the National Health and Nutrition Examination Survey dataset. Results showed that Count-the-Dots methods closely approximate the ANSI [(1997). S3.5-1997)] SII model as long as the speech levels and the BIF used for calculations were equivalent between the two methods. This was true for audibility calculations and speech intelligibility predictions. However, deviations occurred at higher speech levels [≥65 dB sound pressure level (SPL)] because of differences in how masking is modeled. Count-the-Dots audiogram approaches offer a clinically viable, intuitive alternative for counseling purposes in quiet settings, particularly at natural speech levels (about 55 dB SPL). However, for speech-in-noise conditions, high-level speech, or aided speech inputs, the ANSI [(1997). S3.5-1997)] SII remains the preferred model because of its more detailed acoustic modeling.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1337-1347"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146149759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How does a deep neural network look at lexical stress in English words? 深度神经网络如何看待英语单词中的词汇重音?
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042429
Itai Allouche, Itay Asael, Rotem Rousso, Vered Dassa, Ann Bradlow, Seung-Eun Kim, Matthew Goldrick, Joseph Keshet

Despite their success in speech processing, neural networks often operate as black boxes, prompting the following questions: What informs their decisions, and how can we interpret them? This work examines this issue in the context of lexical stress. A dataset of English disyllabic words was automatically constructed from read and spontaneous speech. Several convolutional neural network (CNN) architectures were trained to predict stress position from a spectrographic representation of disyllabic words lacking minimal stress pairs (e.g., initial stress WAllet, final stress exTEND), achieving up to 92% accuracy on held-out test data. Layerwise relevance propagation, a technique for neural network interpretability analysis, revealed that predictions for held-out minimal pairs (PROtest vs proTEST) were most strongly influenced by information in stressed versus unstressed syllables, particularly the spectral properties of stressed vowels. However, the classifiers also attended to information throughout the word. A feature-specific relevance analysis is proposed, and its results suggest that the best-performing classifier is strongly influenced by the stressed vowel's first and second formants, with some evidence that its pitch and third formant also contribute. These results reveal deep learning's ability to acquire distributed cues to stress from naturally occurring data, extending traditional phonetic work based around highly controlled stimuli.

尽管神经网络在语音处理方面取得了成功,但它们经常像黑盒子一样运作,这引发了以下问题:是什么影响了它们的决定,我们如何解释这些决定?这项工作考察了这个问题在词汇重音的背景下。从阅读和自发语音中自动构建英语双音节词数据集。几个卷积神经网络(CNN)架构被训练来从缺乏最小应力对的双音节单词的频谱表示(例如,初始应力WAllet,最终应力exTEND)中预测应力位置,在保持测试数据上达到高达92%的准确率。分层关联传播是一种神经网络可解释性分析技术,它揭示了对保持最小对(PROtest vs . PROtest)的预测最强烈地受到重音和非重音音节信息的影响,尤其是重音元音的频谱特性。然而,分类器也关注整个单词的信息。提出了一种特征相关分析,其结果表明,表现最好的分类器受到重读元音的第一个和第二个共振峰的强烈影响,有证据表明其音高和第三个共振峰也有贡献。这些结果揭示了深度学习从自然发生的数据中获取分布式压力线索的能力,扩展了基于高度受控刺激的传统语音工作。
{"title":"How does a deep neural network look at lexical stress in English words?","authors":"Itai Allouche, Itay Asael, Rotem Rousso, Vered Dassa, Ann Bradlow, Seung-Eun Kim, Matthew Goldrick, Joseph Keshet","doi":"10.1121/10.0042429","DOIUrl":"https://doi.org/10.1121/10.0042429","url":null,"abstract":"<p><p>Despite their success in speech processing, neural networks often operate as black boxes, prompting the following questions: What informs their decisions, and how can we interpret them? This work examines this issue in the context of lexical stress. A dataset of English disyllabic words was automatically constructed from read and spontaneous speech. Several convolutional neural network (CNN) architectures were trained to predict stress position from a spectrographic representation of disyllabic words lacking minimal stress pairs (e.g., initial stress WAllet, final stress exTEND), achieving up to 92% accuracy on held-out test data. Layerwise relevance propagation, a technique for neural network interpretability analysis, revealed that predictions for held-out minimal pairs (PROtest vs proTEST) were most strongly influenced by information in stressed versus unstressed syllables, particularly the spectral properties of stressed vowels. However, the classifiers also attended to information throughout the word. A feature-specific relevance analysis is proposed, and its results suggest that the best-performing classifier is strongly influenced by the stressed vowel's first and second formants, with some evidence that its pitch and third formant also contribute. These results reveal deep learning's ability to acquire distributed cues to stress from naturally occurring data, extending traditional phonetic work based around highly controlled stimuli.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1348-1358"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146157220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Iterative Born solver for the acoustic Helmholtz equation with heterogeneous sound speed and density. 具有非均匀声速和密度的声学亥姆霍兹方程的迭代Born求解器。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-02-01 DOI: 10.1121/10.0042259
Antonio Stanziola, Simon R Arridge, Bradley E Treeby, Benjamin T Cox

Efficient numerical solution of the acoustic Helmholtz equation in heterogeneous media remains challenging, particularly for large-scale problems with spatially varying density-a limitation that restricts applications in biomedical acoustics and seismic imaging. A fast iterative solver that extends the convergent Born series [Osnabrugge, Leedumrongwatthanakun, and Vellekoop, J. Comput. Phys. 322, 113-124 (2016)] method to handle arbitrary variations in sound speed, density, and absorption simultaneously is presented. This approach reformulates the Helmholtz equation as a first-order system and applies the universal split-preconditioner from Vettenburg and Vellekoop [arXiv:2207.14222v2 (2022)], yielding a matrix-free algorithm that leverages Fast Fourier Transforms for computational efficiency. Unlike existing Born series methods, this solver accommodates heterogeneous density without requiring expensive matrix decompositions or preprocessing steps, making it suitable for large-scale three-dimensional problems with minimal memory overhead. The method provides forward and adjoint solutions, enabling its application for inverse problems. Accuracy is validated through comparison against an analytical solution and the solver's practical utility is demonstrated through transcranial ultrasound simulations. The solver achieves convergence for strong scattering scenarios, offering a computationally efficient alternative to time-domain methods and matrix-based Helmholtz solvers for applications ranging from medical ultrasound treatment planning to seismic exploration.

非均质介质中声学亥姆霍兹方程的有效数值解仍然具有挑战性,特别是对于具有空间变化密度的大规模问题,这限制了生物医学声学和地震成像的应用。一种扩展收敛Born级数的快速迭代求解器[j]。提出了同时处理声速、密度和吸收任意变化的方法。物理学报,322,113-124 (2016)]该方法将Helmholtz方程重新表述为一阶系统,并应用Vettenburg和Vellekoop的通用分裂预条件[arXiv:2207.14222v2(2022)],产生了一种利用快速傅里叶变换提高计算效率的无矩阵算法。与现有的Born系列方法不同,该求解器适应异构密度,而不需要昂贵的矩阵分解或预处理步骤,使其适用于内存开销最小的大规模三维问题。该方法提供了正解和伴随解,可用于求解反问题。通过与解析解的比较验证了准确性,并通过经颅超声模拟证明了求解器的实用性。该求解器在强散射情况下实现收敛,为从医学超声治疗计划到地震勘探等应用提供了时域方法和基于矩阵的亥姆霍兹求解器的计算效率替代方案。
{"title":"Iterative Born solver for the acoustic Helmholtz equation with heterogeneous sound speed and density.","authors":"Antonio Stanziola, Simon R Arridge, Bradley E Treeby, Benjamin T Cox","doi":"10.1121/10.0042259","DOIUrl":"https://doi.org/10.1121/10.0042259","url":null,"abstract":"<p><p>Efficient numerical solution of the acoustic Helmholtz equation in heterogeneous media remains challenging, particularly for large-scale problems with spatially varying density-a limitation that restricts applications in biomedical acoustics and seismic imaging. A fast iterative solver that extends the convergent Born series [Osnabrugge, Leedumrongwatthanakun, and Vellekoop, J. Comput. Phys. 322, 113-124 (2016)] method to handle arbitrary variations in sound speed, density, and absorption simultaneously is presented. This approach reformulates the Helmholtz equation as a first-order system and applies the universal split-preconditioner from Vettenburg and Vellekoop [arXiv:2207.14222v2 (2022)], yielding a matrix-free algorithm that leverages Fast Fourier Transforms for computational efficiency. Unlike existing Born series methods, this solver accommodates heterogeneous density without requiring expensive matrix decompositions or preprocessing steps, making it suitable for large-scale three-dimensional problems with minimal memory overhead. The method provides forward and adjoint solutions, enabling its application for inverse problems. Accuracy is validated through comparison against an analytical solution and the solver's practical utility is demonstrated through transcranial ultrasound simulations. The solver achieves convergence for strong scattering scenarios, offering a computationally efficient alternative to time-domain methods and matrix-based Helmholtz solvers for applications ranging from medical ultrasound treatment planning to seismic exploration.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 2","pages":"1457-1470"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146180755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Acoustical Society of America
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1