首页 > 最新文献

Journal of the Acoustical Society of America最新文献

英文 中文
The temporal effects of auditory and visual immersion on speech level in virtual environments. 虚拟环境中听觉和视觉沉浸对言语水平的时间效应。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042240
Xinyi N Zhang, Arian Shamei, Florian Grond, Ingrid Verduyckt, Rachel E Bouserhal

Speech takes place in physical environments with visual and acoustic properties, yet how these elements and their interaction influence speech production is not fully understood. While a room's appearance can suggest its acoustics, it is unclear whether people adjust their speech based on this visual information. Previous research shows that higher reverberation leads to reduced speech level, but how auditory and visual information interact in this process remains limited. This study examined how audiovisual information affects speech level by immersing participants in virtual environments with varying reverberation and room visuals (hemi-anechoic room, classroom, and gymnasium) while completing speech tasks. Speech level was analyzed using generalized additive mixed-effects modeling to assess temporal changes during utterances across conditions. Results showed that visual information significantly influenced speech level, though not strictly in line with expected acoustics or perceived room size; auditory information had a stronger overall effect than visual information. Visual information had an earlier influence that diminished over time, whereas the auditory effect increased and plateaued. These findings contribute to the understanding of multisensory integration in speech control and have implications in enhancing vocal performance and supporting more naturalistic communication in virtual environments.

语音发生在具有视觉和声学特性的物理环境中,但这些元素及其相互作用如何影响语音产生尚不完全清楚。虽然一个房间的外观可以表明它的声学效果,但人们是否会根据这种视觉信息来调整自己的语言还不清楚。先前的研究表明,较高的混响会导致语音水平降低,但听觉和视觉信息在这一过程中如何相互作用仍然有限。本研究通过将参与者沉浸在具有不同混响和房间视觉效果(半消声室、教室和体育馆)的虚拟环境中,同时完成演讲任务,研究了视听信息如何影响语音水平。使用广义加性混合效应模型分析语音水平,以评估不同条件下话语的时间变化。结果表明,视觉信息显著影响语音水平,尽管与预期的声学或感知的房间大小不完全一致;听觉信息的整体效果强于视觉信息。视觉信息的早期影响随着时间的推移而减弱,而听觉的影响则增加并趋于稳定。这些发现有助于理解语音控制中的多感觉整合,并对提高语音表现和支持虚拟环境中更自然的交流具有重要意义。
{"title":"The temporal effects of auditory and visual immersion on speech level in virtual environments.","authors":"Xinyi N Zhang, Arian Shamei, Florian Grond, Ingrid Verduyckt, Rachel E Bouserhal","doi":"10.1121/10.0042240","DOIUrl":"https://doi.org/10.1121/10.0042240","url":null,"abstract":"<p><p>Speech takes place in physical environments with visual and acoustic properties, yet how these elements and their interaction influence speech production is not fully understood. While a room's appearance can suggest its acoustics, it is unclear whether people adjust their speech based on this visual information. Previous research shows that higher reverberation leads to reduced speech level, but how auditory and visual information interact in this process remains limited. This study examined how audiovisual information affects speech level by immersing participants in virtual environments with varying reverberation and room visuals (hemi-anechoic room, classroom, and gymnasium) while completing speech tasks. Speech level was analyzed using generalized additive mixed-effects modeling to assess temporal changes during utterances across conditions. Results showed that visual information significantly influenced speech level, though not strictly in line with expected acoustics or perceived room size; auditory information had a stronger overall effect than visual information. Visual information had an earlier influence that diminished over time, whereas the auditory effect increased and plateaued. These findings contribute to the understanding of multisensory integration in speech control and have implications in enhancing vocal performance and supporting more naturalistic communication in virtual environments.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"384-397"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PAMGuard: Application software for passive acoustic detection, classification, and localisation of animal sounds. PAMGuard:用于被动声学检测、分类和动物声音定位的应用软件。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042245
Douglas Gillespie, Jamie Macaulay, Michael Oswald, Marie Roch

Detection, classification, and localisation of animal sounds are essential in many ecological studies, including density estimation and behavioural studies. Real-time acoustic processing can also be used in mitigation exercises, with the possibility of curtailing harmful human activities when animals are present. Animal vocalisations vary widely, and there is no single detection algorithm that can robustly detect all sound types. Human-in-the loop analysis is often required to validate algorithm performance and deal with unexpected noise sources such as are often encountered in real-world situations. The PAMGuard software combines advanced automatic analysis algorithms, including AI methods, with interactive visual tools allowing users to develop efficient workflows for both real-time use and for processing archived datasets. A modular framework enables users to configure multiple detectors, classifiers, and localisers suitable for the equipment and species of interest in a particular application. Multiple detectors for different sound types can be run concurrently on the same data. An extensible "plug-in" interface also makes it possible for third parties to independently develop new modules to run within the software framework. Here, we describe the software's core functionality, illustrated using workflows for both real-time and offline use, and present an update on the latest features.

动物声音的检测、分类和定位在许多生态学研究中是必不可少的,包括密度估计和行为研究。实时声学处理也可用于缓解活动,有可能在有动物在场时减少有害的人类活动。动物的声音变化很大,没有单一的检测算法可以检测所有的声音类型。在验证算法性能和处理意外噪声源(如在现实世界中经常遇到的噪声源)时,通常需要人在循环分析。PAMGuard软件结合了先进的自动分析算法,包括人工智能方法,以及交互式可视化工具,允许用户开发实时使用和处理存档数据集的高效工作流程。模块化框架使用户能够配置多个检测器、分类器和本地化器,适合特定应用程序中感兴趣的设备和物种。不同声音类型的多个检测器可以在同一数据上并发运行。可扩展的“插件”接口也使第三方能够独立开发在软件框架内运行的新模块。在这里,我们描述了软件的核心功能,使用实时和离线使用的工作流进行说明,并介绍了最新功能的更新。
{"title":"PAMGuard: Application software for passive acoustic detection, classification, and localisation of animal sounds.","authors":"Douglas Gillespie, Jamie Macaulay, Michael Oswald, Marie Roch","doi":"10.1121/10.0042245","DOIUrl":"https://doi.org/10.1121/10.0042245","url":null,"abstract":"<p><p>Detection, classification, and localisation of animal sounds are essential in many ecological studies, including density estimation and behavioural studies. Real-time acoustic processing can also be used in mitigation exercises, with the possibility of curtailing harmful human activities when animals are present. Animal vocalisations vary widely, and there is no single detection algorithm that can robustly detect all sound types. Human-in-the loop analysis is often required to validate algorithm performance and deal with unexpected noise sources such as are often encountered in real-world situations. The PAMGuard software combines advanced automatic analysis algorithms, including AI methods, with interactive visual tools allowing users to develop efficient workflows for both real-time use and for processing archived datasets. A modular framework enables users to configure multiple detectors, classifiers, and localisers suitable for the equipment and species of interest in a particular application. Multiple detectors for different sound types can be run concurrently on the same data. An extensible \"plug-in\" interface also makes it possible for third parties to independently develop new modules to run within the software framework. Here, we describe the software's core functionality, illustrated using workflows for both real-time and offline use, and present an update on the latest features.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"437-443"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing deep-sea communication via time-reversal equalization in reliable acoustic path channels. 在可靠声径信道中利用时间反转均衡增强深海通信。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042253
Yifan Zhou, Shiliang Fang, Liang An, Ruixin Nie

Reliable acoustic path (RAP) channels support stable long-range propagation, but deep-sea communication over RAP is constrained by strong multipath. South China Sea measurements reveal stable arrival structures across multiple source depths, with distinct arrival branches and reverberation tails from scattering at interface inhomogeneities. A RAP-adaptive time-reversal equalizer suppresses multipath by reconstructing the channel impulse response via physics-guided statistical fitting, modeling it as a superposition of discrete multipaths and reverberation. Performance is evaluated using frequency-hopping spread spectrum M-ary frequency-shift keying and direct-sequence spread spectrum M-ary phase-shift keying and quantified with a network-level throughput analysis. Experiments demonstrate reduced inter-symbol interference and improved link reliability, indicating RAP-adaptive time-reversal equalizer as a practical physical-layer method for deep-sea acoustic networks.

可靠声径(RAP)信道支持稳定的远程传播,但基于可靠声径的深海通信受到强多径的限制。南海测量揭示了多个震源深度的稳定到达结构,具有明显的到达分支和界面不均匀散射产生的混响尾。rap自适应时间反转均衡器通过物理引导的统计拟合重建信道脉冲响应来抑制多径,将其建模为离散多径和混响的叠加。使用跳频扩频移频键控和直接序列扩频移相键控来评估性能,并通过网络级吞吐量分析进行量化。实验结果表明,rapp自适应时间反转均衡器可以减少符号间干扰,提高链路可靠性,是一种实用的深海水声网络物理层方法。
{"title":"Enhancing deep-sea communication via time-reversal equalization in reliable acoustic path channels.","authors":"Yifan Zhou, Shiliang Fang, Liang An, Ruixin Nie","doi":"10.1121/10.0042253","DOIUrl":"https://doi.org/10.1121/10.0042253","url":null,"abstract":"<p><p>Reliable acoustic path (RAP) channels support stable long-range propagation, but deep-sea communication over RAP is constrained by strong multipath. South China Sea measurements reveal stable arrival structures across multiple source depths, with distinct arrival branches and reverberation tails from scattering at interface inhomogeneities. A RAP-adaptive time-reversal equalizer suppresses multipath by reconstructing the channel impulse response via physics-guided statistical fitting, modeling it as a superposition of discrete multipaths and reverberation. Performance is evaluated using frequency-hopping spread spectrum M-ary frequency-shift keying and direct-sequence spread spectrum M-ary phase-shift keying and quantified with a network-level throughput analysis. Experiments demonstrate reduced inter-symbol interference and improved link reliability, indicating RAP-adaptive time-reversal equalizer as a practical physical-layer method for deep-sea acoustic networks.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"685-701"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146030098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the effects of vertical misalignment and stiffness asymmetry on phonation in a synthetic vocal fold model. 在合成声带模型中研究垂直错位和刚度不对称对发声的影响。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042273
Md Roknujjaman, Molly E Stewart, Byron D Erath

Blunt force trauma to the larynx can cause significant damage, resulting in displaced laryngeal cartilage fractures. Vertical misalignment of the left or right vocal fold (VF) in the inferior-superior direction and scarring of the VF tissue are common outcomes. The influence of inferior-superior VF displacement and VF scarring on phonation was investigated using synthetic, self-oscillating VF models in a physiologically-representative facility. Acoustic, kinematic, and aerodynamic parameters were assessed as a function of inferior-superior vertical displacement and asymmetric VF stiffness. The combination of vertical misalignment and asymmetric VF tissue stiffness became most prominent when the inferior-superior misalignment of the VFs exceeded the thickness of the medial surface. Only a small degree of stiffness asymmetry was tolerated before VF kinematics and acoustics were significantly degraded. The position of the scarred VF relative to the healthy one also influenced outcomes. If the stiffer VF was positioned inferior to the normal VF, phonatory outcomes were poorer than when it was positioned superior to the normal VF. Measures of shimmer and jitter were more than twice as high, while cepstral peak prominence was 3-5 dB lower.

钝力对喉部的创伤会造成严重的损伤,导致喉部软骨移位骨折。左或右声带(VF)在上下方向的垂直错位和VF组织的瘢痕是常见的结果。在一个具有生理代表性的设施中,利用合成的自振荡VF模型研究了上下位VF位移和VF疤痕对发声的影响。声学、运动学和气动参数被评估为上下垂直位移和不对称VF刚度的函数。纵向错位和不对称的VF组织刚度组合在VF上下错位超过内表面厚度时最为突出。在VF运动学和声学显著退化之前,只有很小程度的刚度不对称是可以容忍的。结疤的VF相对于健康的VF的位置也影响结果。如果较硬的VF位于正常VF的下方,发音结果比位于正常VF的上方时差。闪烁和抖动的测量值是原来的两倍多,而倒谱峰突出值则低3-5分贝。
{"title":"Investigating the effects of vertical misalignment and stiffness asymmetry on phonation in a synthetic vocal fold model.","authors":"Md Roknujjaman, Molly E Stewart, Byron D Erath","doi":"10.1121/10.0042273","DOIUrl":"10.1121/10.0042273","url":null,"abstract":"<p><p>Blunt force trauma to the larynx can cause significant damage, resulting in displaced laryngeal cartilage fractures. Vertical misalignment of the left or right vocal fold (VF) in the inferior-superior direction and scarring of the VF tissue are common outcomes. The influence of inferior-superior VF displacement and VF scarring on phonation was investigated using synthetic, self-oscillating VF models in a physiologically-representative facility. Acoustic, kinematic, and aerodynamic parameters were assessed as a function of inferior-superior vertical displacement and asymmetric VF stiffness. The combination of vertical misalignment and asymmetric VF tissue stiffness became most prominent when the inferior-superior misalignment of the VFs exceeded the thickness of the medial surface. Only a small degree of stiffness asymmetry was tolerated before VF kinematics and acoustics were significantly degraded. The position of the scarred VF relative to the healthy one also influenced outcomes. If the stiffer VF was positioned inferior to the normal VF, phonatory outcomes were poorer than when it was positioned superior to the normal VF. Measures of shimmer and jitter were more than twice as high, while cepstral peak prominence was 3-5 dB lower.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"715-726"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12841898/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146052937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-based bubble separation for passive acoustic monitoring of underwater gas plumesa). 基于深度学习的水下气体羽流被动声监测气泡分离。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042232
Shuduo Liu, Ben Liu, Mengran Du, Chenguang Yang, Wen Xu

Passive acoustic monitoring (PAM) techniques have shown great potential in studying underwater gas plumes by leveraging bubble resonance signals. Traditional bubble detection methods generally operate on the mixture of bubble and ambient sounds, which cannot achieve satisfactory performance in low SNR environments. In this study, we propose a deep learning (DL) based bubble sound separation method to extract the bubble waveform from the noisy mixture prior to detection, thereby enhancing the bubble detection performance. To obtain the labeled training data, we developed a numerical simulation framework based on bubble acoustic theories to generate the ground truth bubble sounds, which are then mixed with diverse noises. Experiments were conducted with both simulated data and realistic PAM recordings. The simulation experiments under different noise conditions demonstrate that the DL models can effectively extract bubble sound, even when their features are barely visible in the time-frequency domain. In the real-world experiment, the trained model was applied to the PAM recordings collected in Haima cold seep, and we found a negative correlation between bubble release rate and ambient pressure when the hydrophones were near the gas plumes, which is in accordance with existing literature and further validates the proposed method's effectiveness.

被动声监测技术在利用气泡共振信号研究水下气体羽流方面显示出巨大的潜力。传统的气泡检测方法一般是对气泡和环境声音的混合进行检测,在低信噪比环境下无法达到令人满意的检测效果。在本研究中,我们提出了一种基于深度学习(DL)的气泡声分离方法,在检测前从噪声混合物中提取气泡波形,从而提高气泡检测性能。为了获得标记的训练数据,我们开发了一个基于气泡声学理论的数值模拟框架,生成地面真实气泡声,然后将其与各种噪声混合。实验采用模拟数据和真实PAM记录进行。在不同噪声条件下的仿真实验表明,DL模型可以有效地提取气泡声,即使其特征在时频域中几乎不可见。在实际实验中,将训练好的模型应用于海马冷渗的PAM记录,发现当水听器靠近气体羽流时,气泡释放速率与环境压力呈负相关,这与已有文献一致,进一步验证了所提方法的有效性。
{"title":"Deep learning-based bubble separation for passive acoustic monitoring of underwater gas plumesa).","authors":"Shuduo Liu, Ben Liu, Mengran Du, Chenguang Yang, Wen Xu","doi":"10.1121/10.0042232","DOIUrl":"https://doi.org/10.1121/10.0042232","url":null,"abstract":"<p><p>Passive acoustic monitoring (PAM) techniques have shown great potential in studying underwater gas plumes by leveraging bubble resonance signals. Traditional bubble detection methods generally operate on the mixture of bubble and ambient sounds, which cannot achieve satisfactory performance in low SNR environments. In this study, we propose a deep learning (DL) based bubble sound separation method to extract the bubble waveform from the noisy mixture prior to detection, thereby enhancing the bubble detection performance. To obtain the labeled training data, we developed a numerical simulation framework based on bubble acoustic theories to generate the ground truth bubble sounds, which are then mixed with diverse noises. Experiments were conducted with both simulated data and realistic PAM recordings. The simulation experiments under different noise conditions demonstrate that the DL models can effectively extract bubble sound, even when their features are barely visible in the time-frequency domain. In the real-world experiment, the trained model was applied to the PAM recordings collected in Haima cold seep, and we found a negative correlation between bubble release rate and ambient pressure when the hydrophones were near the gas plumes, which is in accordance with existing literature and further validates the proposed method's effectiveness.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"816-832"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146064420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The study on the design and performance analysis of acoustic metamaterial lens. 声学超材料透镜的设计与性能分析研究。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042194
Hua-Wei Ji, Li-Ming Lin, Jiang-Hai Wang, Di-Wei Xiong, Chong-Jin Du

Acoustic lens focusing is a commonly used method in high-intensity focused ultrasound (HIFU). However, traditional acoustic lens focusing suffers from low focusing efficiency and excessive sidelobes, which affect the efficacy and safety of HIFU treatment. To address this issue, this paper designs a periodic trapezoidal‑groove acoustic metasurface lens by leveraging the extraordinary acoustic transmission effect. Subsequently, its focal sound‑pressure level is calculated through theoretical analysis and finite‑element simulation, and is further validated experimentally. Finally, the influence of structural parameters-such as the period, center width, depth, and taper angle of the trapezoidal groove, as well as the amplitude of the excitation source-on the focusing performance of the acoustic metasurface lens is systematically analyzed. The results demonstrate that the periodic trapezoidal‑groove acoustic metasurface lens can further enhance focusing and suppress sidelobes within a specific frequency range; the frequency corresponding to the maximum sound pressure is determined by the period of the trapezoidal groove; and the shift of Wood's anomaly frequency is primarily governed by the groove depth. This study provides insights for the development of high‑performance acoustic‑lens-focused ultrasound transducers.

声透镜聚焦是高强度聚焦超声(HIFU)中常用的一种方法。然而,传统声透镜聚焦存在聚焦效率低、副瓣过多等问题,影响了HIFU治疗的有效性和安全性。为了解决这一问题,本文利用超常的声传输效应,设计了一种周期梯形槽声学超表面透镜。随后,通过理论分析和有限元模拟计算了其震源声压级,并进行了实验验证。最后,系统分析了梯形槽周期、中心宽度、深度、锥角以及激励源振幅等结构参数对声超表面透镜聚焦性能的影响。结果表明:周期梯形槽声学超表面透镜在一定频率范围内可以进一步增强聚焦,抑制副瓣;最大声压所对应的频率由梯形槽的周期决定;Wood异常频率的移动主要受槽深的控制。该研究为高性能声透镜聚焦超声换能器的开发提供了见解。
{"title":"The study on the design and performance analysis of acoustic metamaterial lens.","authors":"Hua-Wei Ji, Li-Ming Lin, Jiang-Hai Wang, Di-Wei Xiong, Chong-Jin Du","doi":"10.1121/10.0042194","DOIUrl":"https://doi.org/10.1121/10.0042194","url":null,"abstract":"<p><p>Acoustic lens focusing is a commonly used method in high-intensity focused ultrasound (HIFU). However, traditional acoustic lens focusing suffers from low focusing efficiency and excessive sidelobes, which affect the efficacy and safety of HIFU treatment. To address this issue, this paper designs a periodic trapezoidal‑groove acoustic metasurface lens by leveraging the extraordinary acoustic transmission effect. Subsequently, its focal sound‑pressure level is calculated through theoretical analysis and finite‑element simulation, and is further validated experimentally. Finally, the influence of structural parameters-such as the period, center width, depth, and taper angle of the trapezoidal groove, as well as the amplitude of the excitation source-on the focusing performance of the acoustic metasurface lens is systematically analyzed. The results demonstrate that the periodic trapezoidal‑groove acoustic metasurface lens can further enhance focusing and suppress sidelobes within a specific frequency range; the frequency corresponding to the maximum sound pressure is determined by the period of the trapezoidal groove; and the shift of Wood's anomaly frequency is primarily governed by the groove depth. This study provides insights for the development of high‑performance acoustic‑lens-focused ultrasound transducers.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"234-246"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145933942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparisons of air and bone conduction transfer properties utilizing stimulus frequency otoacoustic emissions. 利用刺激频率耳声发射的空气和骨传导转移特性的比较。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042219
Jie Wang, Zhuoran Shi, Shengjian Wu, Stefan Stenfelt, Jinqiu Sang, Xiaodong Li, Chengshi Zheng

Otoacoustic emissions represent cochlear responses to auditory stimuli, enabling the investigation of air conduction (AC) and bone conduction (BC) transmission. This study developed and validated a non-invasive, objective method for measuring the sensitivity difference between AC and BC transmission, here termed bone-air difference transfer property (BADTP), using stimulus frequency otoacoustic emission (SFOAE). The BADTP was defined as the difference between the AC transfer property and the BC transfer property. To cross-validate the objective approach, BADTP was compared with subjectively obtained hearing thresholds. Measurements were conducted across frequencies from 1000 to 4000 Hz in ten individuals with normal hearing. Results revealed that the mean differences between the two methods were within 2 dB at frequencies from 1000 to 1600 Hz, while both methods showed similar trends from 1850 to 4000 Hz. The proposed SFOAE-based method for measuring provides valuable insight into BC transmission, with potential applications for objective assessment of BC function in research settings.

耳声发射代表耳蜗对听觉刺激的反应,使研究空气传导(AC)和骨传导(BC)传输成为可能。本研究开发并验证了一种非侵入性、客观的方法,用于测量交流和BC传输之间的灵敏度差异,这里称为骨-气差异传输特性(BADTP),使用刺激频率耳声发射(SFOAE)。BADTP被定义为AC传输属性和BC传输属性之间的差值。为了交叉验证客观方法,将BADTP与主观获得的听力阈值进行比较。对10名听力正常的人进行了1000到4000赫兹的频率测量。结果表明,在1000 ~ 1600 Hz范围内,两种方法的平均差异在2 dB以内,而在1850 ~ 4000 Hz范围内,两种方法的趋势相似。提出的基于sfoae的测量方法提供了对BC传播的有价值的见解,具有在研究环境中客观评估BC功能的潜在应用。
{"title":"Comparisons of air and bone conduction transfer properties utilizing stimulus frequency otoacoustic emissions.","authors":"Jie Wang, Zhuoran Shi, Shengjian Wu, Stefan Stenfelt, Jinqiu Sang, Xiaodong Li, Chengshi Zheng","doi":"10.1121/10.0042219","DOIUrl":"https://doi.org/10.1121/10.0042219","url":null,"abstract":"<p><p>Otoacoustic emissions represent cochlear responses to auditory stimuli, enabling the investigation of air conduction (AC) and bone conduction (BC) transmission. This study developed and validated a non-invasive, objective method for measuring the sensitivity difference between AC and BC transmission, here termed bone-air difference transfer property (BADTP), using stimulus frequency otoacoustic emission (SFOAE). The BADTP was defined as the difference between the AC transfer property and the BC transfer property. To cross-validate the objective approach, BADTP was compared with subjectively obtained hearing thresholds. Measurements were conducted across frequencies from 1000 to 4000 Hz in ten individuals with normal hearing. Results revealed that the mean differences between the two methods were within 2 dB at frequencies from 1000 to 1600 Hz, while both methods showed similar trends from 1850 to 4000 Hz. The proposed SFOAE-based method for measuring provides valuable insight into BC transmission, with potential applications for objective assessment of BC function in research settings.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"315-326"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geoacoustic inversion of layered seabed using near-field pulse signals based on deep learning methods. 基于深度学习方法的近场脉冲信号层状海底地球声反演。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0041917
Yuxuan Ma, Li He, Zhaohui Peng, Wenyu Luo, Jixing Qin, Lixin Wu, Shuanglin Wu, Qingqing Zhang

The acoustic reflection coefficient of a layered seabed exhibits an oscillatory structure that varies with frequency and grazing angle. These oscillatory characteristics are linked to the seabed's stratification and its geoacoustic parameters. Through numerical simulation, the influence of seabed layering and geoacoustic parameters on the frequency-angle oscillatory structure of the bottom reflection coefficient (BRC) is investigated. Based on this, a deep learning geoacoustic inversion method is introduced for retrieving near-field seabed layering and its geoacoustic parameters from the frequency- and angle-dependent bottom reflection coefficient. A deep neural network model based on self-attention and cross-attention mechanism, Cross-ViT, is employed to learn features from the two-dimensional BRC matrix. The model is trained to perform the inversion of geoacoustic parameters using a multi-task learning strategy with gradient normalization (GradNorm). Simulation results indicate that, compared to convolutional neural network and transformer models, the presented model more effectively learns the mapping between seabed reflection characteristics and multiple geoacoustic parameters and possesses relatively strong noise robustness. The method's effectiveness is validated using near-field acoustic data from an acoustic inversion experiment conducted on the northern continental shelf of the South China Sea in 2022.

层状海床的声反射系数随频率和掠掠角的变化呈现振荡结构。这些振荡特征与海底分层及其地声参数有关。通过数值模拟研究了海底分层和地声参数对海底反射系数(BRC)频角振荡结构的影响。在此基础上,提出了一种基于频率和角度相关的海底反射系数反演近场海底分层及其地声参数的深度学习反演方法。采用基于自注意和交叉注意机制的深度神经网络模型Cross-ViT从二维BRC矩阵中学习特征。该模型被训练成使用梯度归一化的多任务学习策略(GradNorm)来执行地球声学参数的反演。仿真结果表明,与卷积神经网络和变压器模型相比,该模型能更有效地学习海底反射特征与多种地声参数之间的映射关系,具有较强的噪声鲁棒性。利用2022年在南海北部大陆架进行的近场声波反演实验数据验证了该方法的有效性。
{"title":"Geoacoustic inversion of layered seabed using near-field pulse signals based on deep learning methods.","authors":"Yuxuan Ma, Li He, Zhaohui Peng, Wenyu Luo, Jixing Qin, Lixin Wu, Shuanglin Wu, Qingqing Zhang","doi":"10.1121/10.0041917","DOIUrl":"https://doi.org/10.1121/10.0041917","url":null,"abstract":"<p><p>The acoustic reflection coefficient of a layered seabed exhibits an oscillatory structure that varies with frequency and grazing angle. These oscillatory characteristics are linked to the seabed's stratification and its geoacoustic parameters. Through numerical simulation, the influence of seabed layering and geoacoustic parameters on the frequency-angle oscillatory structure of the bottom reflection coefficient (BRC) is investigated. Based on this, a deep learning geoacoustic inversion method is introduced for retrieving near-field seabed layering and its geoacoustic parameters from the frequency- and angle-dependent bottom reflection coefficient. A deep neural network model based on self-attention and cross-attention mechanism, Cross-ViT, is employed to learn features from the two-dimensional BRC matrix. The model is trained to perform the inversion of geoacoustic parameters using a multi-task learning strategy with gradient normalization (GradNorm). Simulation results indicate that, compared to convolutional neural network and transformer models, the presented model more effectively learns the mapping between seabed reflection characteristics and multiple geoacoustic parameters and possesses relatively strong noise robustness. The method's effectiveness is validated using near-field acoustic data from an acoustic inversion experiment conducted on the northern continental shelf of the South China Sea in 2022.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"505-521"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-stage representation learning for blind Room-Acoustic parameter estimation with uncertainty quantification. 基于多阶段表征学习的盲室声学参数估计。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0042193
Philipp Götz, Cagdas Tuna, Andreas Brendel, Andreas Walther, Emanuël A P Habets

The ability to infer a general representation of the acoustic environment from a reverberant recording is a key objective in numerous applications. We propose a multi-stage approach that integrates task-agnostic representation learning with uncertainty quantification. Leveraging the conformal prediction framework, our method models the error incurred in the estimation of the acoustic environment embedded in a reverberant recording, which reflects the ambiguity inherent in distinguishing between an unknown source signal and the induced reverberation. Although our approach is flexible and agnostic to specific downstream objectives, experiments on real-world data demonstrate competitive performance on established parameter estimation tasks when compared to baselines trained end-to-end or with contrastive losses. Furthermore, a latent disentanglement analysis reveals the interpretability of the learned representations, which effectively capture distinct factors of variation within the acoustic environment.

从混响录音中推断声环境的一般表示的能力在许多应用中是一个关键目标。我们提出了一种将任务不可知表征学习与不确定性量化相结合的多阶段方法。利用保形预测框架,我们的方法模拟了混响记录中嵌入的声环境估计中产生的误差,这反映了在区分未知源信号和诱导混响时固有的模糊性。尽管我们的方法是灵活的,对特定的下游目标是不可知的,但在现实世界数据上的实验表明,与端到端训练基线或对比损失相比,在已建立的参数估计任务上具有竞争力。此外,潜在解纠缠分析揭示了学习表征的可解释性,它有效地捕获了声环境中不同的变化因素。
{"title":"Multi-stage representation learning for blind Room-Acoustic parameter estimation with uncertainty quantification.","authors":"Philipp Götz, Cagdas Tuna, Andreas Brendel, Andreas Walther, Emanuël A P Habets","doi":"10.1121/10.0042193","DOIUrl":"https://doi.org/10.1121/10.0042193","url":null,"abstract":"<p><p>The ability to infer a general representation of the acoustic environment from a reverberant recording is a key objective in numerous applications. We propose a multi-stage approach that integrates task-agnostic representation learning with uncertainty quantification. Leveraging the conformal prediction framework, our method models the error incurred in the estimation of the acoustic environment embedded in a reverberant recording, which reflects the ambiguity inherent in distinguishing between an unknown source signal and the induced reverberation. Although our approach is flexible and agnostic to specific downstream objectives, experiments on real-world data demonstrate competitive performance on established parameter estimation tasks when compared to baselines trained end-to-end or with contrastive losses. Furthermore, a latent disentanglement analysis reveals the interpretability of the learned representations, which effectively capture distinct factors of variation within the acoustic environment.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"247-259"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145952471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of eco-friendly spiral-shaped sound absorber made from handcrafted fibrous paper enhanced with spent coffee waste for broadband noise control. 用咖啡渣增强手工纤维纸制成的环保螺旋形吸声器,用于宽带噪声控制。
IF 2.3 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2026-01-01 DOI: 10.1121/10.0041885
Jie Jin, Yunle Cao, Haipeng Hao, Yecheng Feng, Daitong Wei, Zhuqing Zhang

To address the non-degradability and toxicity of conventional acoustic materials, this study proposes a sustainable spiral-shaped sound absorber composed of plant fiber-based fibrous paper and recycled coffee waste (CW). The strong mechanical bonding between CW and Kozo fibrous paper in this composite acoustic material was observed using metallurgical microscopy, resulting in an environmentally friendly structure capable of controlling broadband noise. A prediction model based on parallel-slit theory was developed to evaluate the influence of key structural parameters-CW layer mass density, fibrous paper length, and absorber width-on sound absorption coefficients. Optimization reveals that wide spiral-shaped geometry paired with a high-density CW layer (0.04-0.05 kg/m2) enhances low-frequency noise reduction (<1000 Hz), whereas narrow configurations with a medium-density CW layer (0.03-0.04 kg/m2) improves high-frequency attenuation (>2000 Hz). The sound absorption coefficients of five prepared samples were measured using the two-microphone impedance tube method. The sound absorption coefficient showed significant improvement with the addition of an appropriate amount of CW in the mid- and high-frequency range. This work advances the development of lightweight, efficient, and sustainable acoustic solutions, providing a scalable strategy for the next generation of eco-friendly materials in line with circular economy principles and low-carbon manufacturing practices.

为了解决传统声学材料的不可降解性和毒性问题,本研究提出了一种可持续的螺旋形吸声器,该吸声器由植物纤维基纤维纸和再生咖啡废料(CW)组成。通过金相显微镜观察到这种复合声学材料中CW和Kozo纤维纸之间的强机械结合,从而形成了一种能够控制宽带噪声的环保结构。建立了基于平行缝理论的连续波层质量密度、纤维纸长度和吸声器宽度等关键结构参数对吸声系数的影响预测模型。优化结果表明,宽螺旋形几何结构与高密度连续波层(0.04-0.05 kg/m2)相匹配,可以增强低频降噪(2000 Hz)。采用双传声器阻抗管法测量了5种制备样品的吸声系数。在中频和高频范围内加入适量的连续波,吸声系数有显著提高。这项工作推动了轻质、高效和可持续声学解决方案的发展,为符合循环经济原则和低碳制造实践的下一代环保材料提供了可扩展的策略。
{"title":"Development of eco-friendly spiral-shaped sound absorber made from handcrafted fibrous paper enhanced with spent coffee waste for broadband noise control.","authors":"Jie Jin, Yunle Cao, Haipeng Hao, Yecheng Feng, Daitong Wei, Zhuqing Zhang","doi":"10.1121/10.0041885","DOIUrl":"https://doi.org/10.1121/10.0041885","url":null,"abstract":"<p><p>To address the non-degradability and toxicity of conventional acoustic materials, this study proposes a sustainable spiral-shaped sound absorber composed of plant fiber-based fibrous paper and recycled coffee waste (CW). The strong mechanical bonding between CW and Kozo fibrous paper in this composite acoustic material was observed using metallurgical microscopy, resulting in an environmentally friendly structure capable of controlling broadband noise. A prediction model based on parallel-slit theory was developed to evaluate the influence of key structural parameters-CW layer mass density, fibrous paper length, and absorber width-on sound absorption coefficients. Optimization reveals that wide spiral-shaped geometry paired with a high-density CW layer (0.04-0.05 kg/m2) enhances low-frequency noise reduction (<1000 Hz), whereas narrow configurations with a medium-density CW layer (0.03-0.04 kg/m2) improves high-frequency attenuation (>2000 Hz). The sound absorption coefficients of five prepared samples were measured using the two-microphone impedance tube method. The sound absorption coefficient showed significant improvement with the addition of an appropriate amount of CW in the mid- and high-frequency range. This work advances the development of lightweight, efficient, and sustainable acoustic solutions, providing a scalable strategy for the next generation of eco-friendly materials in line with circular economy principles and low-carbon manufacturing practices.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"159 1","pages":"260-271"},"PeriodicalIF":2.3,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145952421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Acoustical Society of America
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1