首页 > 最新文献

Journal of the Acoustical Society of America最新文献

英文 中文
Perception of Xiapu Min checked syllables and tones in citation and sandhi formsa). “下普闵”在引证和变调中音节和声调的辨析)。
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034785
Yuan Chai, Shihong Ye

This study investigates the acoustic cues for listeners to differentiate checked syllables and tones from unchecked ones. In Xiapu Min, checked and unchecked syllables and tones differ in f0, glottalization, and duration, whereas these differences are reduced in their sandhi forms. In citation forms, listeners utilize all three cues while relying on duration the most. The results indicate that duration is an independent perceptual cue for checked syllables and tones, rather than a peripheral cue resulting from the syllable structure of /CVʔ/. In sandhi forms, where checked and unchecked syllables and tones are phonologically neutralized, the duration and f0 still influence listeners' perception of checked constituents significantly. Data from Xiapu Min, along with other languages, illustrate that cues consistently found in the production of checked syllables and tones are likely to be utilized in their perception.

本研究探讨了听者区分选中音节和未选中音节和音调的声学线索。在《霞浦闵》中,检查音节和检查音节在音高、声母化和音长等方面存在差异,而这些差异在连读形式中有所减少。在引用形式中,听众利用这三种线索,而最依赖于持续时间。结果表明,持续时间是对音节和音调的独立感知线索,而不是由/CV的音节结构产生的外围线索。在连读形式中,检查和未检查的音节和音调在语音上是中立的,持续时间和f0仍然显著影响听者对检查成分的感知。来自小浦闵和其他语言的数据表明,在核对音节和音调的产生中始终发现的线索很可能被用于他们的感知。
{"title":"Perception of Xiapu Min checked syllables and tones in citation and sandhi formsa).","authors":"Yuan Chai, Shihong Ye","doi":"10.1121/10.0034785","DOIUrl":"https://doi.org/10.1121/10.0034785","url":null,"abstract":"<p><p>This study investigates the acoustic cues for listeners to differentiate checked syllables and tones from unchecked ones. In Xiapu Min, checked and unchecked syllables and tones differ in f0, glottalization, and duration, whereas these differences are reduced in their sandhi forms. In citation forms, listeners utilize all three cues while relying on duration the most. The results indicate that duration is an independent perceptual cue for checked syllables and tones, rather than a peripheral cue resulting from the syllable structure of /CVʔ/. In sandhi forms, where checked and unchecked syllables and tones are phonologically neutralized, the duration and f0 still influence listeners' perception of checked constituents significantly. Data from Xiapu Min, along with other languages, illustrate that cues consistently found in the production of checked syllables and tones are likely to be utilized in their perception.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"70-83"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142950691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effects of leading-edge serration shape on noise reduction in rod-airfoil interactions. 前缘锯齿形对杆-翼型相互作用降噪的影响。
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034845
Fu-Yang Yu, Zhen-Hua Wan, Ya-Sen Hu, De-Jun Sun, Xi-Yun Lu

Leading-edge serrations inspired by owls exhibit the capability to control airfoil-turbulence interaction noise, but the design principle of the serration shape is still an open issue. To this end, we designed five types of serration shapes with different combinations of curvature, namely, triangular, ogee, anti-ogee, feather-like, and anti-feather-like. These curves are applied to serrated modifications with different bluntness levels (sharp or blunt) and amplitudes (0.05, 0.075, and 0.1 chord length). Considering these serration shapes, 30 cases with various curved types, bluntness levels, and amplitudes are investigated using compressible large-eddy simulation and the acoustic analogy of Ffowcs-Williams and Hawkings on a rod-airfoil configuration. The outcomes reveal a general trend where increased amplitude and blunted serrations are more effective in noise mitigation. Notably, the blunt feather-like (FB) serrations demonstrate the maximum noise reduction capacity across all amplitude levels, decreasing the overall sound power level by up to 2.1 dB. Through multi-process acoustic analysis, source characteristics responsible for generating noise are diagnosed. It is found that noise reduction primarily stems from the change in the source distribution and destructive interference among sound sources, consistent with prior studies. Generally, the serration shape would significantly affect the source distribution and sound interference without altering the fundamental noise reduction mechanisms. The FB shape exhibits the highest concentration of sources at its peaks and roots among all shapes. The presence of concentrated sources in these locations enhances destructive interference, effectively reducing noise emissions. The superior noise-reduction feature of FB serrations should be attributed to both the concentration of sources and the destructive interference. This extensive examination underscores the importance of serration design, especially the potential of FB serrations, in noise control strategies for rod-airfoil configurations, contributing to advancements in aeroacoustic engineering.

受猫头鹰启发的前缘锯齿显示出控制翼型湍流相互作用噪音的能力,但锯齿形状的设计原则仍然是一个开放的问题。为此,我们设计了五种不同曲率组合的锯齿形状,分别是三角形、ogee、反ogee、羽状、反羽状。这些曲线适用于具有不同钝度水平(尖锐或钝)和振幅(0.05,0.075和0.1弦长)的锯齿形修饰。考虑到这些锯齿形状,采用可压缩大涡模拟和Ffowcs-Williams和hawkins在杆翼型结构上的声学类比,研究了30种具有不同弯曲类型、钝度水平和振幅的情况。结果揭示了一个总的趋势,即增加振幅和钝化锯齿在降噪方面更有效。值得注意的是,钝羽状(FB)锯齿在所有振幅水平上都表现出最大的降噪能力,将总声功率级降低了2.1 dB。通过多过程声学分析,诊断出产生噪声的声源特性。发现降噪主要源于声源分布的改变和声源间的相消干扰,与前人的研究结果一致。一般来说,锯齿形状会显著影响声源分布和声干扰,但不会改变基本的降噪机制。在所有形状中,FB形状在其峰和根处的源浓度最高。在这些位置集中源的存在增强了破坏性干扰,有效地减少了噪声排放。FB锯齿具有优异的降噪性能,其原因应归结于源的集中和相消干涉。这项广泛的研究强调了锯齿形设计的重要性,特别是FB锯齿形的潜力,在杆翼型配置的噪声控制策略中,有助于航空声学工程的进步。
{"title":"Effects of leading-edge serration shape on noise reduction in rod-airfoil interactions.","authors":"Fu-Yang Yu, Zhen-Hua Wan, Ya-Sen Hu, De-Jun Sun, Xi-Yun Lu","doi":"10.1121/10.0034845","DOIUrl":"https://doi.org/10.1121/10.0034845","url":null,"abstract":"<p><p>Leading-edge serrations inspired by owls exhibit the capability to control airfoil-turbulence interaction noise, but the design principle of the serration shape is still an open issue. To this end, we designed five types of serration shapes with different combinations of curvature, namely, triangular, ogee, anti-ogee, feather-like, and anti-feather-like. These curves are applied to serrated modifications with different bluntness levels (sharp or blunt) and amplitudes (0.05, 0.075, and 0.1 chord length). Considering these serration shapes, 30 cases with various curved types, bluntness levels, and amplitudes are investigated using compressible large-eddy simulation and the acoustic analogy of Ffowcs-Williams and Hawkings on a rod-airfoil configuration. The outcomes reveal a general trend where increased amplitude and blunted serrations are more effective in noise mitigation. Notably, the blunt feather-like (FB) serrations demonstrate the maximum noise reduction capacity across all amplitude levels, decreasing the overall sound power level by up to 2.1 dB. Through multi-process acoustic analysis, source characteristics responsible for generating noise are diagnosed. It is found that noise reduction primarily stems from the change in the source distribution and destructive interference among sound sources, consistent with prior studies. Generally, the serration shape would significantly affect the source distribution and sound interference without altering the fundamental noise reduction mechanisms. The FB shape exhibits the highest concentration of sources at its peaks and roots among all shapes. The presence of concentrated sources in these locations enhances destructive interference, effectively reducing noise emissions. The superior noise-reduction feature of FB serrations should be attributed to both the concentration of sources and the destructive interference. This extensive examination underscores the importance of serration design, especially the potential of FB serrations, in noise control strategies for rod-airfoil configurations, contributing to advancements in aeroacoustic engineering.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"215-233"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142983964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum: Short-term noise annoyance towards drones and other transportation noise sources: A laboratory study [J. Acoust. Soc. Am. 156(4), 2578-2595 (2024)]. 对无人机及其他交通工具噪声源的短期噪声干扰研究[J]。Acoust。Soc。科学通报,2009(2):1 -2。
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034837
Claudia Kawai, Jonas Jäggi, Fotis Georgiou, Jonas Meister, Reto Pieren, Beat Schäffer
{"title":"Erratum: Short-term noise annoyance towards drones and other transportation noise sources: A laboratory study [J. Acoust. Soc. Am. 156(4), 2578-2595 (2024)].","authors":"Claudia Kawai, Jonas Jäggi, Fotis Georgiou, Jonas Meister, Reto Pieren, Beat Schäffer","doi":"10.1121/10.0034837","DOIUrl":"https://doi.org/10.1121/10.0034837","url":null,"abstract":"","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"288"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143007441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ultra-wide range control of topological acoustic waveguidesa). 拓扑声波导的超宽范围控制[j]。
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034839
Xiao Liang, Jiangxia Luo, Qiang Li, Yu Ye, Liang Shi, Zhongyuan Tang, Jingqiu Huang, Nan Li

Topological acoustic waveguides have a potential for applications in the precise transmission of sound. Currently, there is more attention to multi-band in this field. However, achieving tunability of the operating band is also of great significance. Different from previous studies, this paper proposes to replace the two-dimensional (2D) resonant cavity in the scatterer with an extended three-dimensional (3D) resonant cavity. In this way, a composite acoustic structure consisting of a 2D scatterer and a 3D resonant cavity is constructed. By controlling the position of the bottom of the resonant cavity, the length of the resonant cavities can be freely controlled. In this way, it is possible to achieve continuous control of the operating frequency band by a very simple mechanical method without changing the initial structure. The control range can reach nearly 6 kHz. This paper also proposes a parallel resonance mechanism that can increase the width of the bandgap by 50%. Simulation results show that this method does not affect the topological phase transition of the structure. In the transmission channel formed by two different topological phase interfaces of this topological acoustic waveguide, the acoustic wave has a high-precision unidirectional transmission characteristic that is immune to backscattering. This study provides a reliable solution for an ultra-wide range of controllable acoustic topological components.

拓扑声波导在声音的精确传输方面具有潜在的应用前景。目前,多波段通信技术在该领域受到越来越多的关注。然而,实现工作频带的可调性也具有重要意义。与以往的研究不同,本文提出将散射体中的二维谐振腔替换为扩展的三维谐振腔。通过这种方法,构建了由二维散射体和三维谐振腔组成的复合声学结构。通过控制谐振腔底部的位置,可以自由控制谐振腔的长度。这样,就可以在不改变初始结构的情况下,通过非常简单的机械方法实现对工作频带的连续控制。控制范围可以达到近6 kHz。本文还提出了一种可使带隙宽度增加50%的并联谐振机制。仿真结果表明,该方法不影响结构的拓扑相变。在该拓扑声波导的两种不同拓扑相位界面形成的传输通道中,声波具有高精度的单向传输特性,不受后向散射的影响。该研究为超宽范围的可控声学拓扑元件提供了可靠的解决方案。
{"title":"Ultra-wide range control of topological acoustic waveguidesa).","authors":"Xiao Liang, Jiangxia Luo, Qiang Li, Yu Ye, Liang Shi, Zhongyuan Tang, Jingqiu Huang, Nan Li","doi":"10.1121/10.0034839","DOIUrl":"https://doi.org/10.1121/10.0034839","url":null,"abstract":"<p><p>Topological acoustic waveguides have a potential for applications in the precise transmission of sound. Currently, there is more attention to multi-band in this field. However, achieving tunability of the operating band is also of great significance. Different from previous studies, this paper proposes to replace the two-dimensional (2D) resonant cavity in the scatterer with an extended three-dimensional (3D) resonant cavity. In this way, a composite acoustic structure consisting of a 2D scatterer and a 3D resonant cavity is constructed. By controlling the position of the bottom of the resonant cavity, the length of the resonant cavities can be freely controlled. In this way, it is possible to achieve continuous control of the operating frequency band by a very simple mechanical method without changing the initial structure. The control range can reach nearly 6 kHz. This paper also proposes a parallel resonance mechanism that can increase the width of the bandgap by 50%. Simulation results show that this method does not affect the topological phase transition of the structure. In the transmission channel formed by two different topological phase interfaces of this topological acoustic waveguide, the acoustic wave has a high-precision unidirectional transmission characteristic that is immune to backscattering. This study provides a reliable solution for an ultra-wide range of controllable acoustic topological components.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"289-301"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143007458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mode-informed complex-valued neural processes for matched field processing.
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034856
Yining Liu, Wei Gao, Desheng Chen, Lijun Xu

A complex-valued neural process method, combined with modal depth functions (MDFs) of the ocean waveguide, is proposed to reconstruct the acoustic field. Neural networks are used to describe complex Gaussian processes, modeling the distribution of the acoustic field at different depths. The network parameters are optimized through a meta-learning strategy, preventing overfitting under small sample conditions (sample size equals the number of array elements) and mitigating the slow reconstruction speed of Gaussian processes (GPs), while denoising and interpolating sparsely distributed acoustic field data, generating dense field data for virtual receiver arrays. The predicted field is then integrated with the matched field processing (MFP) method for passive source localization. Validation on the SWellEx-96 waveguide shows significant improvements in localization performance and reduces sidelobes of ambiguity surface compared to traditional MFP and GP-based MFP. Moreover, the proposed kernel based on MDFs outperforms the Gaussian kernel in describing ocean waveguide characteristics. Because of the feature representation of multi-modal mapping, this kernel enhances acoustic field prediction performance and improves the accuracy and robustness of MFP. Simulated and real data are used to verify the validity.

{"title":"Mode-informed complex-valued neural processes for matched field processing.","authors":"Yining Liu, Wei Gao, Desheng Chen, Lijun Xu","doi":"10.1121/10.0034856","DOIUrl":"https://doi.org/10.1121/10.0034856","url":null,"abstract":"<p><p>A complex-valued neural process method, combined with modal depth functions (MDFs) of the ocean waveguide, is proposed to reconstruct the acoustic field. Neural networks are used to describe complex Gaussian processes, modeling the distribution of the acoustic field at different depths. The network parameters are optimized through a meta-learning strategy, preventing overfitting under small sample conditions (sample size equals the number of array elements) and mitigating the slow reconstruction speed of Gaussian processes (GPs), while denoising and interpolating sparsely distributed acoustic field data, generating dense field data for virtual receiver arrays. The predicted field is then integrated with the matched field processing (MFP) method for passive source localization. Validation on the SWellEx-96 waveguide shows significant improvements in localization performance and reduces sidelobes of ambiguity surface compared to traditional MFP and GP-based MFP. Moreover, the proposed kernel based on MDFs outperforms the Gaussian kernel in describing ocean waveguide characteristics. Because of the feature representation of multi-modal mapping, this kernel enhances acoustic field prediction performance and improves the accuracy and robustness of MFP. Simulated and real data are used to verify the validity.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"493-508"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143023741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A diffusion-based super resolution model for enhancing sonar images.
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034882
Oscar Bryan, Thibaud Berthomier, Benoit D'Ales, Thomas Furfaro, Tom S F Haines, Yan Pailhas, Alan Hunter

Improved hardware and processing techniques such as synthetic aperture sonar have led to imaging sonar with centimeter resolution. However, practical limitations and old systems limit the resolution in modern and legacy datasets. This study proposes using single image super resolution based on a conditioned diffusion model to map between images at different resolutions. This approach focuses on upscaling legacy, low-resolution sonar datasets to enable backward compatibility with newer, high-resolution datasets, thus creating a unified dataset for machine learning applications. The study demonstrates improved performance for classifying upscaled images without increasing the probability of false detection. The increased probability of detection was 7% compared to bicubic interpolation, 6% compared to convolutional neural networks, and 2% compared to generative adversarial networks. The study also proposes two sonar specific evaluation metrics based on acoustic physics and utility to automatic target recognition.

{"title":"A diffusion-based super resolution model for enhancing sonar images.","authors":"Oscar Bryan, Thibaud Berthomier, Benoit D'Ales, Thomas Furfaro, Tom S F Haines, Yan Pailhas, Alan Hunter","doi":"10.1121/10.0034882","DOIUrl":"https://doi.org/10.1121/10.0034882","url":null,"abstract":"<p><p>Improved hardware and processing techniques such as synthetic aperture sonar have led to imaging sonar with centimeter resolution. However, practical limitations and old systems limit the resolution in modern and legacy datasets. This study proposes using single image super resolution based on a conditioned diffusion model to map between images at different resolutions. This approach focuses on upscaling legacy, low-resolution sonar datasets to enable backward compatibility with newer, high-resolution datasets, thus creating a unified dataset for machine learning applications. The study demonstrates improved performance for classifying upscaled images without increasing the probability of false detection. The increased probability of detection was 7% compared to bicubic interpolation, 6% compared to convolutional neural networks, and 2% compared to generative adversarial networks. The study also proposes two sonar specific evaluation metrics based on acoustic physics and utility to automatic target recognition.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"509-518"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143023631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A regional road network traffic noise limit prediction method based on design elements.
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034866
Xiaoxia Wang, Junshan Lin, Hongjian Liang, Haibo Wang

Since traffic flow has not been generated, a traffic noise prediction model based on actual traffic state data cannot be directly applied to the planned road network. Therefore, a regional traffic noise prediction method is proposed to find the upper limit of network noise emission based on design elements. The model is developed with noise predictions of the basic road section, interrupted/continuous intersections, and regional network. Meanwhile, ranges of traffic flow speed and volume are inferred by design elements and constraints between road units are obeyed. A four-scenes experiment to verify the method's accuracy is organized and the average noise difference between the upper limit calculated value and maximum measurement value is 1.53 dBA. All noise differences are positive as the measured noise values may not reach the upper limit of network emission in the experimental state. The method is applied to a network under design elements, and the results show that the model is suitable for the predicting upper limits of noise under design constraints; under the same design elements, noise emission at interrupted intersections is higher than that at continuous intersections. The method can provide a theoretical and data basis for planning network noise protection.

{"title":"A regional road network traffic noise limit prediction method based on design elements.","authors":"Xiaoxia Wang, Junshan Lin, Hongjian Liang, Haibo Wang","doi":"10.1121/10.0034866","DOIUrl":"https://doi.org/10.1121/10.0034866","url":null,"abstract":"<p><p>Since traffic flow has not been generated, a traffic noise prediction model based on actual traffic state data cannot be directly applied to the planned road network. Therefore, a regional traffic noise prediction method is proposed to find the upper limit of network noise emission based on design elements. The model is developed with noise predictions of the basic road section, interrupted/continuous intersections, and regional network. Meanwhile, ranges of traffic flow speed and volume are inferred by design elements and constraints between road units are obeyed. A four-scenes experiment to verify the method's accuracy is organized and the average noise difference between the upper limit calculated value and maximum measurement value is 1.53 dBA. All noise differences are positive as the measured noise values may not reach the upper limit of network emission in the experimental state. The method is applied to a network under design elements, and the results show that the model is suitable for the predicting upper limits of noise under design constraints; under the same design elements, noise emission at interrupted intersections is higher than that at continuous intersections. The method can provide a theoretical and data basis for planning network noise protection.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"527-537"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143032265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural estimation of mutual information in speech signals processed by an auditory model. 听觉模型处理语音信号中互信息的神经估计。
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034854
Donghoon Shin, Hyung Soon Kim

The amount of information contained in speech signals is a fundamental concern of speech-based technologies and is particularly relevant in speech perception. Measuring the mutual information of actual speech signals is non-trivial, and quantitative measurements have not been extensively conducted to date. Recent advancements in machine learning have made it possible to directly measure mutual information using data. This study utilized neural estimators of mutual information to estimate the information content in speech signals. The high-dimensional speech signal was divided into segments and then compressed using Mel-scale filter bank, which approximates the non-linear frequency perception of the human ear. The filter bank outputs were then truncated based on the dynamic range of the auditory system. This data compression preserved a significant amount of information from the original high-dimensional speech signal. The amount of information varied, depending on the categories of the speech sounds, with relatively higher mutual information in vowels compared to consonants. Furthermore, the information available in the speech signals, as processed by the auditory model, decreased as the dynamic range was reduced.

语音信号中包含的信息量是基于语音的技术的一个基本问题,在语音感知中尤为重要。测量实际语音信号的互信息是非常重要的,迄今为止还没有广泛地进行定量测量。机器学习的最新进展使得使用数据直接测量相互信息成为可能。本研究利用互信息的神经估计器来估计语音信号中的信息含量。采用近似人耳非线性频率感知的Mel-scale滤波器组对高维语音信号进行分段压缩。然后根据听觉系统的动态范围截断滤波器组输出。这种数据压缩方法保留了原始高维语音信号中大量的信息。根据语音的类别,信息的数量有所不同,与辅音相比,元音中的相互信息相对较高。此外,听觉模型处理的语音信号中可用的信息随着动态范围的减小而减少。
{"title":"Neural estimation of mutual information in speech signals processed by an auditory model.","authors":"Donghoon Shin, Hyung Soon Kim","doi":"10.1121/10.0034854","DOIUrl":"https://doi.org/10.1121/10.0034854","url":null,"abstract":"<p><p>The amount of information contained in speech signals is a fundamental concern of speech-based technologies and is particularly relevant in speech perception. Measuring the mutual information of actual speech signals is non-trivial, and quantitative measurements have not been extensively conducted to date. Recent advancements in machine learning have made it possible to directly measure mutual information using data. This study utilized neural estimators of mutual information to estimate the information content in speech signals. The high-dimensional speech signal was divided into segments and then compressed using Mel-scale filter bank, which approximates the non-linear frequency perception of the human ear. The filter bank outputs were then truncated based on the dynamic range of the auditory system. This data compression preserved a significant amount of information from the original high-dimensional speech signal. The amount of information varied, depending on the categories of the speech sounds, with relatively higher mutual information in vowels compared to consonants. Furthermore, the information available in the speech signals, as processed by the auditory model, decreased as the dynamic range was reduced.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"355-368"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143007445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal patterns in Malaysian rainforest soundscapes demonstrated using acoustic indices and deep embeddings trained on time-of-day estimationa). 使用声学指数和在一天中时间估计训练的深度嵌入来证明马来西亚雨林声景观的时间模式(a)。
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034638
Yen Yi Loo, Mei Yi Lee, Samien Shaheed, Tomas Maul, Dena Jane Clink

Rapid urban development impacts the integrity of tropical ecosystems on broad spatiotemporal scales. However, sustained long-term monitoring poses significant challenges, particularly in tropical regions. In this context, ecoacoustics emerges as a promising approach to address this gap. Yet, harnessing insights from extensive acoustic datasets presents its own set of challenges, such as the time and expertise needed to label species information in recordings. Here, this study presents an approach to investigating soundscapes: the use of a deep neural network trained on time-of-day estimation. This research endeavors to (1) provide a qualitative analysis of the temporal variation (daily and monthly) of the soundscape using conventional ecoacoustic indices and deep ecoacoustic embeddings, (2) compare the predictive power of both methods for time-of-day estimation, and (3) compare the performance of both methods for supervised classification and unsupervised clustering to the specific recording site, habitat type, and season. The study's findings reveal that conventional acoustic indices and the proposed deep ecoacoustic embeddings approach exhibit overall comparable performance. This article concludes by discussing potential avenues for further refinement of the proposed method, which will further contribute to understanding of soundscape variation across time and space.

城市的快速发展在大时空尺度上影响着热带生态系统的完整性。然而,持续的长期监测带来了重大挑战,特别是在热带地区。在这种情况下,生态声学成为解决这一差距的一种有希望的方法。然而,利用来自广泛声学数据集的见解也带来了一系列挑战,例如在录音中标记物种信息所需的时间和专业知识。在这里,本研究提出了一种调查声景的方法:使用经过时间估计训练的深度神经网络。本研究试图(1)利用传统生态声学指数和深层生态声学嵌入对声景观的时间变化(日和月)进行定性分析;(2)比较两种方法在日时间估计方面的预测能力;(3)比较两种方法在监督分类和无监督聚类方面的表现,具体的记录地点、栖息地类型和季节。研究结果表明,传统的声学指标和提出的深层生态声学嵌入方法在总体上表现出可比性。本文最后讨论了进一步改进所提出方法的潜在途径,这将进一步有助于理解跨时间和空间的声景变化。
{"title":"Temporal patterns in Malaysian rainforest soundscapes demonstrated using acoustic indices and deep embeddings trained on time-of-day estimationa).","authors":"Yen Yi Loo, Mei Yi Lee, Samien Shaheed, Tomas Maul, Dena Jane Clink","doi":"10.1121/10.0034638","DOIUrl":"https://doi.org/10.1121/10.0034638","url":null,"abstract":"<p><p>Rapid urban development impacts the integrity of tropical ecosystems on broad spatiotemporal scales. However, sustained long-term monitoring poses significant challenges, particularly in tropical regions. In this context, ecoacoustics emerges as a promising approach to address this gap. Yet, harnessing insights from extensive acoustic datasets presents its own set of challenges, such as the time and expertise needed to label species information in recordings. Here, this study presents an approach to investigating soundscapes: the use of a deep neural network trained on time-of-day estimation. This research endeavors to (1) provide a qualitative analysis of the temporal variation (daily and monthly) of the soundscape using conventional ecoacoustic indices and deep ecoacoustic embeddings, (2) compare the predictive power of both methods for time-of-day estimation, and (3) compare the performance of both methods for supervised classification and unsupervised clustering to the specific recording site, habitat type, and season. The study's findings reveal that conventional acoustic indices and the proposed deep ecoacoustic embeddings approach exhibit overall comparable performance. This article concludes by discussing potential avenues for further refinement of the proposed method, which will further contribute to understanding of soundscape variation across time and space.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"1-16"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142932113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Broadband wireless battery-free acoustic identification tags for high data-rate underwater backscatter communication.
IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Pub Date : 2025-01-01 DOI: 10.1121/10.0034835
Ananya Bhardwaj, Alper Erturk, Karim G Sabra

Developing persistent and smart underwater markers is critical for improving navigation accuracy and communication capabilities of autonomous underwater vehicles (AUVs). A wireless acoustic identification tag, which uses a piezoelectric transducer tuned in the broadband ultrasonic range (200-500 kHz), was experimentally demonstrated to achieve highly efficient power transfer (source-to-tag electrical power efficiency of >2% at 6 m) and concurrent high data rate and backscatter level communication (>83.3 kbit s-1, >170 dB sound pressure level at 6 m) with potential operating range ≈ 10 m based on analytical extrapolations. Parameter selection considerations dictated by the desired range and data-rate requirements in communication are presented. The transducer piezoelectric element selection, impedance matching approach, and simulation-based circuit optimization for frequency multiplexed operation are also detailed. Experimental tests benchmarking performance sensitivity to source and tag misalignment are introduced and implications for AUV operations are discussed.

{"title":"Broadband wireless battery-free acoustic identification tags for high data-rate underwater backscatter communication.","authors":"Ananya Bhardwaj, Alper Erturk, Karim G Sabra","doi":"10.1121/10.0034835","DOIUrl":"https://doi.org/10.1121/10.0034835","url":null,"abstract":"<p><p>Developing persistent and smart underwater markers is critical for improving navigation accuracy and communication capabilities of autonomous underwater vehicles (AUVs). A wireless acoustic identification tag, which uses a piezoelectric transducer tuned in the broadband ultrasonic range (200-500 kHz), was experimentally demonstrated to achieve highly efficient power transfer (source-to-tag electrical power efficiency of >2% at 6 m) and concurrent high data rate and backscatter level communication (>83.3 kbit s-1, >170 dB sound pressure level at 6 m) with potential operating range ≈ 10 m based on analytical extrapolations. Parameter selection considerations dictated by the desired range and data-rate requirements in communication are presented. The transducer piezoelectric element selection, impedance matching approach, and simulation-based circuit optimization for frequency multiplexed operation are also detailed. Experimental tests benchmarking performance sensitivity to source and tag misalignment are introduced and implications for AUV operations are discussed.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"157 1","pages":"445-458"},"PeriodicalIF":2.1,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143023581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Acoustical Society of America
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1