Virginia Best, Turley Duque, Defne Abur, Cara E Stepp
A recent study in individuals with hyperfunctional voice disorders (HVDs) reported poorer fo discrimination compared to matched controls. In that study, fo discrimination was measured using each individual's own voice as the stimulus, which may have introduced confounds given the differences in vocal quality between the groups. Here, this paper addressed this possibility by using an fo discrimination task where participants with and without HVDs judged the same external voice. In line with the previous study, the HVD group exhibited poorer fo discrimination than controls, providing convergent evidence for a pitch-related auditory deficit in this population.
{"title":"Convergent evidence for a pitch deficit in hyperfunctional voice disorders.","authors":"Virginia Best, Turley Duque, Defne Abur, Cara E Stepp","doi":"10.1121/10.0037107","DOIUrl":"10.1121/10.0037107","url":null,"abstract":"<p><p>A recent study in individuals with hyperfunctional voice disorders (HVDs) reported poorer fo discrimination compared to matched controls. In that study, fo discrimination was measured using each individual's own voice as the stimulus, which may have introduced confounds given the differences in vocal quality between the groups. Here, this paper addressed this possibility by using an fo discrimination task where participants with and without HVDs judged the same external voice. In line with the previous study, the HVD group exhibited poorer fo discrimination than controls, providing convergent evidence for a pitch-related auditory deficit in this population.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12238932/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144577144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lilian Nguyen, Gabriel Arnold, Guillaume Andéol, Vincent Isnard
Degraded speech intelligibility in multitalker situations can be improved by spatial unmasking, using sound spatialization techniques such as binaural synthesis. However, intelligibility also depends on the ability to focus efficiently on the target. Three experimental sessions explored the benefit of an additional vibrotactile cue, spatialized around the waist, on intelligibility in different spatialized multitalker situations. Results indicate improvements in intelligibility scores and reduced listening effort specifically for an off-center target among masker talkers. Multimodality allows us to better understand the mechanisms of auditory attention and to open up new perspectives for improving speech intelligibility in multitalker situations.
{"title":"Speech intelligibility in multitalker situations with vibrotactile spatial cueing.","authors":"Lilian Nguyen, Gabriel Arnold, Guillaume Andéol, Vincent Isnard","doi":"10.1121/10.0036850","DOIUrl":"https://doi.org/10.1121/10.0036850","url":null,"abstract":"<p><p>Degraded speech intelligibility in multitalker situations can be improved by spatial unmasking, using sound spatialization techniques such as binaural synthesis. However, intelligibility also depends on the ability to focus efficiently on the target. Three experimental sessions explored the benefit of an additional vibrotactile cue, spatialized around the waist, on intelligibility in different spatialized multitalker situations. Results indicate improvements in intelligibility scores and reduced listening effort specifically for an off-center target among masker talkers. Multimodality allows us to better understand the mechanisms of auditory attention and to open up new perspectives for improving speech intelligibility in multitalker situations.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144236109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The complexity of deep-sea acoustic ray bending is positively correlated with propagation distance, typically turning beyond 10 km. Existing ranging correction methods fail under turning conditions, limiting ranging accuracy. This letter proposes an effective sound velocity estimation method for turning ray, based on reliable acoustic path boundary to classify turning direct sound and establish a ray-tracing model for ranging correction. This method extends high-accuracy ranging from a 10 km scale to the first convergence zone. Experimental results demonstrate that the method achieves a positioning precision of 6.18 m at a horizontal distance of 34 km, validating its effectiveness.
{"title":"A deep-sea direct sound ranging method based on effective sound velocity estimation for turning ray.","authors":"Chiyu Cheng, Cuie Zheng, Xiaoping Hong","doi":"10.1121/10.0036833","DOIUrl":"https://doi.org/10.1121/10.0036833","url":null,"abstract":"<p><p>The complexity of deep-sea acoustic ray bending is positively correlated with propagation distance, typically turning beyond 10 km. Existing ranging correction methods fail under turning conditions, limiting ranging accuracy. This letter proposes an effective sound velocity estimation method for turning ray, based on reliable acoustic path boundary to classify turning direct sound and establish a ray-tracing model for ranging correction. This method extends high-accuracy ranging from a 10 km scale to the first convergence zone. Experimental results demonstrate that the method achieves a positioning precision of 6.18 m at a horizontal distance of 34 km, validating its effectiveness.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144200998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tim Sonnemann, Jan Dettmer, Charles W Holland, Stan E Dosso
This study presents a probabilistic one-step two-dimensional (2D) inversion method of spherical-wave reflection coefficient data to estimate the range-dependent structure and geoacoustic parameters along a track. The approach of inverting such datasets independently as one-dimensional (1D) layered models and merging them to a 2D section is feasible but computationally expensive. This study demonstrates a more parsimonious 2D parametrization for active source data recorded on a towed hydrophone array. The comparison of data variance reduction for 1D- and 2D-based results clearly favors the 2D parametrization described here.
{"title":"Range-dependent meso-scale geoacoustic seabed quantification.","authors":"Tim Sonnemann, Jan Dettmer, Charles W Holland, Stan E Dosso","doi":"10.1121/10.0036835","DOIUrl":"https://doi.org/10.1121/10.0036835","url":null,"abstract":"<p><p>This study presents a probabilistic one-step two-dimensional (2D) inversion method of spherical-wave reflection coefficient data to estimate the range-dependent structure and geoacoustic parameters along a track. The approach of inverting such datasets independently as one-dimensional (1D) layered models and merging them to a 2D section is feasible but computationally expensive. This study demonstrates a more parsimonious 2D parametrization for active source data recorded on a towed hydrophone array. The comparison of data variance reduction for 1D- and 2D-based results clearly favors the 2D parametrization described here.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144201002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Trans-dimensional (trans-D) Bayesian inversion is a powerful approach to estimate seabed geoacoustic models from ocean-acoustic data, combining quantitative model selection and uncertainty estimation. Trans-D inversion samples probabilistically over the number of seabed layers and the geoacoustic parameters for each layer, with layers added and removed in sampling, changing the dimension of the model. However, the probability of accepting dimension changes can approach zero for problems involving highly informative data or large numbers of parameters per layer. This Letter examines the use of parallel tempering, which employs a sequence of interacting Markov chains with successively relaxed likelihoods, to address these challenging cases.
{"title":"Parallel tempering in trans-dimensional Bayesian geoacoustic inversion for high-information-content data and multi-parameter models.","authors":"Stan E Dosso","doi":"10.1121/10.0036948","DOIUrl":"https://doi.org/10.1121/10.0036948","url":null,"abstract":"<p><p>Trans-dimensional (trans-D) Bayesian inversion is a powerful approach to estimate seabed geoacoustic models from ocean-acoustic data, combining quantitative model selection and uncertainty estimation. Trans-D inversion samples probabilistically over the number of seabed layers and the geoacoustic parameters for each layer, with layers added and removed in sampling, changing the dimension of the model. However, the probability of accepting dimension changes can approach zero for problems involving highly informative data or large numbers of parameters per layer. This Letter examines the use of parallel tempering, which employs a sequence of interacting Markov chains with successively relaxed likelihoods, to address these challenging cases.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144318878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this study, the question of generalization for dimension-based statistical learning in speech perception is revisited. Learning for F0 and voice-onset-time as cues to stop voicing has been suggested to be fairly specific to a particular contrast, which was previously shown not to generalize between two places of articulation. The present study seeks to replicate generalization for the same place of articulation, using more varied stimuli and a different design. Then, it is tested if increased evidence for a distributional pattern, i.e., two places of articulation showing that pattern, leads to generalization to a third place of articulation. Same place of articulation learning is replicated, and no generalization across place of articulation is found, reaffirming that learning appears to be quite specific.
{"title":"Examining variability and generalization in dimension-based statistical learning for speech: The case of place of articulation.","authors":"Jeremy Steffman, Minxuan He, Seb Segger-Staveley","doi":"10.1121/10.0036895","DOIUrl":"https://doi.org/10.1121/10.0036895","url":null,"abstract":"<p><p>In this study, the question of generalization for dimension-based statistical learning in speech perception is revisited. Learning for F0 and voice-onset-time as cues to stop voicing has been suggested to be fairly specific to a particular contrast, which was previously shown not to generalize between two places of articulation. The present study seeks to replicate generalization for the same place of articulation, using more varied stimuli and a different design. Then, it is tested if increased evidence for a distributional pattern, i.e., two places of articulation showing that pattern, leads to generalization to a third place of articulation. Same place of articulation learning is replicated, and no generalization across place of articulation is found, reaffirming that learning appears to be quite specific.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144318877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The size of an individual's tongue relative to the oral cavity is associated with articulation speed [Feng, Lu, Zheng, Chi, and Honda, in Proceedings of the 10th Biennial Asia Pacific Conference on Speech, Language, and Hearing (2017), pp. 17-19)] and may affect speech clarity. This study introduces an ultrasound-based method for measuring relative tongue size, termed ultrasound-based relative tongue size (uRTS), as a cost-effective alternative to the magnetic resonance imaging (MRI) based method. Using deep learning to extract the tongue contour, uRTS was calculated from tongue and oropharyngeal cavity sizes in the midsagittal plane. Results from ten speakers showed a strong correlation between uRTS and MRI-based measurements (r = 0.87) and a negative correlation with tongue movement speed (r = -0.73), indicating uRTS is a useful index for assessing tongue size.
个体舌头相对于口腔的大小与发音速度有关[Feng, Lu, Zheng, Chi, and Honda,在第十届亚太言语、语言和听力双年展会议论文集(2017),第17-19页)],并可能影响言语清晰度。本研究介绍了一种基于超声测量相对舌头大小的方法,称为基于超声的相对舌头大小(uRTS),作为一种经济有效的替代基于磁共振成像(MRI)的方法。利用深度学习提取舌头轮廓,从中矢状面上的舌头和口咽腔大小计算uRTS。10名说话者的结果显示,uRTS和基于mri的测量结果之间存在很强的相关性(r = 0.87),而与舌速呈负相关(r = -0.73),这表明uRTS是评估舌大小的有用指标。
{"title":"Ultrasound measurement of relative tongue size and its correlation with tongue mobility for healthy individuals.","authors":"Jing Sun, Tatsuya Kitamura, Yukiko Nota, Noriko Yamane, Ryoko Hayashi","doi":"10.1121/10.0036838","DOIUrl":"https://doi.org/10.1121/10.0036838","url":null,"abstract":"<p><p>The size of an individual's tongue relative to the oral cavity is associated with articulation speed [Feng, Lu, Zheng, Chi, and Honda, in Proceedings of the 10th Biennial Asia Pacific Conference on Speech, Language, and Hearing (2017), pp. 17-19)] and may affect speech clarity. This study introduces an ultrasound-based method for measuring relative tongue size, termed ultrasound-based relative tongue size (uRTS), as a cost-effective alternative to the magnetic resonance imaging (MRI) based method. Using deep learning to extract the tongue contour, uRTS was calculated from tongue and oropharyngeal cavity sizes in the midsagittal plane. Results from ten speakers showed a strong correlation between uRTS and MRI-based measurements (r = 0.87) and a negative correlation with tongue movement speed (r = -0.73), indicating uRTS is a useful index for assessing tongue size.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144236110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kristian Kvist, Sergey V Sorokin, Jan Balle Larsen
This study enhances the approximation "radiation efficiency varying equivalent radiated power." This is done through introducing a new method for estimating radiation efficiencies, based on spherical harmonic decomposition. The proposed improvements eliminate the need for computationally expensive surface integrals and results in solution times comparable with the classical equivalent radiated power approximation. This is achieved while significantly outperforming classical equivalent radiated power in terms of accuracy when compared with full numerical solutions to Helmholtz equation. This is shown both quantitively and qualitatively through numerical acoustic models of two systems of industrial complexity. The proposed improvements make the method robust for non-convex geometries across varying mesh densities, making the method highly suitable for iterative acoustic analysis in industrial applications.
{"title":"Approximating radiated acoustic power by spherical harmonic decomposition.","authors":"Kristian Kvist, Sergey V Sorokin, Jan Balle Larsen","doi":"10.1121/10.0036786","DOIUrl":"https://doi.org/10.1121/10.0036786","url":null,"abstract":"<p><p>This study enhances the approximation \"radiation efficiency varying equivalent radiated power.\" This is done through introducing a new method for estimating radiation efficiencies, based on spherical harmonic decomposition. The proposed improvements eliminate the need for computationally expensive surface integrals and results in solution times comparable with the classical equivalent radiated power approximation. This is achieved while significantly outperforming classical equivalent radiated power in terms of accuracy when compared with full numerical solutions to Helmholtz equation. This is shown both quantitively and qualitatively through numerical acoustic models of two systems of industrial complexity. The proposed improvements make the method robust for non-convex geometries across varying mesh densities, making the method highly suitable for iterative acoustic analysis in industrial applications.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144201000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objects in synthetic aperture sonar imagery that lie proud above an interface will occlude a portion of the background behind it. This occlusion may appear as a shadow in the imagery whose dimensions can be used to infer the height of the object. Volume scattering from beneath the object, however, may prevent such shadows from being visible in imagery. Regions of occlusion can be detected using the coherence between two images collected from different horizontal baselines. Using controlled laboratory experimentation, this approach is shown to detect occluded regions for accurate estimation of the height of an object.
{"title":"Occlusion and height estimation using the coherence of multi-static synthetic aperture sonar images.","authors":"Thomas E Blanford","doi":"10.1121/10.0036836","DOIUrl":"https://doi.org/10.1121/10.0036836","url":null,"abstract":"<p><p>Objects in synthetic aperture sonar imagery that lie proud above an interface will occlude a portion of the background behind it. This occlusion may appear as a shadow in the imagery whose dimensions can be used to infer the height of the object. Volume scattering from beneath the object, however, may prevent such shadows from being visible in imagery. Regions of occlusion can be detected using the coherence between two images collected from different horizontal baselines. Using controlled laboratory experimentation, this approach is shown to detect occluded regions for accurate estimation of the height of an object.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144201001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The implicit representation by physics-informed neural networks (PINNs) serves as an effective solution for a key challenge faced by optical sound measurements. Since optical sound measurements observe line integral of the sound pressure along the optical path, reconstruction is necessary to determine the sound pressure at each point in the three-dimensional field. In this paper, we expand the PINNs-based reconstruction method into three-dimensional reconstruction and demonstrate its effectiveness for optically measured sound fields. Furthermore, we propose a reconstruction approach which can estimate solutions well outside the bounds of the data used for training.
{"title":"Three-dimensional sound field reconstruction from optical projections using physics-informed neural networks.","authors":"Rikuto Ito, Kenji Ishikawa, Risako Tanigawa, Yasuhiro Oikawa","doi":"10.1121/10.0036816","DOIUrl":"https://doi.org/10.1121/10.0036816","url":null,"abstract":"<p><p>The implicit representation by physics-informed neural networks (PINNs) serves as an effective solution for a key challenge faced by optical sound measurements. Since optical sound measurements observe line integral of the sound pressure along the optical path, reconstruction is necessary to determine the sound pressure at each point in the three-dimensional field. In this paper, we expand the PINNs-based reconstruction method into three-dimensional reconstruction and demonstrate its effectiveness for optically measured sound fields. Furthermore, we propose a reconstruction approach which can estimate solutions well outside the bounds of the data used for training.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144201003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}