This paper describes Omura's whale vocalizations recorded off northern Australia. Most common was a two-unit "doublet" (17-50 Hz, peak 25.5 Hz, duration ∼15 s). This was rhythmically repeated for extended periods of time, in typical baleen-whale song structure, suggesting it is a male breeding display. A one-unit "singlet" occurred in the lower-latitude locations off the northwest (13-72 Hz, peak 26.6 Hz, 8 s). In the Great Barrier Reef, similar doublets were detected, identifying a possible population of the species in the Southwest Pacific. Such geographic variation of song may benefit passive acoustic monitoring of this species for conservation management.
{"title":"Omura's whale (Balaenoptera omurai) song features and structure in northern Australian waters.","authors":"Ciara E Browne, Christine Erbe, Robert D McCauley","doi":"10.1121/10.0037187","DOIUrl":"https://doi.org/10.1121/10.0037187","url":null,"abstract":"<p><p>This paper describes Omura's whale vocalizations recorded off northern Australia. Most common was a two-unit \"doublet\" (17-50 Hz, peak 25.5 Hz, duration ∼15 s). This was rhythmically repeated for extended periods of time, in typical baleen-whale song structure, suggesting it is a male breeding display. A one-unit \"singlet\" occurred in the lower-latitude locations off the northwest (13-72 Hz, peak 26.6 Hz, 8 s). In the Great Barrier Reef, similar doublets were detected, identifying a possible population of the species in the Southwest Pacific. Such geographic variation of song may benefit passive acoustic monitoring of this species for conservation management.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144644312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Robert T Taylor, Megan S Ballard, Jason D Sagers, Leah Johnson, Harper Simmons
In wind-wave driven ambient sound environments, the cross-correlation of acoustic data recorded on vertically separated hydrophones can provide estimates of the average sound speed between hydrophones. Deployment of a 52-element vertical line array of hydrophones located on a 425 m deep ridge in the Nordic waters near Jan Mayen Island enabled estimates of the water column sound speed profile. Sound speed profiles were estimated for each 24-min recording, which were collected every four hours over the course of the year-long experiment, supplying a wide range of environmental conditions. Over the full experiment, estimated sound speeds had a root-median-square error of 0.79 m/s when compared to direct measurements.
{"title":"Passive acoustic estimates of sound speed from vertical line array data in the Nordic Seas.","authors":"Robert T Taylor, Megan S Ballard, Jason D Sagers, Leah Johnson, Harper Simmons","doi":"10.1121/10.0037103","DOIUrl":"https://doi.org/10.1121/10.0037103","url":null,"abstract":"<p><p>In wind-wave driven ambient sound environments, the cross-correlation of acoustic data recorded on vertically separated hydrophones can provide estimates of the average sound speed between hydrophones. Deployment of a 52-element vertical line array of hydrophones located on a 425 m deep ridge in the Nordic waters near Jan Mayen Island enabled estimates of the water column sound speed profile. Sound speed profiles were estimated for each 24-min recording, which were collected every four hours over the course of the year-long experiment, supplying a wide range of environmental conditions. Over the full experiment, estimated sound speeds had a root-median-square error of 0.79 m/s when compared to direct measurements.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144644313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The oceans are filled with acoustic waves, which are trapped in a low-velocity layer at about 1 km water depth. The sound speed depends on the temperature. From hydroacoustic arrays, travel times can be obtained through cross-correlating transient signals. Hydroacoustic station H10 (Ascension Island) appeared capable of measuring deep ocean temperature change. A decrease in modeled and observed travel times of -0.002 s/yr was derived between two arrays, corresponding to a warming of 0.007 °C/yr, at about 900 m water depth over two decades. As such, acoustic waves provide an independent and passively acquired measure of the temperature change in the deep ocean.
{"title":"Decadal observations of deep ocean temperature change passively probed with acoustic waves.","authors":"Läslo G Evers","doi":"10.1121/10.0037200","DOIUrl":"https://doi.org/10.1121/10.0037200","url":null,"abstract":"<p><p>The oceans are filled with acoustic waves, which are trapped in a low-velocity layer at about 1 km water depth. The sound speed depends on the temperature. From hydroacoustic arrays, travel times can be obtained through cross-correlating transient signals. Hydroacoustic station H10 (Ascension Island) appeared capable of measuring deep ocean temperature change. A decrease in modeled and observed travel times of -0.002 s/yr was derived between two arrays, corresponding to a warming of 0.007 °C/yr, at about 900 m water depth over two decades. As such, acoustic waves provide an independent and passively acquired measure of the temperature change in the deep ocean.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144644311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Virginia Best, Turley Duque, Defne Abur, Cara E Stepp
A recent study in individuals with hyperfunctional voice disorders (HVDs) reported poorer fo discrimination compared to matched controls. In that study, fo discrimination was measured using each individual's own voice as the stimulus, which may have introduced confounds given the differences in vocal quality between the groups. Here, this paper addressed this possibility by using an fo discrimination task where participants with and without HVDs judged the same external voice. In line with the previous study, the HVD group exhibited poorer fo discrimination than controls, providing convergent evidence for a pitch-related auditory deficit in this population.
{"title":"Convergent evidence for a pitch deficit in hyperfunctional voice disorders.","authors":"Virginia Best, Turley Duque, Defne Abur, Cara E Stepp","doi":"10.1121/10.0037107","DOIUrl":"10.1121/10.0037107","url":null,"abstract":"<p><p>A recent study in individuals with hyperfunctional voice disorders (HVDs) reported poorer fo discrimination compared to matched controls. In that study, fo discrimination was measured using each individual's own voice as the stimulus, which may have introduced confounds given the differences in vocal quality between the groups. Here, this paper addressed this possibility by using an fo discrimination task where participants with and without HVDs judged the same external voice. In line with the previous study, the HVD group exhibited poorer fo discrimination than controls, providing convergent evidence for a pitch-related auditory deficit in this population.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 7","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12238932/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144577144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lilian Nguyen, Gabriel Arnold, Guillaume Andéol, Vincent Isnard
Degraded speech intelligibility in multitalker situations can be improved by spatial unmasking, using sound spatialization techniques such as binaural synthesis. However, intelligibility also depends on the ability to focus efficiently on the target. Three experimental sessions explored the benefit of an additional vibrotactile cue, spatialized around the waist, on intelligibility in different spatialized multitalker situations. Results indicate improvements in intelligibility scores and reduced listening effort specifically for an off-center target among masker talkers. Multimodality allows us to better understand the mechanisms of auditory attention and to open up new perspectives for improving speech intelligibility in multitalker situations.
{"title":"Speech intelligibility in multitalker situations with vibrotactile spatial cueing.","authors":"Lilian Nguyen, Gabriel Arnold, Guillaume Andéol, Vincent Isnard","doi":"10.1121/10.0036850","DOIUrl":"https://doi.org/10.1121/10.0036850","url":null,"abstract":"<p><p>Degraded speech intelligibility in multitalker situations can be improved by spatial unmasking, using sound spatialization techniques such as binaural synthesis. However, intelligibility also depends on the ability to focus efficiently on the target. Three experimental sessions explored the benefit of an additional vibrotactile cue, spatialized around the waist, on intelligibility in different spatialized multitalker situations. Results indicate improvements in intelligibility scores and reduced listening effort specifically for an off-center target among masker talkers. Multimodality allows us to better understand the mechanisms of auditory attention and to open up new perspectives for improving speech intelligibility in multitalker situations.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144236109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The complexity of deep-sea acoustic ray bending is positively correlated with propagation distance, typically turning beyond 10 km. Existing ranging correction methods fail under turning conditions, limiting ranging accuracy. This letter proposes an effective sound velocity estimation method for turning ray, based on reliable acoustic path boundary to classify turning direct sound and establish a ray-tracing model for ranging correction. This method extends high-accuracy ranging from a 10 km scale to the first convergence zone. Experimental results demonstrate that the method achieves a positioning precision of 6.18 m at a horizontal distance of 34 km, validating its effectiveness.
{"title":"A deep-sea direct sound ranging method based on effective sound velocity estimation for turning ray.","authors":"Chiyu Cheng, Cuie Zheng, Xiaoping Hong","doi":"10.1121/10.0036833","DOIUrl":"https://doi.org/10.1121/10.0036833","url":null,"abstract":"<p><p>The complexity of deep-sea acoustic ray bending is positively correlated with propagation distance, typically turning beyond 10 km. Existing ranging correction methods fail under turning conditions, limiting ranging accuracy. This letter proposes an effective sound velocity estimation method for turning ray, based on reliable acoustic path boundary to classify turning direct sound and establish a ray-tracing model for ranging correction. This method extends high-accuracy ranging from a 10 km scale to the first convergence zone. Experimental results demonstrate that the method achieves a positioning precision of 6.18 m at a horizontal distance of 34 km, validating its effectiveness.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144200998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tim Sonnemann, Jan Dettmer, Charles W Holland, Stan E Dosso
This study presents a probabilistic one-step two-dimensional (2D) inversion method of spherical-wave reflection coefficient data to estimate the range-dependent structure and geoacoustic parameters along a track. The approach of inverting such datasets independently as one-dimensional (1D) layered models and merging them to a 2D section is feasible but computationally expensive. This study demonstrates a more parsimonious 2D parametrization for active source data recorded on a towed hydrophone array. The comparison of data variance reduction for 1D- and 2D-based results clearly favors the 2D parametrization described here.
{"title":"Range-dependent meso-scale geoacoustic seabed quantification.","authors":"Tim Sonnemann, Jan Dettmer, Charles W Holland, Stan E Dosso","doi":"10.1121/10.0036835","DOIUrl":"https://doi.org/10.1121/10.0036835","url":null,"abstract":"<p><p>This study presents a probabilistic one-step two-dimensional (2D) inversion method of spherical-wave reflection coefficient data to estimate the range-dependent structure and geoacoustic parameters along a track. The approach of inverting such datasets independently as one-dimensional (1D) layered models and merging them to a 2D section is feasible but computationally expensive. This study demonstrates a more parsimonious 2D parametrization for active source data recorded on a towed hydrophone array. The comparison of data variance reduction for 1D- and 2D-based results clearly favors the 2D parametrization described here.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144201002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Trans-dimensional (trans-D) Bayesian inversion is a powerful approach to estimate seabed geoacoustic models from ocean-acoustic data, combining quantitative model selection and uncertainty estimation. Trans-D inversion samples probabilistically over the number of seabed layers and the geoacoustic parameters for each layer, with layers added and removed in sampling, changing the dimension of the model. However, the probability of accepting dimension changes can approach zero for problems involving highly informative data or large numbers of parameters per layer. This Letter examines the use of parallel tempering, which employs a sequence of interacting Markov chains with successively relaxed likelihoods, to address these challenging cases.
{"title":"Parallel tempering in trans-dimensional Bayesian geoacoustic inversion for high-information-content data and multi-parameter models.","authors":"Stan E Dosso","doi":"10.1121/10.0036948","DOIUrl":"https://doi.org/10.1121/10.0036948","url":null,"abstract":"<p><p>Trans-dimensional (trans-D) Bayesian inversion is a powerful approach to estimate seabed geoacoustic models from ocean-acoustic data, combining quantitative model selection and uncertainty estimation. Trans-D inversion samples probabilistically over the number of seabed layers and the geoacoustic parameters for each layer, with layers added and removed in sampling, changing the dimension of the model. However, the probability of accepting dimension changes can approach zero for problems involving highly informative data or large numbers of parameters per layer. This Letter examines the use of parallel tempering, which employs a sequence of interacting Markov chains with successively relaxed likelihoods, to address these challenging cases.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144318878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this study, the question of generalization for dimension-based statistical learning in speech perception is revisited. Learning for F0 and voice-onset-time as cues to stop voicing has been suggested to be fairly specific to a particular contrast, which was previously shown not to generalize between two places of articulation. The present study seeks to replicate generalization for the same place of articulation, using more varied stimuli and a different design. Then, it is tested if increased evidence for a distributional pattern, i.e., two places of articulation showing that pattern, leads to generalization to a third place of articulation. Same place of articulation learning is replicated, and no generalization across place of articulation is found, reaffirming that learning appears to be quite specific.
{"title":"Examining variability and generalization in dimension-based statistical learning for speech: The case of place of articulation.","authors":"Jeremy Steffman, Minxuan He, Seb Segger-Staveley","doi":"10.1121/10.0036895","DOIUrl":"https://doi.org/10.1121/10.0036895","url":null,"abstract":"<p><p>In this study, the question of generalization for dimension-based statistical learning in speech perception is revisited. Learning for F0 and voice-onset-time as cues to stop voicing has been suggested to be fairly specific to a particular contrast, which was previously shown not to generalize between two places of articulation. The present study seeks to replicate generalization for the same place of articulation, using more varied stimuli and a different design. Then, it is tested if increased evidence for a distributional pattern, i.e., two places of articulation showing that pattern, leads to generalization to a third place of articulation. Same place of articulation learning is replicated, and no generalization across place of articulation is found, reaffirming that learning appears to be quite specific.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144318877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The size of an individual's tongue relative to the oral cavity is associated with articulation speed [Feng, Lu, Zheng, Chi, and Honda, in Proceedings of the 10th Biennial Asia Pacific Conference on Speech, Language, and Hearing (2017), pp. 17-19)] and may affect speech clarity. This study introduces an ultrasound-based method for measuring relative tongue size, termed ultrasound-based relative tongue size (uRTS), as a cost-effective alternative to the magnetic resonance imaging (MRI) based method. Using deep learning to extract the tongue contour, uRTS was calculated from tongue and oropharyngeal cavity sizes in the midsagittal plane. Results from ten speakers showed a strong correlation between uRTS and MRI-based measurements (r = 0.87) and a negative correlation with tongue movement speed (r = -0.73), indicating uRTS is a useful index for assessing tongue size.
个体舌头相对于口腔的大小与发音速度有关[Feng, Lu, Zheng, Chi, and Honda,在第十届亚太言语、语言和听力双年展会议论文集(2017),第17-19页)],并可能影响言语清晰度。本研究介绍了一种基于超声测量相对舌头大小的方法,称为基于超声的相对舌头大小(uRTS),作为一种经济有效的替代基于磁共振成像(MRI)的方法。利用深度学习提取舌头轮廓,从中矢状面上的舌头和口咽腔大小计算uRTS。10名说话者的结果显示,uRTS和基于mri的测量结果之间存在很强的相关性(r = 0.87),而与舌速呈负相关(r = -0.73),这表明uRTS是评估舌大小的有用指标。
{"title":"Ultrasound measurement of relative tongue size and its correlation with tongue mobility for healthy individuals.","authors":"Jing Sun, Tatsuya Kitamura, Yukiko Nota, Noriko Yamane, Ryoko Hayashi","doi":"10.1121/10.0036838","DOIUrl":"https://doi.org/10.1121/10.0036838","url":null,"abstract":"<p><p>The size of an individual's tongue relative to the oral cavity is associated with articulation speed [Feng, Lu, Zheng, Chi, and Honda, in Proceedings of the 10th Biennial Asia Pacific Conference on Speech, Language, and Hearing (2017), pp. 17-19)] and may affect speech clarity. This study introduces an ultrasound-based method for measuring relative tongue size, termed ultrasound-based relative tongue size (uRTS), as a cost-effective alternative to the magnetic resonance imaging (MRI) based method. Using deep learning to extract the tongue contour, uRTS was calculated from tongue and oropharyngeal cavity sizes in the midsagittal plane. Results from ten speakers showed a strong correlation between uRTS and MRI-based measurements (r = 0.87) and a negative correlation with tongue movement speed (r = -0.73), indicating uRTS is a useful index for assessing tongue size.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 6","pages":""},"PeriodicalIF":1.2,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144236110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}