Dame Fall, Marc Duquennoy, Nikolay Smagin, Zakariae Oumekloul, Mohammadi Ouaftouh
This study presents a non-destructive method for estimating surface acoustic wave attenuation, which is highly sensitive to microstructural features, especially at high frequencies. The method uses a single wideband dispersive interdigital transducer (IDT) that remotely emits acoustic waves at the sample's edge. Chirp compression of the temporal displacement response is achieved by correlating the excitation signal with the spatial configuration of the IDT's electrodes. This technique generates high-amplitude pulses with a sufficient signal-to-noise ratio, critical for enabling accurate attenuation estimation over a frequency range (15-70 MHz). Results from nickel and aluminum demonstrate the method's effectiveness for rapid material characterization.
{"title":"Broadband surface acoustic wave attenuation in metals using chirp compression and dispersive interdigital transducers.","authors":"Dame Fall, Marc Duquennoy, Nikolay Smagin, Zakariae Oumekloul, Mohammadi Ouaftouh","doi":"10.1121/10.0039237","DOIUrl":"https://doi.org/10.1121/10.0039237","url":null,"abstract":"<p><p>This study presents a non-destructive method for estimating surface acoustic wave attenuation, which is highly sensitive to microstructural features, especially at high frequencies. The method uses a single wideband dispersive interdigital transducer (IDT) that remotely emits acoustic waves at the sample's edge. Chirp compression of the temporal displacement response is achieved by correlating the excitation signal with the spatial configuration of the IDT's electrodes. This technique generates high-amplitude pulses with a sufficient signal-to-noise ratio, critical for enabling accurate attenuation estimation over a frequency range (15-70 MHz). Results from nickel and aluminum demonstrate the method's effectiveness for rapid material characterization.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145002179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hearing aid (HA) processing can affect acoustic features linked with emotions, potentially making them less distinguishable. This study investigated whether HA processing, with both standard and short processing delays, affects emotion prediction from a set of acoustic features associated with speech emotions and how well these predictions align with perceived emotions. The findings indicated that anger and sadness are the easiest emotions to predict from acoustic features, while happiness and fear are the most accurately perceived emotions by listeners with normal hearing. HA processing, regardless of delay, does not seem to impair the predictability of emotions from acoustic features or the perception of these emotions.
{"title":"Influence of hearing aid processing on acoustic features associated with emotional speech: Acoustic analyses and perception by listeners with normal hearing.","authors":"Frederic Marmel, Dina Lelic","doi":"10.1121/10.0039220","DOIUrl":"https://doi.org/10.1121/10.0039220","url":null,"abstract":"<p><p>Hearing aid (HA) processing can affect acoustic features linked with emotions, potentially making them less distinguishable. This study investigated whether HA processing, with both standard and short processing delays, affects emotion prediction from a set of acoustic features associated with speech emotions and how well these predictions align with perceived emotions. The findings indicated that anger and sadness are the easiest emotions to predict from acoustic features, while happiness and fear are the most accurately perceived emotions by listeners with normal hearing. HA processing, regardless of delay, does not seem to impair the predictability of emotions from acoustic features or the perception of these emotions.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145031224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Benjamin J Kirby, Alaina Cunningham, Olivia Montou Zant
Misophonia is a condition characterized by intense negative emotional reactions to trigger sounds and related stimuli. In this study, adult listeners (N = 15) with a self-reported history of misophonia symptoms and a control group without misophonia (N = 15) completed listening judgements of recorded misophonia trigger stimuli using a standard scale. Participants also completed an established questionnaire of misophonia symptoms, the Misophonia Questionnaire (MQ). Summed scores of the listening task were significantly correlated with overall MQ score. The misophonia group had significantly higher listening scores and MQ scores compared to controls. These findings indicate applications for psychoacoustic methods in the assessment of misophonia.
{"title":"Psychoacoustic assessment of misophonia.","authors":"Benjamin J Kirby, Alaina Cunningham, Olivia Montou Zant","doi":"10.1121/10.0039238","DOIUrl":"https://doi.org/10.1121/10.0039238","url":null,"abstract":"<p><p>Misophonia is a condition characterized by intense negative emotional reactions to trigger sounds and related stimuli. In this study, adult listeners (N = 15) with a self-reported history of misophonia symptoms and a control group without misophonia (N = 15) completed listening judgements of recorded misophonia trigger stimuli using a standard scale. Participants also completed an established questionnaire of misophonia symptoms, the Misophonia Questionnaire (MQ). Summed scores of the listening task were significantly correlated with overall MQ score. The misophonia group had significantly higher listening scores and MQ scores compared to controls. These findings indicate applications for psychoacoustic methods in the assessment of misophonia.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145024843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jenna L Cramer, Ashley Reynard, Vanessa Torres, Jeremy J Donai
Identifying speakers of interest in an auditory scene is a fundamental task that facilitates effective communication. Little is known about the specific contributions of spectral and temporal detail required for identifying a specific speaker of interest by human listeners. This study investigated the relative contributions of spectral and temporal detail for identifying a speaker of interest and perceived effort in doing so. Results showed significant improvements in speaker identification and decreased effort ratings as spectral channels increased. Improved speaker identification performance with increased temporal filter cutoff from 20 Hz to 800 Hz was observed. These results have implications for speech signal processing by amplification devices and automated speaker recognition systems.
{"title":"Spectral and temporal information and presentation mode effects on individual speaker identification and listening effort.","authors":"Jenna L Cramer, Ashley Reynard, Vanessa Torres, Jeremy J Donai","doi":"10.1121/10.0039369","DOIUrl":"https://doi.org/10.1121/10.0039369","url":null,"abstract":"<p><p>Identifying speakers of interest in an auditory scene is a fundamental task that facilitates effective communication. Little is known about the specific contributions of spectral and temporal detail required for identifying a specific speaker of interest by human listeners. This study investigated the relative contributions of spectral and temporal detail for identifying a speaker of interest and perceived effort in doing so. Results showed significant improvements in speaker identification and decreased effort ratings as spectral channels increased. Improved speaker identification performance with increased temporal filter cutoff from 20 Hz to 800 Hz was observed. These results have implications for speech signal processing by amplification devices and automated speaker recognition systems.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145082696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A prior solution for the scattering of traveling wave sound by concentric fluid spheres is recast using complex unimodular s-function notation, which is convenient for expressing partial wave amplitudes and radiation forces on spheres in standing waves. Viscous and thermal energy dissipation are neglected. The fluid core affects the low-frequency dynamics of the fluid shell. The lowest monopole mode of air-filled liquid shells in air is considered. The frequency is approximated by generalizing the analysis of the Minnaert resonance of an air bubble in water. This analysis is relevant to the acoustical scattering by and conditions for trapping of compound drops.
{"title":"Concentric fluid spheres: Scattering and radiation forces and the lowest monopole resonance of bubble shells.","authors":"Philip L Marston","doi":"10.1121/10.0039423","DOIUrl":"https://doi.org/10.1121/10.0039423","url":null,"abstract":"<p><p>A prior solution for the scattering of traveling wave sound by concentric fluid spheres is recast using complex unimodular s-function notation, which is convenient for expressing partial wave amplitudes and radiation forces on spheres in standing waves. Viscous and thermal energy dissipation are neglected. The fluid core affects the low-frequency dynamics of the fluid shell. The lowest monopole mode of air-filled liquid shells in air is considered. The frequency is approximated by generalizing the analysis of the Minnaert resonance of an air bubble in water. This analysis is relevant to the acoustical scattering by and conditions for trapping of compound drops.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145126593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ny Tsiky Rakotomalala, Pierre Baraduc, Pascal Perrier
We present a speech motor control model that integrates optimal feedback control (OFC) for movement planning and execution with a biomechanical model of the vocal tract. The OFC model was designed to optimize a cost function that combines motor effort and the achievement of multisensory goal zones. We show that the model can account for various aspects of speech production: kinematic properties, coarticulation, and sensorimotor integration. Furthermore, we provide evidence that hearing, proprioception, and tactile feedback may play distinct roles in shaping speech trajectories.
{"title":"GEPPETO-OFC: An optimal feedback speech motor control model integrating biomechanical constraints and multisensory goal specification.","authors":"Ny Tsiky Rakotomalala, Pierre Baraduc, Pascal Perrier","doi":"10.1121/10.0039197","DOIUrl":"https://doi.org/10.1121/10.0039197","url":null,"abstract":"<p><p>We present a speech motor control model that integrates optimal feedback control (OFC) for movement planning and execution with a biomechanical model of the vocal tract. The OFC model was designed to optimize a cost function that combines motor effort and the achievement of multisensory goal zones. We show that the model can account for various aspects of speech production: kinematic properties, coarticulation, and sensorimotor integration. Furthermore, we provide evidence that hearing, proprioception, and tactile feedback may play distinct roles in shaping speech trajectories.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145002173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The glockenspiel is a bright, resonant percussion instrument with a series of simple bars mounted next to each other in a frame. Its acoustic radiation remains underexplored, particularly in its full instrument configuration. This study investigates the acoustic radiation and vibrational behavior of a glockenspiel bar in different mounting conditions. Directivity measurements and the scanning laser Doppler vibrometer were used to compare a single bar in free-free, baffled, and full-instrument configurations. The results show that the mounting significantly alters radiation patterns of the bar, particularly at higher modes. Torsional modes exhibited greater deviation from free-free predictions than bending modes, especially in the full-instrument case. The findings highlight the importance of considering frame and structural interactions in modeling glockenspiel vibration and radiation.
{"title":"The effect of mounting conditions on the vibration and directivity patterns of the glockenspiel.","authors":"Hanna M Pavill, Micah R Shepherd","doi":"10.1121/10.0039258","DOIUrl":"https://doi.org/10.1121/10.0039258","url":null,"abstract":"<p><p>The glockenspiel is a bright, resonant percussion instrument with a series of simple bars mounted next to each other in a frame. Its acoustic radiation remains underexplored, particularly in its full instrument configuration. This study investigates the acoustic radiation and vibrational behavior of a glockenspiel bar in different mounting conditions. Directivity measurements and the scanning laser Doppler vibrometer were used to compare a single bar in free-free, baffled, and full-instrument configurations. The results show that the mounting significantly alters radiation patterns of the bar, particularly at higher modes. Torsional modes exhibited greater deviation from free-free predictions than bending modes, especially in the full-instrument case. The findings highlight the importance of considering frame and structural interactions in modeling glockenspiel vibration and radiation.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The passive localization of dual targets composed of a surface ship and a submerged source located nearby beneath the ship is an intriguing problem. This study develops a passive localization method based on multipath arrival angles for dual targets, with similar source levels in the deep-ocean direct arrival zone, using a horizontal line array. Compared to the classical minimum variance distortionless response method, the sparse Bayesian learning method is used to improve resolution for multipath arrival angles under coherent signal conditions, enhancing both the effective range and localization accuracy. The effectiveness of the proposed method has been validated through simulation and experiment.
{"title":"Passive localization of dual targets in deep-ocean direct-arrival zone using a horizontal line array.","authors":"Xiongyi Yu, Feilong Zhu, Yonggang Guo, Dai Liu","doi":"10.1121/10.0039110","DOIUrl":"10.1121/10.0039110","url":null,"abstract":"<p><p>The passive localization of dual targets composed of a surface ship and a submerged source located nearby beneath the ship is an intriguing problem. This study develops a passive localization method based on multipath arrival angles for dual targets, with similar source levels in the deep-ocean direct arrival zone, using a horizontal line array. Compared to the classical minimum variance distortionless response method, the sparse Bayesian learning method is used to improve resolution for multipath arrival angles under coherent signal conditions, enhancing both the effective range and localization accuracy. The effectiveness of the proposed method has been validated through simulation and experiment.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144981257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aditya Kommineni, Rajat Hebbar, Sarah Petrosyan, Pranali Khobragade, Sudarsana Kadiri, Miguel Arce Rentería, Jinkook Lee, Shrikanth Narayanan
Speech foundation models (SFMs) achieve state-of-the-art results in many tasks, but their performance on elderly, multilingual speech remains underexplored. In this work, we investigate SFMs' ability to analyze multilingual speech from older adults using spoken language identification as a proxy task. We propose three key qualities for foundation models to serve multilingual aging populations: robustness to input duration, invariance to speaker demographics, and few-shot transferability in low-resource settings. Zero-shot evaluation indicates a noticeable performance drop for shorter inputs. We find that native speakers' speech consistently outperforms non-native speech across languages. Few-shot learning indicates better transferability in larger models.
{"title":"Can speech foundation models effectively identify languages in low-resource multilingual aging populations?","authors":"Aditya Kommineni, Rajat Hebbar, Sarah Petrosyan, Pranali Khobragade, Sudarsana Kadiri, Miguel Arce Rentería, Jinkook Lee, Shrikanth Narayanan","doi":"10.1121/10.0039265","DOIUrl":"10.1121/10.0039265","url":null,"abstract":"<p><p>Speech foundation models (SFMs) achieve state-of-the-art results in many tasks, but their performance on elderly, multilingual speech remains underexplored. In this work, we investigate SFMs' ability to analyze multilingual speech from older adults using spoken language identification as a proxy task. We propose three key qualities for foundation models to serve multilingual aging populations: robustness to input duration, invariance to speaker demographics, and few-shot transferability in low-resource settings. Zero-shot evaluation indicates a noticeable performance drop for shorter inputs. We find that native speakers' speech consistently outperforms non-native speech across languages. Few-shot learning indicates better transferability in larger models.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12434620/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145034876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Marcelo Saldías O'Hrens, Víctor M Espinoza, Valentina Cruz, Melanie Garay, Josefa Reyes, Camilo Quezada, Pedro Cortez, Christian Castro, Jesús Parra, Anne-Maria Laukkanen
Modelling studies suggest that twang-like voice production with supralaryngeal constriction increases vocal economy. This has not been studied in contemporary commercial music (CCM) singers. This study explores the vocal economy of twang-like voices in CCM singers using the "quasi-output-cost ratio" (QOCR). Ten CCM singers sang the syllable [pa:] loudly, using neutral and twang-like voices at low and high pitches. QOCR, electroglottografic contact quotient, sound pressure level, air pressure, and inverse filtering measures were obtained. QOCR showed no significant differences between the voice types. Air pressure measures were significantly higher in twang-like voices, suggesting increased aerodynamic effort to compensate for supralaryngeal constriction. New tools for studying vocal economy in singing are warranted.
{"title":"Vocal economy in contemporary commercial music singers: A pilot study on twang-like voices.","authors":"Marcelo Saldías O'Hrens, Víctor M Espinoza, Valentina Cruz, Melanie Garay, Josefa Reyes, Camilo Quezada, Pedro Cortez, Christian Castro, Jesús Parra, Anne-Maria Laukkanen","doi":"10.1121/10.0039036","DOIUrl":"https://doi.org/10.1121/10.0039036","url":null,"abstract":"<p><p>Modelling studies suggest that twang-like voice production with supralaryngeal constriction increases vocal economy. This has not been studied in contemporary commercial music (CCM) singers. This study explores the vocal economy of twang-like voices in CCM singers using the \"quasi-output-cost ratio\" (QOCR). Ten CCM singers sang the syllable [pa:] loudly, using neutral and twang-like voices at low and high pitches. QOCR, electroglottografic contact quotient, sound pressure level, air pressure, and inverse filtering measures were obtained. QOCR showed no significant differences between the voice types. Air pressure measures were significantly higher in twang-like voices, suggesting increased aerodynamic effort to compensate for supralaryngeal constriction. New tools for studying vocal economy in singing are warranted.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145076611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}