Pub Date : 2025-07-29Epub Date: 2025-03-31DOI: 10.1044/2024_JSLHR-24-00347
Prad Kadambi, Tristan J Mahr, Katherine C Hustad, Visar Berisha
Purpose: Phonetic forced alignment has a multitude of applications in automated analysis of speech, particularly in studying nonstandard speech such as children's speech. Manual alignment is tedious but serves as the gold standard for clinical-grade alignment. Current tools do not support direct training on manual alignments. Thus, a trainable speaker adaptive phonetic forced alignment system, Wav2TextGrid, was developed for children's speech. The source code for the method is publicly available along with a graphical user interface at https://github.com/pkadambi/Wav2TextGrid.
Method: We propose a trainable, speaker-adaptive, neural forced aligner developed using a corpus of 42 neurotypical children from 3 to 6 years of age. Evaluation on both child speech and on the TIMIT corpus was performed to demonstrate aligner performance across age and dialectal variations.
Results: The trainable alignment tool markedly improved accuracy over baseline for several alignment quality metrics, for all phoneme categories. Accuracy for plosives and affricates in children's speech improved more than 40% over baseline. Performance matched existing methods using approximately 13 min of labeled data, while approximately 45-60 min of labeled alignments yielded significant improvement.
Conclusion: The Wav2TextGrid tool allows alternate alignment workflows where the forced alignments, via training, are directly tailored to match clinical-grade, manually provided alignments.
{"title":"A Tunable Forced Alignment System Based on Deep Learning: Applications to Child Speech.","authors":"Prad Kadambi, Tristan J Mahr, Katherine C Hustad, Visar Berisha","doi":"10.1044/2024_JSLHR-24-00347","DOIUrl":"10.1044/2024_JSLHR-24-00347","url":null,"abstract":"<p><strong>Purpose: </strong>Phonetic forced alignment has a multitude of applications in automated analysis of speech, particularly in studying nonstandard speech such as children's speech. Manual alignment is tedious but serves as the gold standard for clinical-grade alignment. Current tools do not support direct training on manual alignments. Thus, a trainable speaker adaptive phonetic forced alignment system, Wav2TextGrid, was developed for children's speech. The source code for the method is publicly available along with a graphical user interface at https://github.com/pkadambi/Wav2TextGrid.</p><p><strong>Method: </strong>We propose a trainable, speaker-adaptive, neural forced aligner developed using a corpus of 42 neurotypical children from 3 to 6 years of age. Evaluation on both child speech and on the TIMIT corpus was performed to demonstrate aligner performance across age and dialectal variations.</p><p><strong>Results: </strong>The trainable alignment tool markedly improved accuracy over baseline for several alignment quality metrics, for all phoneme categories. Accuracy for plosives and affricates in children's speech improved more than 40% over baseline. Performance matched existing methods using approximately 13 min of labeled data, while approximately 45-60 min of labeled alignments yielded significant improvement.</p><p><strong>Conclusion: </strong>The Wav2TextGrid tool allows alternate alignment workflows where the forced alignments, via training, are directly tailored to match clinical-grade, manually provided alignments.</p><p><strong>Supplemental material: </strong>https://doi.org/10.23641/asha.28593971.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"3583-3601"},"PeriodicalIF":2.2,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337111/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143755644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-05-15DOI: 10.1044/2025_JSLHR-25-00161
{"title":"Erratum to \"Oral Diadochokinetic Performance on Perceptual and Acoustic Measures for Typically Developing Cantonese-Speaking Preschool Children\".","authors":"","doi":"10.1044/2025_JSLHR-25-00161","DOIUrl":"10.1044/2025_JSLHR-25-00161","url":null,"abstract":"","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2836"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144081689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-04-23DOI: 10.1044/2025_JSLHR-24-00535
Matthew Masapollo, Rosalie Gendron, Erin Wyndham, Ally Marcellus, Allen Shamsi, Nathan Maxfield
Purpose: During speech production, complex patterns of coordinated movements between sets of articulators (e.g., jaw and tongue, jaw and lips) form precise and consistent constrictions at distinct locations along the vocal tract, despite rampant contextual variation. Speech motor control research seeks to uncover basic principles of organization governing interarticulator coordination during constriction formation, assuming many degrees of freedom for controlling articulator movements. This study tested the hypothesis that the motor system reduces degrees of freedom and facilitates coordination by reliably controlling interarticulator timing.
Method: Ten talkers produced vowel-consonant-vowel (VCV) sequences, recorded using electromagnetic articulography, with variation in production rate and syllable stress. V was /ɑ/-/ɛ/, and C was alveolar /t/-/d/ or bilabial /p/-/b/. Timing relations between peak velocities of condition-specific sets of articulators were determined during oral closure and release for C: jaw, upper lip, and lower lip for bilabial constrictions and jaw and tongue tip for alveolar constrictions.
Results: During oral closing, the timing of articulator peak velocities was tightly coupled across scalar changes in rate and stress, such that timing variation in one articulator was accompanied by proportional changes in the timing of another articulator. In contrast, the timing of peak velocities was less tightly coupled during subsequent oral opening. The timing of peak velocities was also more reliably differentiated by rate/stress condition during oral closing than opening, indicating that speech articulator movements are temporally coordinated primarily based on the part of movement related to constriction formation, rather than its subsequent release.
Conclusion: Findings align with the view that stable interarticulator timing relations underlie the achievement of precise and consistent vocal tract constrictions during speech.
目的:在语言产生过程中,尽管上下文变化很大,但在声道的不同位置上,不同发音器(如颌和舌头、颌和嘴唇)之间协调运动的复杂模式形成了精确和一致的收缩。语言运动控制研究旨在揭示收缩形成过程中控制发音器协调的基本组织原则,假设控制发音器运动的多个自由度。这项研究验证了运动系统通过可靠地控制关节间的时间来降低自由度和促进协调的假设。方法:10名说话者发出元音-辅音-元音(VCV)序列,用电磁声速记录仪记录,其发音速度和音节重音不同。V /ɑ/ - /ɛ/,和C是肺泡/ t /——/ d /或双唇音/ p /——/ b /。在C的口腔闭合和释放过程中,确定了特定条件下的关节组的峰值速度的时间关系:下颌、上唇和下唇用于双唇收缩,下颌和舌尖用于肺泡收缩。结果:在口腔闭合过程中,发音器峰值速度的时间在速率和应力的标量变化上紧密耦合,使得一个发音器的时间变化伴随着另一个发音器的时间成比例的变化。相比之下,在随后的开口过程中,峰值速度的时间耦合度较低。在闭口时,峰值速度的时间也比开口时的速率/压力条件更可靠地区分,这表明语音发音运动的时间协调主要基于与收缩形成相关的运动部分,而不是随后的释放。结论:研究结果与以下观点一致:在讲话过程中,稳定的发音器间时间关系是实现精确和一致的声道收缩的基础。补充资料:https://doi.org/10.23641/asha.28791872。
{"title":"Interarticulator Timing Relations Underlie the Production of Precise and Consistent Vocal Tract Constrictions During Speech.","authors":"Matthew Masapollo, Rosalie Gendron, Erin Wyndham, Ally Marcellus, Allen Shamsi, Nathan Maxfield","doi":"10.1044/2025_JSLHR-24-00535","DOIUrl":"10.1044/2025_JSLHR-24-00535","url":null,"abstract":"<p><strong>Purpose: </strong>During speech production, complex patterns of coordinated movements between sets of articulators (e.g., jaw and tongue, jaw and lips) form precise and consistent constrictions at distinct locations along the vocal tract, despite rampant contextual variation. Speech motor control research seeks to uncover basic principles of organization governing interarticulator coordination during constriction formation, assuming many degrees of freedom for controlling articulator movements. This study tested the hypothesis that the motor system reduces degrees of freedom and facilitates coordination by reliably controlling interarticulator timing.</p><p><strong>Method: </strong>Ten talkers produced vowel-consonant-vowel (VCV) sequences, recorded using electromagnetic articulography, with variation in production rate and syllable stress. V was /ɑ/-/ɛ/, and C was alveolar /t/-/d/ or bilabial /p/-/b/. Timing relations between peak velocities of condition-specific sets of articulators were determined during oral closure and release for C: jaw, upper lip, and lower lip for bilabial constrictions and jaw and tongue tip for alveolar constrictions.</p><p><strong>Results: </strong>During oral closing, the timing of articulator peak velocities was tightly coupled across scalar changes in rate and stress, such that timing variation in one articulator was accompanied by proportional changes in the timing of another articulator. In contrast, the timing of peak velocities was less tightly coupled during subsequent oral opening. The timing of peak velocities was also more reliably differentiated by rate/stress condition during oral closing than opening, indicating that speech articulator movements are temporally coordinated primarily based on the part of movement related to constriction formation, rather than its subsequent release.</p><p><strong>Conclusion: </strong>Findings align with the view that stable interarticulator timing relations underlie the achievement of precise and consistent vocal tract constrictions during speech.</p><p><strong>Supplemental material: </strong>https://doi.org/10.23641/asha.28791872.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2700-2720"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12173160/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144043757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-05-07DOI: 10.1044/2025_JSLHR-24-00768
Jordanna S Sevitz, Nicole Rogus-Pulia, Georgia A Malandraki, Michelle S Troche
Purpose: Despite evidence to suggest that rehabilitation can improve airway protection in people with Parkinson's disease (pwPD), rehabilitative therapies are underutilized. One newer treatment approach with growing evidence to support its efficacy is cough skill training (CST). To improve utilization of rehabilitations such as CST, it is important to understand patient treatment experience. Therefore, the aim of this study was to define treatment acceptability and satisfaction of CST in pwPD.
Method: Thirteen pwPD were consecutively recruited from a trial during which participants completed two in-person sessions of CST via spirometry over 2 weeks. A mixed-methods approach was used, whereby quantitative data (obtained from questionnaires) and qualitative data (obtained from semistructured interviews) were integrated to provide a holistic understanding of patient experience. Data collection and thematic analyses (of qualitative data) were guided by the Systems Engineering Initiative for Patient Safety framework.
Results: While treatment demonstrated adequate acceptability with a System Usability Scale median score of 70/100 (scores ≥ 70 indicate acceptability), integrative examination of patient experience revealed several themes that highlight barriers and facilitators to treatment acceptability-as they relate to CST tools (visualization, lip seal, and measurement accuracy) and tasks (practice targets, feedback, practice amount, and future training). Most (61.5%) participants were "satisfied" or "very satisfied" with CST. Four themes emerged related to satisfaction: treatment relevance, patient awareness and control over their disease, skill acquisition, and skill transference to real-life choking events.
Conclusions: In this study, patient responses indicated that small, personalized adaptations to equipment, training targets, and feedback may enhance their experiences, while education and personalized goal setting may enhance treatment relevance and perceived benefit. Patient perspectives and needs can inform the refinement of person-centered clinical implementation of CST and improve treatment uptake.
{"title":"Treatment Acceptability and Satisfaction With Cough Skill Training in Parkinson's Disease: A Mixed-Methods Study.","authors":"Jordanna S Sevitz, Nicole Rogus-Pulia, Georgia A Malandraki, Michelle S Troche","doi":"10.1044/2025_JSLHR-24-00768","DOIUrl":"10.1044/2025_JSLHR-24-00768","url":null,"abstract":"<p><strong>Purpose: </strong>Despite evidence to suggest that rehabilitation can improve airway protection in people with Parkinson's disease (pwPD), rehabilitative therapies are underutilized. One newer treatment approach with growing evidence to support its efficacy is cough skill training (CST). To improve utilization of rehabilitations such as CST, it is important to understand patient treatment experience. Therefore, the aim of this study was to define treatment acceptability and satisfaction of CST in pwPD.</p><p><strong>Method: </strong>Thirteen pwPD were consecutively recruited from a trial during which participants completed two in-person sessions of CST via spirometry over 2 weeks. A mixed-methods approach was used, whereby quantitative data (obtained from questionnaires) and qualitative data (obtained from semistructured interviews) were integrated to provide a holistic understanding of patient experience. Data collection and thematic analyses (of qualitative data) were guided by the Systems Engineering Initiative for Patient Safety framework.</p><p><strong>Results: </strong>While treatment demonstrated adequate acceptability with a System Usability Scale median score of 70/100 (scores ≥ 70 indicate acceptability), integrative examination of patient experience revealed several themes that highlight barriers and facilitators to treatment acceptability-as they relate to CST tools (visualization, lip seal, and measurement accuracy) and tasks (practice targets, feedback, practice amount, and future training). Most (61.5%) participants were \"satisfied\" or \"very satisfied\" with CST. Four themes emerged related to satisfaction: treatment <i>relevance</i>, patient <i>awareness and control</i> over their disease, <i>skill acquisition</i>, and <i>skill transference</i> to real-life choking events<i>.</i></p><p><strong>Conclusions: </strong>In this study, patient responses indicated that small, personalized adaptations to equipment, training targets, and feedback may enhance their experiences, while education and personalized goal setting may enhance treatment relevance and perceived benefit. Patient perspectives and needs can inform the refinement of person-centered clinical implementation of CST and improve treatment uptake.</p><p><strong>Supplemental material: </strong>https://doi.org/10.23641/asha.28887167.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2782-2801"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12510374/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144016153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-05-05DOI: 10.1044/2025_JSLHR-25-00034
Brett Welch, Leah B Helou, Aidan Wright
Purpose: Voice disorders associated with vocal hyperfunction are some of the most common vocal pathologies. Certain personality traits are thought to be a risk factor for developing these disorders. The trait theory of voice disorders (TTVD) provided a unified framework to understand these relationships. This study re-examines the TTVD by adopting current theories and methods from personality science.
Method: This cross-sectional study recruited individuals diagnosed with primary muscle tension dysphonia (MTD) or a diagnosis associated with phonotraumatic vocal hyperfunction (PVH), that is, benign bilateral lesions of the lamina propria and bilateral or unilateral vocal fold polyp(s). Participants completed a contemporary personality battery. Data were analyzed via structural equation modeling and compared to vocally healthy controls.
Results: Several significant differences existed between vocally healthy controls (n = 416) and participants with MTD (n = 71) or PVH (n = 38). Compared to the controls, participants with MTD reported lower levels of Stability, Plasticity, Conscientiousness, Openness/Intellect, and the corresponding aspects of Industriousness and Intellect, respectively. Conversely, the MTD cohort was significantly higher in Neuroticism and its two corresponding aspects, Withdrawal and Volatility. Likewise, when compared to controls, the PVH reported significantly lower levels of Stability, Agreeableness, its aspect Politeness, and the aspect of Industriousness. Finally, compared to the MTD cohort, individuals with PVH were higher on Extraversion, specifically the aspect of Assertiveness, and lower on the aspect of Politeness.
Conclusions: The current study largely replicates the initial TTVD studies and updates them with a modern theory of personality. These results provide a strong foundation for future investigations to continue to study the relationships between personality traits and voice disorders.
{"title":"Integrating Advances in Personality Science to Re-Examine the Trait Theory of Voice Disorders.","authors":"Brett Welch, Leah B Helou, Aidan Wright","doi":"10.1044/2025_JSLHR-25-00034","DOIUrl":"10.1044/2025_JSLHR-25-00034","url":null,"abstract":"<p><strong>Purpose: </strong>Voice disorders associated with vocal hyperfunction are some of the most common vocal pathologies. Certain personality traits are thought to be a risk factor for developing these disorders. The trait theory of voice disorders (TTVD) provided a unified framework to understand these relationships. This study re-examines the TTVD by adopting current theories and methods from personality science.</p><p><strong>Method: </strong>This cross-sectional study recruited individuals diagnosed with primary muscle tension dysphonia (MTD) or a diagnosis associated with phonotraumatic vocal hyperfunction (PVH), that is, benign bilateral lesions of the lamina propria and bilateral or unilateral vocal fold polyp(s). Participants completed a contemporary personality battery. Data were analyzed via structural equation modeling and compared to vocally healthy controls.</p><p><strong>Results: </strong>Several significant differences existed between vocally healthy controls (<i>n</i> = 416) and participants with MTD (<i>n</i> = 71) or PVH (<i>n</i> = 38). Compared to the controls, participants with MTD reported lower levels of Stability, Plasticity, Conscientiousness, Openness/Intellect, and the corresponding aspects of Industriousness and Intellect, respectively. Conversely, the MTD cohort was significantly higher in Neuroticism and its two corresponding aspects, Withdrawal and Volatility. Likewise, when compared to controls, the PVH reported significantly lower levels of Stability, Agreeableness, its aspect Politeness, and the aspect of Industriousness. Finally, compared to the MTD cohort, individuals with PVH were higher on Extraversion, specifically the aspect of Assertiveness, and lower on the aspect of Politeness.</p><p><strong>Conclusions: </strong>The current study largely replicates the initial TTVD studies and updates them with a modern theory of personality. These results provide a strong foundation for future investigations to continue to study the relationships between personality traits and voice disorders.</p><p><strong>Supplemental material: </strong>https://doi.org/10.23641/asha.28843052.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2759-2781"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12510375/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144065180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-05-15DOI: 10.1044/2025_JSLHR-24-00306
David R Moore, Li Lin, Ritu Bhalerao, Jody Caldwell-Kurtzman, Lisa L Hunter
Purpose: Listening difficulty (LiD), often classified as auditory processing disorder (APD), has been studied in both research and clinic settings. The aim of this study was to examine the predictive relation between these two settings. In our SICLiD (Sensitive Indicators of Childhood Listening Difficulties) research study, children with normal audiometry, but caregiver-reported LiD, performed poorly on both listening and cognitive tests. Here, we examined results of clinical assessments and interventions for these children in relation to research performance.
Method: Study setting was a tertiary pediatric hospital. Electronic medical records were reviewed for 64 children aged 6-13 years recruited into a SICLiD LiD group based on a caregiver report (Evaluation of Children's Listening and Processing Skill [ECLiPS]). The review focused on clinical assessments and interventions provided by audiology, occupational therapy, psychology (developmental and behavioral pediatrics), and speech-language pathology services, prior to study participation. Descriptive statistics on clinical encounters, identified conditions, and interventions were compared with quantitative, standardized performance on research tests. z scores were compared for participants with and without each clinical condition using univariate and logistic prediction analyses.
Results: Overall, 24 clinical categories related to LiD, including APD, were identified. Common conditions were Attention (32%), Language (28%), Hearing (18%), Anxiety (16%), and Autism Spectrum Disorder (6%). Performance on research tests varied significantly between providers, conditions, and interventions. Quantitative research data combined with caregiver reports provided reliable predictions of all clinical conditions except APD. Significant correlations in individual tests were scarce but included the SCAN Composite score, which predicted clinical language and attention difficulties, but not APD diagnoses.
Conclusions: The variety of disciplines, assessments, conditions, and interventions revealed here supports previous studies showing that LiD is a multifaceted problem of neurodevelopment. Comparisons between clinical- and research-based assessments suggest a path that prioritizes caregiver reports and selected psychometric tests for screening and diagnostic purposes.
{"title":"Multidisciplinary Clinical Assessment and Interventions for Childhood Listening Difficulty and Auditory Processing Disorder: Relation Between Research Findings and Clinical Practice.","authors":"David R Moore, Li Lin, Ritu Bhalerao, Jody Caldwell-Kurtzman, Lisa L Hunter","doi":"10.1044/2025_JSLHR-24-00306","DOIUrl":"10.1044/2025_JSLHR-24-00306","url":null,"abstract":"<p><strong>Purpose: </strong>Listening difficulty (LiD), often classified as auditory processing disorder (APD), has been studied in both research and clinic settings. The aim of this study was to examine the predictive relation between these two settings. In our SICLiD (Sensitive Indicators of Childhood Listening Difficulties) research study, children with normal audiometry, but caregiver-reported LiD, performed poorly on both listening and cognitive tests. Here, we examined results of clinical assessments and interventions for these children in relation to research performance.</p><p><strong>Method: </strong>Study setting was a tertiary pediatric hospital. Electronic medical records were reviewed for 64 children aged 6-13 years recruited into a SICLiD LiD group based on a caregiver report (Evaluation of Children's Listening and Processing Skill [ECLiPS]). The review focused on clinical assessments and interventions provided by audiology, occupational therapy, psychology (developmental and behavioral pediatrics), and speech-language pathology services, prior to study participation. Descriptive statistics on clinical encounters, identified conditions, and interventions were compared with quantitative, standardized performance on research tests. <i>z</i> scores were compared for participants with and without each clinical condition using univariate and logistic prediction analyses.</p><p><strong>Results: </strong>Overall, 24 clinical categories related to LiD, including APD, were identified. Common conditions were Attention (32%), Language (28%), Hearing (18%), Anxiety (16%), and Autism Spectrum Disorder (6%). Performance on research tests varied significantly between providers, conditions, and interventions. Quantitative research data combined with caregiver reports provided reliable predictions of all clinical conditions except APD. Significant correlations in individual tests were scarce but included the SCAN Composite score, which predicted clinical language and attention difficulties, but not APD diagnoses.</p><p><strong>Conclusions: </strong>The variety of disciplines, assessments, conditions, and interventions revealed here supports previous studies showing that LiD is a multifaceted problem of neurodevelopment. Comparisons between clinical- and research-based assessments suggest a path that prioritizes caregiver reports and selected psychometric tests for screening and diagnostic purposes.</p><p><strong>Supplemental material: </strong>https://doi.org/10.23641/asha.28907780.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2978-2991"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12510372/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144081716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-04-29DOI: 10.1044/2025_JSLHR-24-00749
Rachel Hahn Arkenberg, Samantha Mitchell, Anumitha Venkatraman, M Preeti Sivasankar, William G Pearson, Georgia A Malandraki
Purpose: Reduced ability to raise vocal pitch has been associated with risk of aspiration in some populations. However, the mechanisms driving this association are understudied. This secondary data analysis aims to add to our knowledge on the shared and distinct biomechanics of swallowing and maximum vocal pitch elevation.
Method: We used existing data from the study of Venkatraman et al. (2020) on 10 healthy younger (age range: 19-23, M = 21) and eight older (age range: 65-79; M = 73) adults who completed maximum pitch elevation and swallow tasks under videofluoroscopy. A MATLAB tracking tool and computational analysis of swallowing mechanics was used to analyze five elements of pharyngeal swallowing biomechanics (anterior and superior hyoid excursion, laryngeal elevation, pharyngeal shortening, tongue base retraction, head/neck extension). Canonical variate analysis (CVA) determined differences associated with task and age. Post hoc discriminant function analyses (DFAs) compared the events between tasks in each group.
Results: CVA revealed that 63.9% of variance was accounted for by task (D = 3.46, p < .0001) and 35.5% by age (D = 1.92, p < .0001). Across age, DFAs indicated similar anterior hyoid excursion and laryngeal elevation between tasks, but greater superior hyoid excursion during swallows, replicating earlier findings. We also found greater base of tongue retraction during swallows than maximum pitch and greater pharyngeal shortening during maximum pitch elevation compared to swallows across groups (D = 5.38, p < .0001).
Conclusions: Maximum pitch elevation and swallowing require similar anterior hyoid and laryngeal excursion. Added to the novel finding of greater pharyngeal shortening during pitch elevation, we indicate that pitch glides may be a mechanism for targeting pharyngeal dysphagia and warrant further investigation.
目的:在某些人群中,提高音高的能力降低与误吸风险有关。然而,推动这种联系的机制尚未得到充分研究。这一次要数据分析旨在增加我们对吞咽和最大音高提升的共享和独特生物力学的了解。方法:使用Venkatraman et al.(2020)的现有研究数据,对10名健康年轻人(年龄范围:19-23,M = 21)和8名老年人(年龄范围:65-79;在透视下完成最大俯仰和吞咽任务的73名成年人。采用MATLAB跟踪工具和吞咽力学计算分析,分析咽部吞咽生物力学的5个要素(舌骨前上移位、喉部抬高、咽部缩短、舌根后收、头/颈后伸)。典型变量分析(CVA)确定了与任务和年龄相关的差异。事后判别函数分析(dfa)比较各组任务之间的事件。结果:CVA显示,63.9%的方差由任务引起(D = 3.46, p < 0.0001), 35.5%的方差由年龄引起(D = 1.92, p < 0.0001)。在不同年龄,dfa显示任务之间类似的舌骨前偏移和喉抬高,但在吞咽过程中更大的舌骨上偏移,重复了早期的发现。我们还发现,与各组燕子相比,在最大音调时,燕子的舌根收缩比最大音调时更大,在最大音调升高时,咽缩短比最大音调升高时更大(D = 5.38, p < 0.0001)。结论:最大音高提升和吞咽需要相似的舌骨和喉前移位。此外,我们还发现音调升高时咽部会缩短,这表明音调下降可能是针对咽部吞咽困难的一种机制,值得进一步研究。
{"title":"Maximum Vocal Pitch Elevation and Swallowing: A Secondary Data Analysis Supporting Additional Shared Biomechanics and Potential Treatment Targets.","authors":"Rachel Hahn Arkenberg, Samantha Mitchell, Anumitha Venkatraman, M Preeti Sivasankar, William G Pearson, Georgia A Malandraki","doi":"10.1044/2025_JSLHR-24-00749","DOIUrl":"10.1044/2025_JSLHR-24-00749","url":null,"abstract":"<p><strong>Purpose: </strong>Reduced ability to raise vocal pitch has been associated with risk of aspiration in some populations. However, the mechanisms driving this association are understudied. This secondary data analysis aims to add to our knowledge on the shared and distinct biomechanics of swallowing and maximum vocal pitch elevation.</p><p><strong>Method: </strong>We used existing data from the study of Venkatraman et al. (2020) on 10 healthy younger (age range: 19-23, <i>M</i> = 21) and eight older (age range: 65-79; <i>M</i> = 73) adults who completed maximum pitch elevation and swallow tasks under videofluoroscopy. A MATLAB tracking tool and computational analysis of swallowing mechanics was used to analyze five elements of pharyngeal swallowing biomechanics (anterior and superior hyoid excursion, laryngeal elevation, pharyngeal shortening, tongue base retraction, head/neck extension). Canonical variate analysis (CVA) determined differences associated with task and age. Post hoc discriminant function analyses (DFAs) compared the events between tasks in each group.</p><p><strong>Results: </strong>CVA revealed that 63.9% of variance was accounted for by task (<i>D</i> = 3.46, <i>p</i> < .0001) and 35.5% by age (<i>D</i> = 1.92, <i>p</i> < .0001). Across age, DFAs indicated similar anterior hyoid excursion and laryngeal elevation between tasks, but greater superior hyoid excursion during swallows, replicating earlier findings. We also found greater base of tongue retraction during swallows than maximum pitch and greater pharyngeal shortening during maximum pitch elevation compared to swallows across groups (<i>D</i> = 5.38, <i>p</i> < .0001).</p><p><strong>Conclusions: </strong>Maximum pitch elevation and swallowing require similar anterior hyoid and laryngeal excursion. Added to the novel finding of greater pharyngeal shortening during pitch elevation, we indicate that pitch glides may be a mechanism for targeting pharyngeal dysphagia and warrant further investigation.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2674-2684"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144031373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-05-14DOI: 10.1044/2025_JSLHR-24-00446
Wei Hu, Libo Qiao, Lei Wu, Guoli Yan, Lihong Wang, Can Xu, Yao Chen, Chang Liu
<p><strong>Purpose: </strong>The goal of this study was to investigate how face masks influenced the acoustic features of Chinese running speech in both temporal and spectral domains and how the intelligibility of the speech with face masks was affected in quiet and multitalker babbles. The relationship between the acoustic features and speech intelligibility was also examined.</p><p><strong>Method: </strong>In Experiment 1, Mandarin Chinese sentences were recorded by 24 native Mandarin Chinese speakers wearing a surgical mask, a KN95 mask, or not wearing a mask and temporal modulation (TM) depth; speaking rate, spectral tilt, and average value and standard deviation of fundamental frequency (<i>F</i>0) were then examined. In Experiment 2, the intelligibility of these recorded sentences were assessed in quiet and multitalker babble with the signal-to-noise ratios of -2 and -5 dB. To further examine the possible causal relationship between the impacted acoustic variables and speech intelligibility under different mask wearing conditions, the acoustic and speech intelligibility data were analyzed in a stepwise regression.</p><p><strong>Results: </strong>Results showed that both the KN95 and surgical masks produced significantly smaller TM depth compared to the no-mask condition. In terms of speaking rate, participants spoke faster with face masks than without a mask, whereas there was no significant difference between the KN95 and surgical mask. Additionally, spectral tilt was significantly shallower for the two face masks compared to the no-mask condition. Regarding <i>F</i>0, the mean <i>F</i>0 was higher with the KN95 mask than the surgical mask and no mask, while the standard deviation of <i>F</i>0 was lower in the two mask conditions than the no-mask condition, with no significant difference between the two types of masks. In addition to these acoustic differences, speech intelligibility in noise was significantly lower for the two mask conditions than the no-mask condition, with no significant difference between the KN95 and surgical masks, whereas there was no significant effect of face masks on speech intelligibly in quiet. Finally, the relationship between acoustic features and speech intelligibility showed that, under noise conditions, TM depth, spectral tilt, and <i>F</i>0 dynamics (e.g., standard deviation) were significantly correlated with speech intelligibility, while speaking rate and mean <i>F</i>0 were not.</p><p><strong>Conclusions: </strong>Acoustically, face masks led to smaller TM depth, slower speaking rate, shallower spectral tilt, higher mean <i>F</i>0 and smaller standard deviation of <i>F</i>0 in Mandarin Chinese running speech, and perceptually resulted in lower speech intelligibility in noise, but had no impact on speech intelligibility in quiet. Findings also suggest that certain acoustic characteristics (e.g., TM depth and spectral tilt) play important roles on speech intelligibility, especially in challenging listeni
{"title":"Face Mask Effects on Acoustic Features and Intelligibility of Mandarin Chinese Speech.","authors":"Wei Hu, Libo Qiao, Lei Wu, Guoli Yan, Lihong Wang, Can Xu, Yao Chen, Chang Liu","doi":"10.1044/2025_JSLHR-24-00446","DOIUrl":"10.1044/2025_JSLHR-24-00446","url":null,"abstract":"<p><strong>Purpose: </strong>The goal of this study was to investigate how face masks influenced the acoustic features of Chinese running speech in both temporal and spectral domains and how the intelligibility of the speech with face masks was affected in quiet and multitalker babbles. The relationship between the acoustic features and speech intelligibility was also examined.</p><p><strong>Method: </strong>In Experiment 1, Mandarin Chinese sentences were recorded by 24 native Mandarin Chinese speakers wearing a surgical mask, a KN95 mask, or not wearing a mask and temporal modulation (TM) depth; speaking rate, spectral tilt, and average value and standard deviation of fundamental frequency (<i>F</i>0) were then examined. In Experiment 2, the intelligibility of these recorded sentences were assessed in quiet and multitalker babble with the signal-to-noise ratios of -2 and -5 dB. To further examine the possible causal relationship between the impacted acoustic variables and speech intelligibility under different mask wearing conditions, the acoustic and speech intelligibility data were analyzed in a stepwise regression.</p><p><strong>Results: </strong>Results showed that both the KN95 and surgical masks produced significantly smaller TM depth compared to the no-mask condition. In terms of speaking rate, participants spoke faster with face masks than without a mask, whereas there was no significant difference between the KN95 and surgical mask. Additionally, spectral tilt was significantly shallower for the two face masks compared to the no-mask condition. Regarding <i>F</i>0, the mean <i>F</i>0 was higher with the KN95 mask than the surgical mask and no mask, while the standard deviation of <i>F</i>0 was lower in the two mask conditions than the no-mask condition, with no significant difference between the two types of masks. In addition to these acoustic differences, speech intelligibility in noise was significantly lower for the two mask conditions than the no-mask condition, with no significant difference between the KN95 and surgical masks, whereas there was no significant effect of face masks on speech intelligibly in quiet. Finally, the relationship between acoustic features and speech intelligibility showed that, under noise conditions, TM depth, spectral tilt, and <i>F</i>0 dynamics (e.g., standard deviation) were significantly correlated with speech intelligibility, while speaking rate and mean <i>F</i>0 were not.</p><p><strong>Conclusions: </strong>Acoustically, face masks led to smaller TM depth, slower speaking rate, shallower spectral tilt, higher mean <i>F</i>0 and smaller standard deviation of <i>F</i>0 in Mandarin Chinese running speech, and perceptually resulted in lower speech intelligibility in noise, but had no impact on speech intelligibility in quiet. Findings also suggest that certain acoustic characteristics (e.g., TM depth and spectral tilt) play important roles on speech intelligibility, especially in challenging listeni","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2929-2944"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144081694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-05-14DOI: 10.1044/2025_JSLHR-24-00883
Delaney E Kelemen, Camden Burnsworth, Charles Chubb, Tracy M Centanni
Purpose: Pitch perception is important for speech sound learning, and reading acquisition requires integration of speech sounds and written letters. Many individuals with dyslexia exhibit auditory perception deficits that may therefore contribute to their reading impairment given that complex pitch perception is crucial for categorizing speech sounds. Given rising interest in music training as a reading intervention, understanding associations between prior music experiences and pitch perception is important. This study explored the relationship between pitch perception skills and reading ability in young adults with and without dyslexia with various levels of musical experience.
Method: Young adults (18-35 years old) with (N = 43) and without (N = 105) dyslexia completed two pitch perception tasks, reading assessments, and a survey reporting formal music training and childhood home music environment (HME).
Results: Participants with dyslexia performed worse than typically developing peers on both pitch perception tasks. Single-word reading was related to pitch perception in the typically developing group only. Childhood HME positively correlated with mode categorization and simple pitch discrimination in both groups. Formal music training was associated with performance on both pitch perception tasks in the typically developing group, and simple pitch discrimination in the dyslexia group.
Conclusions: Pitch perception deficits may interfere with complex acoustic categorization and persist in some individuals with dyslexia despite prior music experiences. Future research should investigate the link between pitch perception and phonological awareness in dyslexia and assess whether music interventions targeting these skills improve reading.
{"title":"Complex Pitch Perception Deficits in Dyslexia Persist Regardless of Previous Musical Experiences.","authors":"Delaney E Kelemen, Camden Burnsworth, Charles Chubb, Tracy M Centanni","doi":"10.1044/2025_JSLHR-24-00883","DOIUrl":"10.1044/2025_JSLHR-24-00883","url":null,"abstract":"<p><strong>Purpose: </strong>Pitch perception is important for speech sound learning, and reading acquisition requires integration of speech sounds and written letters. Many individuals with dyslexia exhibit auditory perception deficits that may therefore contribute to their reading impairment given that complex pitch perception is crucial for categorizing speech sounds. Given rising interest in music training as a reading intervention, understanding associations between prior music experiences and pitch perception is important. This study explored the relationship between pitch perception skills and reading ability in young adults with and without dyslexia with various levels of musical experience.</p><p><strong>Method: </strong>Young adults (18-35 years old) with (<i>N</i> = 43) and without (<i>N</i> = 105) dyslexia completed two pitch perception tasks, reading assessments, and a survey reporting formal music training and childhood home music environment (HME).</p><p><strong>Results: </strong>Participants with dyslexia performed worse than typically developing peers on both pitch perception tasks. Single-word reading was related to pitch perception in the typically developing group only. Childhood HME positively correlated with mode categorization and simple pitch discrimination in both groups. Formal music training was associated with performance on both pitch perception tasks in the typically developing group, and simple pitch discrimination in the dyslexia group.</p><p><strong>Conclusions: </strong>Pitch perception deficits may interfere with complex acoustic categorization and persist in some individuals with dyslexia despite prior music experiences. Future research should investigate the link between pitch perception and phonological awareness in dyslexia and assess whether music interventions targeting these skills improve reading.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2871-2884"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144080706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-05Epub Date: 2025-05-14DOI: 10.1044/2025_JSLHR-24-00651
Erin West, Shani Dettman, Colleen Holt
Purpose: The aim of the study was to describe the expressive sign vocabularies of a group of children learning Australian Sign Language (Auslan).
Method: The spontaneous signs of 44 children aged 3.0-6.8 years enrolled in one early-years bilingual-bicultural educational program were documented using a new approach, the Handshape Analysis Recording Tool, across a 2-year period. The resultant corpus was analyzed to determine the frequency of word classes including nouns, verbs, and adjectives.
Results: There were 3,003 Auslan tokens and 806 different sign types. Nouns, adjectives, and verbs were highly represented in this exploratory study, comprising 54.1%, 21.0%, and 15.8% of the entire corpus, respectively. Preliminary analyses indicated differences in the composition of Auslan vocabularies when compared with existing spoken English and American Sign Language data.
Conclusions: This exploratory study identified that the types of Auslan word classes used by this heterogeneous group of young learners included a high proportion of nouns and adjectives. While comparisons with past data are stated with caution as the composition of the child sample group was not controlled, there is preliminary support for earlier exposure and focused teaching of Auslan to facilitate the development of more varied expressive sign vocabularies.
{"title":"Australian Sign Language Lexicons in a Bilingual-Bicultural Program.","authors":"Erin West, Shani Dettman, Colleen Holt","doi":"10.1044/2025_JSLHR-24-00651","DOIUrl":"10.1044/2025_JSLHR-24-00651","url":null,"abstract":"<p><strong>Purpose: </strong>The aim of the study was to describe the expressive sign vocabularies of a group of children learning Australian Sign Language (Auslan).</p><p><strong>Method: </strong>The spontaneous signs of 44 children aged 3.0-6.8 years enrolled in one early-years bilingual-bicultural educational program were documented using a new approach, the Handshape Analysis Recording Tool, across a 2-year period. The resultant corpus was analyzed to determine the frequency of word classes including nouns, verbs, and adjectives.</p><p><strong>Results: </strong>There were 3,003 Auslan tokens and 806 different sign types. Nouns, adjectives, and verbs were highly represented in this exploratory study, comprising 54.1%, 21.0%, and 15.8% of the entire corpus, respectively. Preliminary analyses indicated differences in the composition of Auslan vocabularies when compared with existing spoken English and American Sign Language data.</p><p><strong>Conclusions: </strong>This exploratory study identified that the types of Auslan word classes used by this heterogeneous group of young learners included a high proportion of nouns and adjectives. While comparisons with past data are stated with caution as the composition of the child sample group was not controlled, there is preliminary support for earlier exposure and focused teaching of Auslan to facilitate the development of more varied expressive sign vocabularies.</p>","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"2851-2870"},"PeriodicalIF":2.2,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144080556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}