首页 > 最新文献

Journal of Voice最新文献

英文 中文
Management of Severe Dysphonia and Dysphagia Following Lateral Skull Base Surgery. 颅底外侧手术后严重发音障碍和吞咽困难的处理。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-10-03 DOI: 10.1016/j.jvoice.2024.08.031
Yanyan Niu, Chuan Chen, Xiaofeng Jin, Hong Huo, Tingting Cui, Jian Wang

Objectives: Lateral skull base surgeries pose a risk of injuring the lower cranial nerves, leading to potential postoperative complications such as dysphonia and dysphagia. Conservative treatments have shown limited efficacy in addressing these resultant voice and swallowing dysfunctions, significantly impacting patient quality of life. This study aims to evaluate the safety and effectiveness of a combined surgical approach involving autologous fat injection laryngoplasty (AFIL) and transcervical cricopharyngeal myotomy (TCPM) in patients suffering from severe dysphonia and dysphagia following lateral skull base surgery.

Methods: A retrospective analysis was conducted on 16 patients who underwent concurrent AFIL and TCPM to improve severe dysphonia and dysphagia following lateral skull base surgery. Preoperative and postoperative assessments of voice and swallowing functions were performed using the Voice Handicap Index-10 (VHI-10), GRBAS scale, maximum phonation time (MPT), the Chinese version of Swallow Quality-of-Life Questionnaire (CSWAL-QOL), and videofluoroscopic swallowing studies (VFSS).

Results: The results demonstrated notable improvements in voice quality and swallowing function. The VHI-10 score improved significantly from a preoperative mean of 32.06 ± 4.92 to a postoperative 9.06 ± 5.24. The results of the perceptual parameters of the GRBAS scale also improved significantly. The MPT increased from a preoperative average of 3.91 ± 1.00 seconds to 9.14 ± 2.44 seconds postoperatively. The CSWAL-QOL scores significantly improved from a preoperative score of 92.44 ± 17.75 to 130.19 ± 26.07 postoperatively. The VFSS-SWAL scores decreased from 6.63 ± 1.36 before surgery to 3.56 ± 1.58 after surgery. Similarly, the Penetration Aspiration Scale (PAS) scores significantly dropped from 6.38 ± 1.05 preoperatively to 2.93 ± 1.48 postoperatively. Nine out of 11 patients were able to have their gastric tubes successfully removed after surgery. There were no significant postoperative complications.

Conclusion: Concurrent AFIL and TCPM present a promising reconstructive method for patients experiencing severe dysphonia and dysphagia following lateral skull base surgery, highlighting its value in the postoperative management of complex lower cranial nerve injuries.

目的:侧颅底手术存在损伤下颅神经的风险,可能导致发音障碍和吞咽困难等术后并发症。保守疗法对解决由此导致的发音和吞咽功能障碍的疗效有限,严重影响了患者的生活质量。本研究旨在评估自体脂肪注射喉成形术(AFIL)和经颈环咽肌切开术(TCPM)联合手术方法对侧颅底手术后严重发音障碍和吞咽困难患者的安全性和有效性:我们对16名同时接受AFIL和TCPM手术以改善侧颅底手术后严重发音障碍和吞咽困难的患者进行了回顾性分析。使用嗓音障碍指数-10(VHI-10)、GRBAS量表、最大发音时间(MPT)、中文版吞咽生活质量问卷(CSWAL-QOL)和视频荧光屏吞咽研究(VFSS)对患者的嗓音和吞咽功能进行术前和术后评估:结果显示,患者的嗓音质量和吞咽功能均有明显改善。VHI-10 评分从术前的平均值(32.06 ± 4.92)显著提高到术后的(9.06 ± 5.24)。GRBAS 量表的感知参数结果也有明显改善。MPT 从术前的平均 3.91 ± 1.00 秒增加到术后的 9.14 ± 2.44 秒。CSWAL-QOL 评分从术前的 92.44 ± 17.75 分明显提高到术后的 130.19 ± 26.07 分。VFSS-SWAL 评分从术前的 6.63 ± 1.36 降至术后的 3.56 ± 1.58。同样,穿刺抽吸量表(PAS)评分也从术前的 6.38 ± 1.05 显著降至术后的 2.93 ± 1.48。11 名患者中有 9 名在术后成功拔除了胃管。术后无明显并发症:结论:对于侧颅底手术后出现严重发音障碍和吞咽困难的患者,并发AFIL和TCPM是一种很有前景的重建方法,凸显了其在复杂下颅神经损伤术后治疗中的价值。
{"title":"Management of Severe Dysphonia and Dysphagia Following Lateral Skull Base Surgery.","authors":"Yanyan Niu, Chuan Chen, Xiaofeng Jin, Hong Huo, Tingting Cui, Jian Wang","doi":"10.1016/j.jvoice.2024.08.031","DOIUrl":"10.1016/j.jvoice.2024.08.031","url":null,"abstract":"<p><strong>Objectives: </strong>Lateral skull base surgeries pose a risk of injuring the lower cranial nerves, leading to potential postoperative complications such as dysphonia and dysphagia. Conservative treatments have shown limited efficacy in addressing these resultant voice and swallowing dysfunctions, significantly impacting patient quality of life. This study aims to evaluate the safety and effectiveness of a combined surgical approach involving autologous fat injection laryngoplasty (AFIL) and transcervical cricopharyngeal myotomy (TCPM) in patients suffering from severe dysphonia and dysphagia following lateral skull base surgery.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on 16 patients who underwent concurrent AFIL and TCPM to improve severe dysphonia and dysphagia following lateral skull base surgery. Preoperative and postoperative assessments of voice and swallowing functions were performed using the Voice Handicap Index-10 (VHI-10), GRBAS scale, maximum phonation time (MPT), the Chinese version of Swallow Quality-of-Life Questionnaire (CSWAL-QOL), and videofluoroscopic swallowing studies (VFSS).</p><p><strong>Results: </strong>The results demonstrated notable improvements in voice quality and swallowing function. The VHI-10 score improved significantly from a preoperative mean of 32.06 ± 4.92 to a postoperative 9.06 ± 5.24. The results of the perceptual parameters of the GRBAS scale also improved significantly. The MPT increased from a preoperative average of 3.91 ± 1.00 seconds to 9.14 ± 2.44 seconds postoperatively. The CSWAL-QOL scores significantly improved from a preoperative score of 92.44 ± 17.75 to 130.19 ± 26.07 postoperatively. The VFSS-SWAL scores decreased from 6.63 ± 1.36 before surgery to 3.56 ± 1.58 after surgery. Similarly, the Penetration Aspiration Scale (PAS) scores significantly dropped from 6.38 ± 1.05 preoperatively to 2.93 ± 1.48 postoperatively. Nine out of 11 patients were able to have their gastric tubes successfully removed after surgery. There were no significant postoperative complications.</p><p><strong>Conclusion: </strong>Concurrent AFIL and TCPM present a promising reconstructive method for patients experiencing severe dysphonia and dysphagia following lateral skull base surgery, highlighting its value in the postoperative management of complex lower cranial nerve injuries.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142376250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prevalence of Voice Problems and Associated Risk Factors in Islamic Religious Orators. 伊斯兰宗教演说家嗓音问题的普遍性及相关风险因素
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-10-03 DOI: 10.1016/j.jvoice.2024.08.036
Pathangalil Sreekumar Sujitha, Riza Zamrin Kabeer, Ridha Fathima

Islamic leadership is a diverse profession and previous research has highlighted the increased risk of voice problems among Islamic leaders. The prevalence and risk factors for voice problems among Malayalam-speaking Islamic religious orators have not been adequately investigated. This study aimed to assess the prevalence of self-reported voice problems among Islamic religious orators who recite the Adhaan and those who do not and to examine the impact of age and associated risk factors. This study employed a nonexperimental research design using survey research. The current findings report a high prevalence of voice problems among Islamic Adhaan reciters and suggest that age-related differences may influence the risk factors associated with voice problems in this population. Only two participants from the older group of Adhaan reciters had received voice therapy, suggesting that the majority of Islamic religious reciters lack knowledge about voice care and voice therapy. A limitation of the study is the lack of otorhinolaryngological examination, particularly laryngoscopy and voice analysis of Islamic religious orators. Further studies can investigate the effect of vocal hygiene training or similar voice management programs on Islamic religious orators' voices.

伊斯兰教领袖是一个多样化的职业,以往的研究强调了伊斯兰教领袖出现嗓音问题的风险增加。有关讲马拉雅拉姆语的伊斯兰宗教演说家中嗓音问题的普遍性和风险因素的研究还不够充分。本研究旨在评估背诵《阿丹经》的伊斯兰宗教演说家和不背诵《阿丹经》的伊斯兰宗教演说家自我报告的嗓音问题发生率,并研究年龄和相关风险因素的影响。本研究采用调查研究的非实验性研究设计。目前的研究结果表明,嗓音问题在伊斯兰教阿丹诵读者中的发生率很高,并表明与年龄有关的差异可能会影响与该人群嗓音问题相关的风险因素。老年阿丹朗诵者群体中只有两名参与者接受过嗓音治疗,这表明大多数伊斯兰宗教朗诵者缺乏有关嗓音保健和嗓音治疗的知识。这项研究的局限性在于没有对伊斯兰宗教朗诵者进行耳鼻喉科检查,特别是喉镜检查和嗓音分析。进一步的研究可以探讨嗓音卫生训练或类似的嗓音管理计划对伊斯兰宗教朗诵者嗓音的影响。
{"title":"Prevalence of Voice Problems and Associated Risk Factors in Islamic Religious Orators.","authors":"Pathangalil Sreekumar Sujitha, Riza Zamrin Kabeer, Ridha Fathima","doi":"10.1016/j.jvoice.2024.08.036","DOIUrl":"10.1016/j.jvoice.2024.08.036","url":null,"abstract":"<p><p>Islamic leadership is a diverse profession and previous research has highlighted the increased risk of voice problems among Islamic leaders. The prevalence and risk factors for voice problems among Malayalam-speaking Islamic religious orators have not been adequately investigated. This study aimed to assess the prevalence of self-reported voice problems among Islamic religious orators who recite the Adhaan and those who do not and to examine the impact of age and associated risk factors. This study employed a nonexperimental research design using survey research. The current findings report a high prevalence of voice problems among Islamic Adhaan reciters and suggest that age-related differences may influence the risk factors associated with voice problems in this population. Only two participants from the older group of Adhaan reciters had received voice therapy, suggesting that the majority of Islamic religious reciters lack knowledge about voice care and voice therapy. A limitation of the study is the lack of otorhinolaryngological examination, particularly laryngoscopy and voice analysis of Islamic religious orators. Further studies can investigate the effect of vocal hygiene training or similar voice management programs on Islamic religious orators' voices.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142376251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge and Practice Regarding Vocal Hygiene Among Teachers of Selected Schools, Kathmandu. 加德满都部分学校教师对嗓音卫生的认识和实践。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-10-01 DOI: 10.1016/j.jvoice.2024.08.025
Bibitsha Gurung, Mohammad Ali Imran, Sayma Suraiya, Pramila Pudasaini Thapa, Vitalii Nitsenko, Mariia Tepliuk

Background: Vocal hygiene (VH) is the practice of taking care of the voice and minimizing strain on it. This comprises avoiding smoking, alcohol, and other harmful substances, remaining hydrated, and practicing proper breathing techniques. This study intended to evaluate the level of knowledge and practices concerning VH.

Methodology: A cross-sectional study design was utilized to conduct the study among teachers in Greenland Higher Secondary School, Basundhara National Academy, and Nirmal Batika Academy using a self-administered structured questionnaire. A sample of 122 teachers was taken using a complete enumerative sampling technique. The data were collected from October 1 to October 16 and were edited, coded, and entered into the Statistical Package for Social Science for further analysis. Data were examined using descriptive (frequency, percentage mean, and standard deviation), inferential statistics (chi-square test), and Karl Pearson's test.

Findings: The study results presented that nearly half, 49.2% of the respondents, had a moderate level of knowledge, 31.1% had an inadequate level of knowledge, and 19.7% had an adequate level of knowledge regarding VH. The results discovered that 73% of the respondents had good practice, and 27% had poor practice regarding VH. There was a statistically significant association only between the level of knowledge regarding VH and the number of classes taken per week (P = 0.0053).

Value: The study showed that the majority of teachers have moderate knowledge about VH, whereas there is good practice among teachers. The results strongly recommend the need for VH training sessions for teachers.

背景:嗓音卫生(VH)是指对嗓音进行护理并尽量减少对嗓音造成的负担。这包括避免吸烟、饮酒和其他有害物质,保持水分充足,以及练习正确的呼吸技巧。本研究旨在评估有关嗓音保健的知识和实践水平:本研究采用横断面研究设计,使用自填式结构问卷对格陵兰高级中学、巴松达拉国家学院和尼尔马尔-巴蒂卡学院的教师进行调查。采用完全计数抽样技术,抽取了 122 名教师样本。数据收集时间为 10 月 1 日至 10 月 16 日,经编辑、编码后输入社会科学统计软件包进行进一步分析。使用描述性统计(频率、百分比均值和标准差)、推断性统计(卡方检验)和卡尔-皮尔逊检验对数据进行了检验:研究结果表明,近一半的受访者(49.2%)对性健康知识的了解处于中等水平,31.1%的受访者对性健康知识的了解处于不足水平,19.7%的受访者对性健康知识的了解处于充分水平。结果发现,73% 的受访者在自愿生殖健康方面的做法良好,27% 的做法较差。在统计学上,教师对性健康和生殖健康的了解程度与每周上课次数之间存在明显的联系(P = 0.0053):研究结果表明,大多数教师对 VH 的了解程度一般,而教师的实践情况良好。研究结果强烈建议有必要为教师举办自愿洗手培训课程。
{"title":"Knowledge and Practice Regarding Vocal Hygiene Among Teachers of Selected Schools, Kathmandu.","authors":"Bibitsha Gurung, Mohammad Ali Imran, Sayma Suraiya, Pramila Pudasaini Thapa, Vitalii Nitsenko, Mariia Tepliuk","doi":"10.1016/j.jvoice.2024.08.025","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.08.025","url":null,"abstract":"<p><strong>Background: </strong>Vocal hygiene (VH) is the practice of taking care of the voice and minimizing strain on it. This comprises avoiding smoking, alcohol, and other harmful substances, remaining hydrated, and practicing proper breathing techniques. This study intended to evaluate the level of knowledge and practices concerning VH.</p><p><strong>Methodology: </strong>A cross-sectional study design was utilized to conduct the study among teachers in Greenland Higher Secondary School, Basundhara National Academy, and Nirmal Batika Academy using a self-administered structured questionnaire. A sample of 122 teachers was taken using a complete enumerative sampling technique. The data were collected from October 1 to October 16 and were edited, coded, and entered into the Statistical Package for Social Science for further analysis. Data were examined using descriptive (frequency, percentage mean, and standard deviation), inferential statistics (chi-square test), and Karl Pearson's test.</p><p><strong>Findings: </strong>The study results presented that nearly half, 49.2% of the respondents, had a moderate level of knowledge, 31.1% had an inadequate level of knowledge, and 19.7% had an adequate level of knowledge regarding VH. The results discovered that 73% of the respondents had good practice, and 27% had poor practice regarding VH. There was a statistically significant association only between the level of knowledge regarding VH and the number of classes taken per week (P = 0.0053).</p><p><strong>Value: </strong>The study showed that the majority of teachers have moderate knowledge about VH, whereas there is good practice among teachers. The results strongly recommend the need for VH training sessions for teachers.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142367232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficacy of Long-Term Intensive Straw Phonation Exercise in Air for Hyperfunctional Voice Disorders: A Preliminary Study. 空气中长期强化吸管发音练习对高功能性嗓音障碍的疗效:初步研究。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-09-30 DOI: 10.1016/j.jvoice.2024.09.011
Sumanth Paramesh, Krishna Yeshoda

Purpose: Literature highlights the efficacy of prolonged use of straw phonation for vocally healthy individuals. Nevertheless, minimal studies have examined the prolonged use of single treatment in hyperfunctional voice disorders (HFVD), as straw phonation is used as physiological training to improve voice quality in these individuals. The present study aimed to investigate the long-term intensive training effect of straw phonation exercise in air (SPEA) for HFVD.

Method: In a time series design, seven females with HFVD (mean age 42.28years) participated in SPEA for 12 long sessions (30 minutes/session) intensively trained for 3weeks (4days/week). The participant-recorded voice samples were analyzed for aerodynamic measures Aerodynamic Subglottic Pressure (ASP), Airflow Rate (AFR), glottal behavior measures Contact Quotient (CQ), Contact Quotient Range (CQR), acoustic measure Acoustic Voice Quality Index (AVQI), auditory-perceptual Consensus Auditory Perceptual Evaluation of Voice - Kannada (CAPE-V-K), and self-perceptual measure Voice Handicap Index - Kannada (VHI-K) at pre-therapy, mid-therapy, post therapy, and 1-month follow-up timelines. Related Friedman's two-way analysis of variance and post hoc Wilcoxon signed-rank test were used to analyze the change in outcome measures across therapy timelines.

Results: Significant reduction in ASP, CQR, AVQI, CAPE-V-K, and VHI-K measures from pre-therapy to post-therapy and follow-up timelines was observed. Although there was a decrease in AFR and CQ measures from pre therapy to post therapy and follow-up, no significant differences were observed. No significant changes were observed from pre therapy to mid-therapy in any outcome measures, indicating the need for extended therapy duration and regular practice.

Conclusion: These findings provide preliminary evidence of implementing the long-term intensive training of SPEA for HFVD with the support of significant outcome measures at various therapy timelines.

目的:文献强调了长期使用吸管发音法对嗓音健康者的疗效。然而,很少有研究对嗓音功能亢进症(HFVD)患者长期使用单一疗法进行研究,因为稻草发音法被用作改善这些患者嗓音质量的生理训练。本研究旨在探讨空气中吸管发音训练(SPEA)对高功能性嗓音失调症的长期强化训练效果:在时间序列设计中,7 名患有高频心血管疾病的女性(平均年龄 42.28 岁)参加了 12 次长时段(30 分钟/时段)的 SPEA,强化训练为期 3 周(4 天/周)。对参与者录制的语音样本进行了分析,包括空气动力学指标声门下压力(ASP)、气流率(AFR),声门行为指标接触商数(CQ)、接触商数范围(CQR),声学指标声学嗓音质量指数(AVQI)、在治疗前、治疗中、治疗后和 1 个月的随访中,对听觉感知的共识听觉感知嗓音评估--卡纳达语(CAPE-V-K)和自我感知的嗓音障碍指数--卡纳达语(VHI-K)进行测量。相关的弗里德曼双向方差分析和事后的 Wilcoxon 符号秩检验用于分析不同治疗时间的结果指标的变化:结果:观察到从治疗前到治疗后和随访期间,ASP、CQR、AVQI、CAPE-V-K 和 VHI-K 的测量值显著下降。虽然从治疗前到治疗后和随访期间,AFR 和 CQ 指标有所下降,但未观察到显著差异。从治疗前到治疗中期,没有观察到任何结果指标有明显变化,这表明需要延长治疗时间和定期练习:这些研究结果提供了对高频心血管病实施 SPEA 长期强化训练的初步证据,并支持在不同治疗时间段的重要结果测量。
{"title":"Efficacy of Long-Term Intensive Straw Phonation Exercise in Air for Hyperfunctional Voice Disorders: A Preliminary Study.","authors":"Sumanth Paramesh, Krishna Yeshoda","doi":"10.1016/j.jvoice.2024.09.011","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.09.011","url":null,"abstract":"<p><strong>Purpose: </strong>Literature highlights the efficacy of prolonged use of straw phonation for vocally healthy individuals. Nevertheless, minimal studies have examined the prolonged use of single treatment in hyperfunctional voice disorders (HFVD), as straw phonation is used as physiological training to improve voice quality in these individuals. The present study aimed to investigate the long-term intensive training effect of straw phonation exercise in air (SPEA) for HFVD.</p><p><strong>Method: </strong>In a time series design, seven females with HFVD (mean age 42.28years) participated in SPEA for 12 long sessions (30 minutes/session) intensively trained for 3weeks (4days/week). The participant-recorded voice samples were analyzed for aerodynamic measures Aerodynamic Subglottic Pressure (ASP), Airflow Rate (AFR), glottal behavior measures Contact Quotient (CQ), Contact Quotient Range (CQR), acoustic measure Acoustic Voice Quality Index (AVQI), auditory-perceptual Consensus Auditory Perceptual Evaluation of Voice - Kannada (CAPE-V-K), and self-perceptual measure Voice Handicap Index - Kannada (VHI-K) at pre-therapy, mid-therapy, post therapy, and 1-month follow-up timelines. Related Friedman's two-way analysis of variance and post hoc Wilcoxon signed-rank test were used to analyze the change in outcome measures across therapy timelines.</p><p><strong>Results: </strong>Significant reduction in ASP, CQR, AVQI, CAPE-V-K, and VHI-K measures from pre-therapy to post-therapy and follow-up timelines was observed. Although there was a decrease in AFR and CQ measures from pre therapy to post therapy and follow-up, no significant differences were observed. No significant changes were observed from pre therapy to mid-therapy in any outcome measures, indicating the need for extended therapy duration and regular practice.</p><p><strong>Conclusion: </strong>These findings provide preliminary evidence of implementing the long-term intensive training of SPEA for HFVD with the support of significant outcome measures at various therapy timelines.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142367231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is Postoperative Voice Rest Unnecessary? A Systematic Review and Meta-analysis of Voice Rest Recommendation Outcomes. 术后嗓音休息是否不必要?嗓音休息建议结果的系统回顾和元分析》。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-09-28 DOI: 10.1016/j.jvoice.2024.09.013
Doreen Lam, Katherine Xu, Natasha Mirza

Background: Voice rest (VR) is widely recommended after microlaryngeal surgery to facilitate recovery and improve voice outcomes. Our study is the first systematic review and meta-analysis summarizing the impact of postoperative absolute voice rest (AVR) and no voice rest (NVR) instructions on voice outcomes.

Methods: PubMed, Embase, and Cochrane Library databases were searched using "voice rest laryngeal surgery" and "postoperative voice rest" for articles published before December 2022. Risk of bias was assessed using ROBINS-I and RoB2 tools. Meta-analysis using a random effects model was performed for studies comparing Voice Handicap Index (VHI-10) outcomes between NVR and AVR. Analysis was performed in R Studio.

Results: In total, 255 articles were reviewed, 24 underwent full-text screening, and nine met inclusion criteria. Four randomized control trials (RCT) and one retrospective review compared AVR durations (range: 2-10days). Four studies (two cohort, one cross-sectional, and one RCT) compared AVR to NVR. All studies had risk of bias (ROBINS-I: two moderate, two serious; RoB2: five with concerns). Comparing AVR durations, two found no difference between short and long duration, while two reported improved outcomes for the short cohort. In studies comparing AVR to NVR, all concluded no significant difference in outcomes. Pooled analysis of three studies (355 patients) comparing NVR and AVR demonstrated no significant differences in pre- and postoperative VHI-10 change (mean difference=-0.87; 95% CI, -2.51 to 0.77; P = 0.27).

Conclusion: Systematic review findings indicate postoperative VR may not lead to improved voice outcomes, and a meta-analysis demonstrated no difference in VHI-10 outcomes between AVR and NVR.

Level of evidence: 2:

背景:喉显微手术后普遍建议进行嗓音休息(VR),以促进恢复并改善嗓音效果。我们的研究是第一项系统回顾和荟萃分析,总结了术后绝对静声(AVR)和不静声(NVR)指导对嗓音效果的影响:在PubMed、Embase和Cochrane图书馆数据库中使用 "喉部手术禁声 "和 "术后禁声 "检索2022年12月之前发表的文章。使用 ROBINS-I 和 RoB2 工具评估偏倚风险。使用随机效应模型对比较 NVR 和 AVR 的嗓音障碍指数(VHI-10)结果的研究进行了 Meta 分析。分析在 R Studio 中进行:共审查了 255 篇文章,其中 24 篇经过全文筛选,9 篇符合纳入标准。四项随机对照试验(RCT)和一项回顾性研究比较了 AVR 的持续时间(范围:2-10 天)。四项研究(两项队列研究、一项横断面研究和一项 RCT)比较了 AVR 和 NVR。所有研究均存在偏倚风险(ROBINS-I:2 项中度偏倚,2 项严重偏倚;RoB2:5 项存在偏倚风险)。在比较 AVR 持续时间的研究中,有两项研究发现短持续时间和长持续时间之间没有差异,而有两项研究报告称短持续时间组群的预后有所改善。在比较 AVR 和 NVR 的研究中,所有研究都认为结果没有显著差异。对三项比较 NVR 和 AVR 的研究(355 名患者)进行的汇总分析表明,术前和术后 VHI-10 变化无显著差异(平均差异=-0.87;95% CI,-2.51 至 0.77;P = 0.27):系统综述结果表明术后VR可能不会改善嗓音预后,荟萃分析表明AVR和NVR的VHI-10预后无差异:
{"title":"Is Postoperative Voice Rest Unnecessary? A Systematic Review and Meta-analysis of Voice Rest Recommendation Outcomes.","authors":"Doreen Lam, Katherine Xu, Natasha Mirza","doi":"10.1016/j.jvoice.2024.09.013","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.09.013","url":null,"abstract":"<p><strong>Background: </strong>Voice rest (VR) is widely recommended after microlaryngeal surgery to facilitate recovery and improve voice outcomes. Our study is the first systematic review and meta-analysis summarizing the impact of postoperative absolute voice rest (AVR) and no voice rest (NVR) instructions on voice outcomes.</p><p><strong>Methods: </strong>PubMed, Embase, and Cochrane Library databases were searched using \"voice rest laryngeal surgery\" and \"postoperative voice rest\" for articles published before December 2022. Risk of bias was assessed using ROBINS-I and RoB2 tools. Meta-analysis using a random effects model was performed for studies comparing Voice Handicap Index (VHI-10) outcomes between NVR and AVR. Analysis was performed in R Studio.</p><p><strong>Results: </strong>In total, 255 articles were reviewed, 24 underwent full-text screening, and nine met inclusion criteria. Four randomized control trials (RCT) and one retrospective review compared AVR durations (range: 2-10days). Four studies (two cohort, one cross-sectional, and one RCT) compared AVR to NVR. All studies had risk of bias (ROBINS-I: two moderate, two serious; RoB2: five with concerns). Comparing AVR durations, two found no difference between short and long duration, while two reported improved outcomes for the short cohort. In studies comparing AVR to NVR, all concluded no significant difference in outcomes. Pooled analysis of three studies (355 patients) comparing NVR and AVR demonstrated no significant differences in pre- and postoperative VHI-10 change (mean difference=-0.87; 95% CI, -2.51 to 0.77; P = 0.27).</p><p><strong>Conclusion: </strong>Systematic review findings indicate postoperative VR may not lead to improved voice outcomes, and a meta-analysis demonstrated no difference in VHI-10 outcomes between AVR and NVR.</p><p><strong>Level of evidence: 2: </strong></p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142331291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Voice Disorder Classification Using Wav2vec 2.0 Feature Extraction. 利用 Wav2vec 2.0 特征提取进行语音障碍分类
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-09-25 DOI: 10.1016/j.jvoice.2024.09.002
Jie Cai, Yuliang Song, Jianghao Wu, Xiong Chen

Objectives: The study aims to classify normal and pathological voices by leveraging the wav2vec 2.0 model as a feature extraction method in conjunction with machine learning classifiers.

Methods: Voice recordings were sourced from the publicly accessible VOICED database. The data underwent preprocessing, including normalization and data augmentation, before being input into the wav2vec 2.0 model for feature extraction. The extracted features were then used to train four machine learning models-Support Vector Machine (SVM), K-Nearest Neighbors, Decision Tree (DT), and Random Forest (RF)-which were evaluated using Stratified K-Fold cross-validation. Performance metrics such as accuracy, precision, recall, F1-score, macro average, micro average, receiver-operating characteristic (ROC) curve, and confusion matrix were utilized to assess model performance.

Results: The RF model achieved the highest accuracy (0.98 ± 0.02), alongside strong recall (0.97 ± 0.04), F1-score (0.95 ± 0.05), and consistently high area under the curve (AUC) values approaching 1.00, indicating superior classification performance. The DT model also demonstrated excellent performance, particularly in precision (0.97 ± 0.02) and F1-score (0.96 ± 0.02), with AUC values ranging from 0.86 to 1.00. Macro-averaged and micro-averaged analyses showed that the DT model provided the most balanced and consistent performance across all classes, while RF model exhibited robust performance across multiple metrics. Additionally, data augmentation significantly enhanced the performance of all models, with marked improvements in accuracy, recall, F1-score, and AUC values, especially notable in the RF and DT models. ROC curve analysis further confirms the consistency and reliability of the RF and SVM models across different folds, while confusion matrix analysis revealed that RF and SVM models had the fewest misclassifications in distinguishing "Normal" and "Pathological" samples. Consequently, RF and DT models emerged as the most robust performers, making them particularly well-suited for the voice classification task in this study.

Conclusions: The method of wav2vec 2.0 combining machine learning models proved highly effective in classifying normal and pathological voices, achieving exceptional accuracy and robustness across various machine evaluation metrics.

研究目的研究旨在利用 wav2vec 2.0 模型作为特征提取方法,结合机器学习分类器,对正常声音和病态声音进行分类:方法: 声音记录来自可公开访问的 VOICED 数据库。在将数据输入 wav2vec 2.0 模型进行特征提取之前,对数据进行了预处理,包括规范化和数据增强。提取的特征随后被用于训练四种机器学习模型--支持向量机(SVM)、K-近邻(K-Nearest Neighbors)、决策树(DT)和随机森林(RF)--这些模型使用分层 K 折交叉验证进行评估。准确率、精确度、召回率、F1-分数、宏观平均值、微观平均值、接收器运行特征曲线(ROC)和混淆矩阵等性能指标被用来评估模型的性能:RF 模型的准确率最高(0.98 ± 0.02),召回率(0.97 ± 0.04)和 F1 分数(0.95 ± 0.05)也很高,曲线下面积(AUC)值一直接近 1.00,显示出卓越的分类性能。DT 模型也表现出色,尤其是在精确度(0.97 ± 0.02)和 F1 分数(0.96 ± 0.02)方面,AUC 值在 0.86 到 1.00 之间。宏观均值和微观均值分析表明,DT 模型在所有类别中提供了最均衡、最一致的性能,而 RF 模型则在多个指标上表现出稳健的性能。此外,数据增强显著提高了所有模型的性能,在准确率、召回率、F1 分数和 AUC 值方面都有明显改善,尤其是 RF 和 DT 模型。ROC 曲线分析进一步证实了 RF 和 SVM 模型在不同褶皱中的一致性和可靠性,而混淆矩阵分析表明,RF 和 SVM 模型在区分 "正常 "和 "病理 "样本时的误分类最少。因此,RF 和 DT 模型表现最为稳健,特别适合本研究中的语音分类任务:结合机器学习模型的 wav2vec 2.0 方法在对正常和病理语音进行分类方面证明非常有效,在各种机器评估指标中都取得了优异的准确性和鲁棒性。
{"title":"Voice Disorder Classification Using Wav2vec 2.0 Feature Extraction.","authors":"Jie Cai, Yuliang Song, Jianghao Wu, Xiong Chen","doi":"10.1016/j.jvoice.2024.09.002","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.09.002","url":null,"abstract":"<p><strong>Objectives: </strong>The study aims to classify normal and pathological voices by leveraging the wav2vec 2.0 model as a feature extraction method in conjunction with machine learning classifiers.</p><p><strong>Methods: </strong>Voice recordings were sourced from the publicly accessible VOICED database. The data underwent preprocessing, including normalization and data augmentation, before being input into the wav2vec 2.0 model for feature extraction. The extracted features were then used to train four machine learning models-Support Vector Machine (SVM), K-Nearest Neighbors, Decision Tree (DT), and Random Forest (RF)-which were evaluated using Stratified K-Fold cross-validation. Performance metrics such as accuracy, precision, recall, F1-score, macro average, micro average, receiver-operating characteristic (ROC) curve, and confusion matrix were utilized to assess model performance.</p><p><strong>Results: </strong>The RF model achieved the highest accuracy (0.98 ± 0.02), alongside strong recall (0.97 ± 0.04), F1-score (0.95 ± 0.05), and consistently high area under the curve (AUC) values approaching 1.00, indicating superior classification performance. The DT model also demonstrated excellent performance, particularly in precision (0.97 ± 0.02) and F1-score (0.96 ± 0.02), with AUC values ranging from 0.86 to 1.00. Macro-averaged and micro-averaged analyses showed that the DT model provided the most balanced and consistent performance across all classes, while RF model exhibited robust performance across multiple metrics. Additionally, data augmentation significantly enhanced the performance of all models, with marked improvements in accuracy, recall, F1-score, and AUC values, especially notable in the RF and DT models. ROC curve analysis further confirms the consistency and reliability of the RF and SVM models across different folds, while confusion matrix analysis revealed that RF and SVM models had the fewest misclassifications in distinguishing \"Normal\" and \"Pathological\" samples. Consequently, RF and DT models emerged as the most robust performers, making them particularly well-suited for the voice classification task in this study.</p><p><strong>Conclusions: </strong>The method of wav2vec 2.0 combining machine learning models proved highly effective in classifying normal and pathological voices, achieving exceptional accuracy and robustness across various machine evaluation metrics.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142331292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Complications and Revisions for Adduction Arytenopexy, Medialization Laryngoplasty, and Cricothyroid Subluxation over 19 Years. 19 年来,Adduction Arytenopexy、Medialization Laryngoplasty 和 Cricothyroid Subluxation 的并发症和修正。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-09-25 DOI: 10.1016/j.jvoice.2024.05.006
Ozlem Onerci Celebi, Sungjin A Song, Alena Santeerapharp, Kanittha Choksawad, Ramon A Franco

Objective: To report the long-term data of the Triple procedure (medialization laryngoplasty, adduction arytenopexy, and cricothyroid subluxation) regarding complications and revisions in a large cohort of patients.

Study design: Retrospective case series.

Methods: This study included patients who underwent ≥1 components of the Triple procedure between January 2000 and July 2019. Demographic data, etiology of paralysis, duration of follow-up, complications, revision surgeries, and touch-up injections were noted from retrospective chart review.

Results: Of the 222 patients who underwent ≥1 of the Triple procedure components, 86 underwent medialization laryngoplasty alone and were excluded from the study. The remaining 136 underwent ≥1 components of the Triple procedure other than medialization laryngoplasty alone. The overall surgical complication rate was 7.3% (10/136) and no intraoperative complications were noted. Of the 10 complications, four were implant extrusions, four were hematoma, and two were rupture of the arytenoid fixation suture. In all, 20 of the 136 cases subsequently required revision surgery (14.7%) at a mean of 57.3months after the initial surgery.

Conclusion: The present findings show that the Triple procedure, or its subcomponents, can be performed with few complications and acceptable revision rates.

目的报告三重手术(内侧化喉头成形术、内收杓状咽成形术和环甲膜下移术)在大量患者中关于并发症和翻修的长期数据:研究设计:回顾性病例系列:本研究纳入了2000年1月至2019年7月期间接受过≥1项Triple手术的患者。回顾性病历记录了患者的人口统计学数据、瘫痪病因、随访时间、并发症、翻修手术和补针情况:在222名接受了≥1项三联手术的患者中,有86名仅接受了喉内侧化成形术,被排除在研究之外。其余 136 名患者除单独接受喉内侧化成形术外,还接受了≥1 项三重手术。总体手术并发症发生率为 7.3%(10/136),未发现术中并发症。在 10 例并发症中,4 例为假体挤出,4 例为血肿,2 例为杓状固定缝线断裂。总之,136 例病例中有 20 例(14.7%)在首次手术后平均 57.3 个月需要进行翻修手术:本研究结果表明,Triple 手术或其子部分在实施过程中并发症少,翻修率可接受。
{"title":"Complications and Revisions for Adduction Arytenopexy, Medialization Laryngoplasty, and Cricothyroid Subluxation over 19 Years.","authors":"Ozlem Onerci Celebi, Sungjin A Song, Alena Santeerapharp, Kanittha Choksawad, Ramon A Franco","doi":"10.1016/j.jvoice.2024.05.006","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.05.006","url":null,"abstract":"<p><strong>Objective: </strong>To report the long-term data of the Triple procedure (medialization laryngoplasty, adduction arytenopexy, and cricothyroid subluxation) regarding complications and revisions in a large cohort of patients.</p><p><strong>Study design: </strong>Retrospective case series.</p><p><strong>Methods: </strong>This study included patients who underwent ≥1 components of the Triple procedure between January 2000 and July 2019. Demographic data, etiology of paralysis, duration of follow-up, complications, revision surgeries, and touch-up injections were noted from retrospective chart review.</p><p><strong>Results: </strong>Of the 222 patients who underwent ≥1 of the Triple procedure components, 86 underwent medialization laryngoplasty alone and were excluded from the study. The remaining 136 underwent ≥1 components of the Triple procedure other than medialization laryngoplasty alone. The overall surgical complication rate was 7.3% (10/136) and no intraoperative complications were noted. Of the 10 complications, four were implant extrusions, four were hematoma, and two were rupture of the arytenoid fixation suture. In all, 20 of the 136 cases subsequently required revision surgery (14.7%) at a mean of 57.3months after the initial surgery.</p><p><strong>Conclusion: </strong>The present findings show that the Triple procedure, or its subcomponents, can be performed with few complications and acceptable revision rates.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142331290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acoustic Analysis of Mandarin-Speaking Transgender Women. 对讲普通话的变性女性进行声学分析。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-09-24 DOI: 10.1016/j.jvoice.2024.08.037
Tianyi Huang, Xiaoyu Wang, Tianen Xu, Wensheng Zhao, Yanjing Cao, Hakyung Kim, Bin Yi
<p><strong>Objectives: </strong>This study aims to investigate the speech characteristics and assess the potential risk of voice fatigue and voice disorders in Chinese transgender women (TW).</p><p><strong>Methods: </strong>A case-control study was conducted involving TW recruited in Shanghai, China. The participants included 15 TW, 20 cisgender men (CISM), and 20 cisgender women (CISW). Acoustic parameters including formants (F<sub>1</sub>, F<sub>2</sub>, F<sub>3</sub>, F<sub>4</sub>), cepstral peak prominence (CPP), jitter, shimmer, harmonic-to-noise ratio (HNR), noise-to-harmonics (NHR), fundamental frequency (f<sub>0</sub>), and intensity, across vowels, passages, and free talking. Additionally, the Voice Handicap Index-10 (VHI-10) and the Voice Fatigue Index were administered to evaluate voice-related concerns.</p><p><strong>Results: </strong>(1) The F<sub>1</sub> of TW was significantly higher than that of CISW for the vowels /i/ and /u/, and significantly higher than that of CISM for the vowels /a/, /i/, and /u/. The F<sub>2</sub> of TW was significantly lower than CISW for the vowels /i/, significantly higher than CISW for the vowels /u/, and significantly higher than CISM for the vowels /a/ and /u/. F<sub>3</sub> was significantly lower in TW than in CISW for the vowels /a/ and /i/. The F4 formant was significantly lower in TW than in CISW for the vowels /a/ and /i/, but significantly higher than in CISM for the vowel /u/. (2) The f<sub>0</sub> of TW was significantly lower than that of CISW for the vowels /a/, /i/, /u/, during passage reading, and in free speech, but was significantly higher than CISM during passage reading and free talking. Additionally, TW exhibited significantly higher intensity compared with CISW for the vowel /a/ and during passage reading. (3) Jitter in TW was significantly higher than in CISW for the vowels /i/ and /u/, and significantly lower than in CISM during passage reading and free talking. Shimmer was significantly higher in TW compared with both CISW and CISM across the vowels /a/, /i/, during passage reading, and in free talking. The HNR in TW was significantly lower than in both CISW and CISM across all vowels, during passage reading, and in free talking. The NHR was significantly higher in TW than in CISW across all vowels, during passage reading, and in free talking, and significantly higher than in CISM for the vowels /a/, /i/, during passage reading, and in free talking. The CPP in TW was significantly lower than in CISW during passage reading and free talking, and significantly lower than in CISM across all vowels, during passage reading, and in free speech. (4) The VHI-10 scores were significantly higher in TW compared with both CISM and CISW.</p><p><strong>Conclusions: </strong>TW exhibit certain acoustic parameters, such as f<sub>0</sub> and some of the formants, that fall between those of CISW and CISM without undergoing phonosurgery or voice training. The findings suggest a potential risk for
研究目的本研究旨在调查中国变性女性(TW)的语音特征,并评估其嗓音疲劳和嗓音障碍的潜在风险:方法:在中国上海进行了一项变性女性病例对照研究。参与者包括 15 名变性女性、20 名顺性男性(CISM)和 20 名顺性女性(CISW)。声学参数包括声母(F1、F2、F3、F4)、epstral peak prominence(CPP)、抖动、闪烁、谐噪比(HNR)、噪声-谐波(NHR)、基频(f0)和强度。结果:(1) TW 的 F1 在元音 /i/ 和 /u/ 时明显高于 CISW,在元音 /a/、/i/ 和 /u/ 时明显高于 CISM。对于元音 /i/ 而言,TW 的 F2 明显低于 CISW,对于元音 /u/ 而言,TW 的 F2 明显高于 CISW,而对于元音 /a/ 和 /u/ 而言,TW 的 F2 则明显高于 CISM。元音 /a/ 和 /i/ 的 F3 在 TW 中明显低于 CISW。(2) 在元音 /a/、/i/、/u/、段落朗读和自由说话时,TW 的 f0 明显低于 CISW,但在段落朗读和自由说话时则明显高于 CISM。此外,在元音 /a/ 和段落阅读时,TW 的强度明显高于 CISW。(3) 在元音 /i/ 和 /u/ 时,TW 的抖动明显高于 CISW,而在段落阅读和自由交谈时,TW 的抖动明显低于 CISM。与 CISW 和 CISM 相比,在元音 /a/、/i/、段落阅读和自由交谈中,TW 的微光明显更高。在所有元音、段落阅读和自由交谈中,TW 的 HNR 都明显低于 CISW 和 CISM。在所有元音、段落阅读和自由交谈中,TW 的 NHR 都明显高于 CISW;在元音 /a/、/i/、段落阅读和自由交谈中,TW 的 NHR 明显高于 CISM。在段落阅读和自由交谈时,TW 的 CPP 明显低于 CISW,而在所有元音、段落阅读和自由交谈时,TW 的 CPP 明显低于 CISM。(4) TW 的 VHI-10 分数明显高于 CISM 和 CISW:结论:TW 表现出的某些声学参数,如 f0 和某些前置音,介于 CISW 和 CISM 之间,而无需进行语音手术或语音训练。研究结果表明,当 TW 试图改变自己的声音特征以符合其性别身份时,可能会出现嗓音疲劳和嗓音疾病。
{"title":"Acoustic Analysis of Mandarin-Speaking Transgender Women.","authors":"Tianyi Huang, Xiaoyu Wang, Tianen Xu, Wensheng Zhao, Yanjing Cao, Hakyung Kim, Bin Yi","doi":"10.1016/j.jvoice.2024.08.037","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.08.037","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Objectives: &lt;/strong&gt;This study aims to investigate the speech characteristics and assess the potential risk of voice fatigue and voice disorders in Chinese transgender women (TW).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;A case-control study was conducted involving TW recruited in Shanghai, China. The participants included 15 TW, 20 cisgender men (CISM), and 20 cisgender women (CISW). Acoustic parameters including formants (F&lt;sub&gt;1&lt;/sub&gt;, F&lt;sub&gt;2&lt;/sub&gt;, F&lt;sub&gt;3&lt;/sub&gt;, F&lt;sub&gt;4&lt;/sub&gt;), cepstral peak prominence (CPP), jitter, shimmer, harmonic-to-noise ratio (HNR), noise-to-harmonics (NHR), fundamental frequency (f&lt;sub&gt;0&lt;/sub&gt;), and intensity, across vowels, passages, and free talking. Additionally, the Voice Handicap Index-10 (VHI-10) and the Voice Fatigue Index were administered to evaluate voice-related concerns.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;(1) The F&lt;sub&gt;1&lt;/sub&gt; of TW was significantly higher than that of CISW for the vowels /i/ and /u/, and significantly higher than that of CISM for the vowels /a/, /i/, and /u/. The F&lt;sub&gt;2&lt;/sub&gt; of TW was significantly lower than CISW for the vowels /i/, significantly higher than CISW for the vowels /u/, and significantly higher than CISM for the vowels /a/ and /u/. F&lt;sub&gt;3&lt;/sub&gt; was significantly lower in TW than in CISW for the vowels /a/ and /i/. The F4 formant was significantly lower in TW than in CISW for the vowels /a/ and /i/, but significantly higher than in CISM for the vowel /u/. (2) The f&lt;sub&gt;0&lt;/sub&gt; of TW was significantly lower than that of CISW for the vowels /a/, /i/, /u/, during passage reading, and in free speech, but was significantly higher than CISM during passage reading and free talking. Additionally, TW exhibited significantly higher intensity compared with CISW for the vowel /a/ and during passage reading. (3) Jitter in TW was significantly higher than in CISW for the vowels /i/ and /u/, and significantly lower than in CISM during passage reading and free talking. Shimmer was significantly higher in TW compared with both CISW and CISM across the vowels /a/, /i/, during passage reading, and in free talking. The HNR in TW was significantly lower than in both CISW and CISM across all vowels, during passage reading, and in free talking. The NHR was significantly higher in TW than in CISW across all vowels, during passage reading, and in free talking, and significantly higher than in CISM for the vowels /a/, /i/, during passage reading, and in free talking. The CPP in TW was significantly lower than in CISW during passage reading and free talking, and significantly lower than in CISM across all vowels, during passage reading, and in free speech. (4) The VHI-10 scores were significantly higher in TW compared with both CISM and CISW.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;TW exhibit certain acoustic parameters, such as f&lt;sub&gt;0&lt;/sub&gt; and some of the formants, that fall between those of CISW and CISM without undergoing phonosurgery or voice training. The findings suggest a potential risk for ","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142331289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigation of Interdisciplinary Differences in Perceptual Evaluation of Voice Disorders. 调查嗓音障碍感知评估中的学科间差异。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-09-21 DOI: 10.1016/j.jvoice.2024.08.035
Eylül Birkent, Ahmet Konrot

Ear, nose, and throat (ENT) physicians, speech and language pathologists (SLPs), and singing instructors play complementary roles in the assessment and remediation of voice disorders in performing artists. Given that there are differences among these groups in the training they receive, and that there may be further differences due to their occupational settings and goals, it is unknown whether and to what extent these different disciplines diverge in their perceptions and evaluations of voice quality. Against this background, the present study compared perceptual evaluation of pathological voice by these voice-related disciplines. Using the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) scale, five ENT physicians, five SLPs, and five singing instructors evaluated voice samples recorded from 10 people with pathological voice. The participant groups' scores for each CAPE-V parameter were compared. It was found that the ENT physicians and SLPs were similar in their evaluations, whereas the singing instructors gave higher scores than the ENT physicians and SLPs on several CAPE-V parameters, suggesting that the singing instructors tended to rate the patients' voice quality as more severe on average than ENT physicians and SLPs. These findings highlight the similarities between ENT physicians and SLPs in their perceptual evaluations of pathological voice and suggest that singing instructors may show a more sensitive and heightened perceptual response to pathological voice than the other professional groups, possibly due to differences in occupational experiences and expectations among these voice-related disciplines.

耳鼻喉科(ENT)医生、语言病理学家(SLP)和歌唱导师在表演艺术家嗓音障碍的评估和矫正方面发挥着互补作用。鉴于这些群体所接受的培训存在差异,而且他们的职业背景和目标可能存在进一步的差异,因此这些不同学科对嗓音质量的看法和评价是否存在差异以及差异程度如何尚不得而知。在此背景下,本研究比较了这些嗓音相关学科对病态嗓音的感知评估。五名耳鼻喉科医生、五名嗓音语言康复师和五名歌唱导师使用嗓音听觉知觉评估(CAPE-V)量表,对 10 名病态嗓音患者录制的嗓音样本进行了评估。比较了各组参与者在 CAPE-V 各项参数上的得分。结果发现,耳鼻喉科医生和嗓音康复师的评估结果相似,而歌唱指导员在几项 CAPE-V 参数上的得分要高于耳鼻喉科医生和嗓音康复师,这表明歌唱指导员对患者嗓音质量的平均评价要高于耳鼻喉科医生和嗓音康复师。这些发现凸显了耳鼻喉科医生和语言康复师对病态嗓音感知评估的相似性,并表明歌唱指导员可能比其他专业群体对病态嗓音表现出更敏感和更高的感知反应,这可能是由于这些嗓音相关学科的职业经验和期望值不同造成的。
{"title":"Investigation of Interdisciplinary Differences in Perceptual Evaluation of Voice Disorders.","authors":"Eylül Birkent, Ahmet Konrot","doi":"10.1016/j.jvoice.2024.08.035","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.08.035","url":null,"abstract":"<p><p>Ear, nose, and throat (ENT) physicians, speech and language pathologists (SLPs), and singing instructors play complementary roles in the assessment and remediation of voice disorders in performing artists. Given that there are differences among these groups in the training they receive, and that there may be further differences due to their occupational settings and goals, it is unknown whether and to what extent these different disciplines diverge in their perceptions and evaluations of voice quality. Against this background, the present study compared perceptual evaluation of pathological voice by these voice-related disciplines. Using the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) scale, five ENT physicians, five SLPs, and five singing instructors evaluated voice samples recorded from 10 people with pathological voice. The participant groups' scores for each CAPE-V parameter were compared. It was found that the ENT physicians and SLPs were similar in their evaluations, whereas the singing instructors gave higher scores than the ENT physicians and SLPs on several CAPE-V parameters, suggesting that the singing instructors tended to rate the patients' voice quality as more severe on average than ENT physicians and SLPs. These findings highlight the similarities between ENT physicians and SLPs in their perceptual evaluations of pathological voice and suggest that singing instructors may show a more sensitive and heightened perceptual response to pathological voice than the other professional groups, possibly due to differences in occupational experiences and expectations among these voice-related disciplines.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142299663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of Infant Cry Based on Hybrid Audio Features and ResLSTM. 基于混合音频特征和 ResLSTM 的婴儿哭声分类。
IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-09-20 DOI: 10.1016/j.jvoice.2024.08.022
Yongbo Qiu, Xin Yang, Siqi Yang, Yuyou Gong, Qinrui Lv, Bo Yang

Crying is one of the primary means by which infants communicate with their environment in the early stages of life. These cries can be triggered by physiological factors such as hunger or sleepiness, or by pathological factors such as illness or discomfort. Therefore, analyzing infant cries can assist inexperienced parents in better caring for their babies. Most studies have predominantly utilized a single-speech feature, such as Mel Frequency Cepstral Coefficients (MFCC), for classifying infant cries, while other speech features, such as Mel Spectrogram and Tonnetz, are often overlooked. In this study, we manually designed a hybrid feature set, MMT (including MFCC, Mel Spectrogram, and Tonnetz), and explored its application in infant cry classification. Additionally, we proposed a convolutional neural network based on residual connections and long short-term memory (LSTM) networks, termed ResLSTM. We compared the performance of different deep learning models using the hybrid feature set MMT and the single MFCC feature. This study utilized the Baby Crying, Dunstan Baby Language, and Donate a Cry datasets. The results indicate that the hybrid feature set MMT outperforms the single MFCC feature. The MMT combined with the ResLSTM method achieved the best performance, obtaining accuracy rates of 94.15%, 92.92%, and 95.98% on the three datasets, respectively.

啼哭是婴儿在生命早期阶段与周围环境交流的主要方式之一。引发这些哭声的原因可能是饥饿或困倦等生理因素,也可能是疾病或不适等病理因素。因此,分析婴儿的哭声可以帮助缺乏经验的父母更好地照顾婴儿。大多数研究主要利用单一语音特征(如 Mel Frequency Cepstral Coefficients,MFCC)对婴儿哭声进行分类,而其他语音特征(如 Mel Spectrogram 和 Tonnetz)往往被忽视。在本研究中,我们手动设计了一个混合特征集 MMT(包括 MFCC、Mel Spectrogram 和 Tonnetz),并探索了其在婴儿哭声分类中的应用。此外,我们还提出了一种基于残差连接和长短期记忆(LSTM)网络的卷积神经网络,称为 ResLSTM。我们比较了使用混合特征集 MMT 和单一 MFCC 特征的不同深度学习模型的性能。这项研究使用了婴儿哭声、邓斯坦婴儿语言和 Donate a Cry 数据集。结果表明,混合特征集 MMT 优于单一 MFCC 特征。MMT 与 ResLSTM 方法的结合取得了最佳性能,在三个数据集上的准确率分别为 94.15%、92.92% 和 95.98%。
{"title":"Classification of Infant Cry Based on Hybrid Audio Features and ResLSTM.","authors":"Yongbo Qiu, Xin Yang, Siqi Yang, Yuyou Gong, Qinrui Lv, Bo Yang","doi":"10.1016/j.jvoice.2024.08.022","DOIUrl":"https://doi.org/10.1016/j.jvoice.2024.08.022","url":null,"abstract":"<p><p>Crying is one of the primary means by which infants communicate with their environment in the early stages of life. These cries can be triggered by physiological factors such as hunger or sleepiness, or by pathological factors such as illness or discomfort. Therefore, analyzing infant cries can assist inexperienced parents in better caring for their babies. Most studies have predominantly utilized a single-speech feature, such as Mel Frequency Cepstral Coefficients (MFCC), for classifying infant cries, while other speech features, such as Mel Spectrogram and Tonnetz, are often overlooked. In this study, we manually designed a hybrid feature set, MMT (including MFCC, Mel Spectrogram, and Tonnetz), and explored its application in infant cry classification. Additionally, we proposed a convolutional neural network based on residual connections and long short-term memory (LSTM) networks, termed ResLSTM. We compared the performance of different deep learning models using the hybrid feature set MMT and the single MFCC feature. This study utilized the Baby Crying, Dunstan Baby Language, and Donate a Cry datasets. The results indicate that the hybrid feature set MMT outperforms the single MFCC feature. The MMT combined with the ResLSTM method achieved the best performance, obtaining accuracy rates of 94.15%, 92.92%, and 95.98% on the three datasets, respectively.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142299657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Voice
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1