Detecting of Voice Fatigue With Artificial Intelligence.

IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Journal of Voice Pub Date : 2024-08-24 DOI:10.1016/j.jvoice.2024.08.002
Abhinav Siripurapu, Robert T Sataloff
{"title":"Detecting of Voice Fatigue With Artificial Intelligence.","authors":"Abhinav Siripurapu, Robert T Sataloff","doi":"10.1016/j.jvoice.2024.08.002","DOIUrl":null,"url":null,"abstract":"<p><p>Voice fatigue (VF) has many symptoms and can occur after extended or brief voice use, depending on the presence or absence of voice pathology, and other factors. However, fatigue is difficult to detect and quantify through current approaches. This study explores the use of artificial intelligence (AI) in the automatic detection and analysis of VF, presenting a novel approach to detect and monitor the condition.</p><p><strong>Objective: </strong>This study aims to create an AI-based system for detecting VF. The AI model's performance is evaluated against traditional methods of assessment conducted by speech-language pathologists (SLPs).</p><p><strong>Methods: </strong>Voice samples were collected from individuals experiencing varying levels of VF. To validate these samples, we calculated f<sub>o</sub>, increases that have been shown to be correlated with VF, at the beginning and end of the recordings. The samples were processed using a machine learning model trained to recognize patterns associated with VF. To build the model, we extracted embeddings from an ECAPA-TDNN model that has been shown to capture changes in the voice characteristics of a speaker over time and used a Convolutional Neural Network for classification. To validate the model, the model's accuracy in detecting VF was compared with assessments from SLPs.</p><p><strong>Results: </strong>We achieved an accuracy score of 93% on our dataset of English academic lectures and podcasts. As further validation, we asked three experienced SLPs to classify audio segments from our dataset and compared their responses to the classifications from our model, and achieved an accuracy of 86% as compared to their ratings.</p><p><strong>Conclusion: </strong>The application of AI in the detection of VF shows a generalizable approach for the analysis of VF. Future research will incorporate patient data to validate further the models that we created.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2024.08.002","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Voice fatigue (VF) has many symptoms and can occur after extended or brief voice use, depending on the presence or absence of voice pathology, and other factors. However, fatigue is difficult to detect and quantify through current approaches. This study explores the use of artificial intelligence (AI) in the automatic detection and analysis of VF, presenting a novel approach to detect and monitor the condition.

Objective: This study aims to create an AI-based system for detecting VF. The AI model's performance is evaluated against traditional methods of assessment conducted by speech-language pathologists (SLPs).

Methods: Voice samples were collected from individuals experiencing varying levels of VF. To validate these samples, we calculated fo, increases that have been shown to be correlated with VF, at the beginning and end of the recordings. The samples were processed using a machine learning model trained to recognize patterns associated with VF. To build the model, we extracted embeddings from an ECAPA-TDNN model that has been shown to capture changes in the voice characteristics of a speaker over time and used a Convolutional Neural Network for classification. To validate the model, the model's accuracy in detecting VF was compared with assessments from SLPs.

Results: We achieved an accuracy score of 93% on our dataset of English academic lectures and podcasts. As further validation, we asked three experienced SLPs to classify audio segments from our dataset and compared their responses to the classifications from our model, and achieved an accuracy of 86% as compared to their ratings.

Conclusion: The application of AI in the detection of VF shows a generalizable approach for the analysis of VF. Future research will incorporate patient data to validate further the models that we created.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用人工智能检测语音疲劳
嗓音疲劳(VF)有多种症状,可在长时间或短暂用嗓后出现,取决于是否存在嗓音病变和其他因素。然而,目前的方法很难检测和量化疲劳。本研究探讨了人工智能(AI)在自动检测和分析 VF 中的应用,提出了一种检测和监测 VF 的新方法:本研究旨在创建一个基于人工智能的检测室颤系统。该人工智能模型的性能与语言病理学家(SLP)进行的传统评估方法进行了对比评估:方法:我们收集了不同程度的 VF 患者的语音样本。为了验证这些样本,我们在录音开始和结束时计算了 fo 值,该值的增加已被证明与 VF 相关。我们使用经过训练的机器学习模型对样本进行处理,以识别与 VF 相关的模式。为了建立该模型,我们从 ECAPA-TDNN 模型中提取了嵌入信息,该模型已被证明能捕捉到说话者声音特征随时间的变化,并使用卷积神经网络进行分类。为了验证该模型,我们将该模型检测 VF 的准确率与 SLP 的评估结果进行了比较:我们在英语学术讲座和播客数据集上的准确率达到了 93%。作为进一步验证,我们请三位经验丰富的 SLPs 对数据集中的音频片段进行分类,并将他们的回答与我们模型的分类进行比较,结果与他们的评分相比,准确率达到了 86%:人工智能在心房颤动检测中的应用展示了一种分析心房颤动的通用方法。未来的研究将结合患者数据,进一步验证我们创建的模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Voice
Journal of Voice 医学-耳鼻喉科学
CiteScore
4.00
自引率
13.60%
发文量
395
审稿时长
59 days
期刊介绍: The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.
期刊最新文献
Does the Daily Practice of a Structured Voice Exercise Protocol Affect the Fitness Instructor's Self-Perceived Vocal Effort, Vocal Fatigue, and Voice Handicap? Vocal Effort in Clinical Settings of North and South American Countries: Characterization From Argentinian, Chilean, Colombian, and the United States Clinician's Reports. Anesthetic Techniques for Type-1 (Medialization) Thyroplasty: A Scoping Review. Associations Between Immunological Biomarkers, Voice Use Patterns, and Phonotraumatic Vocal Fold Lesions: A Scoping Review. Correlation Between Anxiety, Depression, and Self-Perceived Hoarseness: A Case Series of 100 Lebanese Patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1