{"title":"用人工智能检测语音疲劳","authors":"Abhinav Siripurapu, Robert T Sataloff","doi":"10.1016/j.jvoice.2024.08.002","DOIUrl":null,"url":null,"abstract":"<p><p>Voice fatigue (VF) has many symptoms and can occur after extended or brief voice use, depending on the presence or absence of voice pathology, and other factors. However, fatigue is difficult to detect and quantify through current approaches. This study explores the use of artificial intelligence (AI) in the automatic detection and analysis of VF, presenting a novel approach to detect and monitor the condition.</p><p><strong>Objective: </strong>This study aims to create an AI-based system for detecting VF. The AI model's performance is evaluated against traditional methods of assessment conducted by speech-language pathologists (SLPs).</p><p><strong>Methods: </strong>Voice samples were collected from individuals experiencing varying levels of VF. To validate these samples, we calculated f<sub>o</sub>, increases that have been shown to be correlated with VF, at the beginning and end of the recordings. The samples were processed using a machine learning model trained to recognize patterns associated with VF. To build the model, we extracted embeddings from an ECAPA-TDNN model that has been shown to capture changes in the voice characteristics of a speaker over time and used a Convolutional Neural Network for classification. To validate the model, the model's accuracy in detecting VF was compared with assessments from SLPs.</p><p><strong>Results: </strong>We achieved an accuracy score of 93% on our dataset of English academic lectures and podcasts. As further validation, we asked three experienced SLPs to classify audio segments from our dataset and compared their responses to the classifications from our model, and achieved an accuracy of 86% as compared to their ratings.</p><p><strong>Conclusion: </strong>The application of AI in the detection of VF shows a generalizable approach for the analysis of VF. Future research will incorporate patient data to validate further the models that we created.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting of Voice Fatigue With Artificial Intelligence.\",\"authors\":\"Abhinav Siripurapu, Robert T Sataloff\",\"doi\":\"10.1016/j.jvoice.2024.08.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Voice fatigue (VF) has many symptoms and can occur after extended or brief voice use, depending on the presence or absence of voice pathology, and other factors. However, fatigue is difficult to detect and quantify through current approaches. This study explores the use of artificial intelligence (AI) in the automatic detection and analysis of VF, presenting a novel approach to detect and monitor the condition.</p><p><strong>Objective: </strong>This study aims to create an AI-based system for detecting VF. The AI model's performance is evaluated against traditional methods of assessment conducted by speech-language pathologists (SLPs).</p><p><strong>Methods: </strong>Voice samples were collected from individuals experiencing varying levels of VF. To validate these samples, we calculated f<sub>o</sub>, increases that have been shown to be correlated with VF, at the beginning and end of the recordings. The samples were processed using a machine learning model trained to recognize patterns associated with VF. To build the model, we extracted embeddings from an ECAPA-TDNN model that has been shown to capture changes in the voice characteristics of a speaker over time and used a Convolutional Neural Network for classification. To validate the model, the model's accuracy in detecting VF was compared with assessments from SLPs.</p><p><strong>Results: </strong>We achieved an accuracy score of 93% on our dataset of English academic lectures and podcasts. As further validation, we asked three experienced SLPs to classify audio segments from our dataset and compared their responses to the classifications from our model, and achieved an accuracy of 86% as compared to their ratings.</p><p><strong>Conclusion: </strong>The application of AI in the detection of VF shows a generalizable approach for the analysis of VF. Future research will incorporate patient data to validate further the models that we created.</p>\",\"PeriodicalId\":49954,\"journal\":{\"name\":\"Journal of Voice\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-08-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Voice\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jvoice.2024.08.002\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2024.08.002","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
Detecting of Voice Fatigue With Artificial Intelligence.
Voice fatigue (VF) has many symptoms and can occur after extended or brief voice use, depending on the presence or absence of voice pathology, and other factors. However, fatigue is difficult to detect and quantify through current approaches. This study explores the use of artificial intelligence (AI) in the automatic detection and analysis of VF, presenting a novel approach to detect and monitor the condition.
Objective: This study aims to create an AI-based system for detecting VF. The AI model's performance is evaluated against traditional methods of assessment conducted by speech-language pathologists (SLPs).
Methods: Voice samples were collected from individuals experiencing varying levels of VF. To validate these samples, we calculated fo, increases that have been shown to be correlated with VF, at the beginning and end of the recordings. The samples were processed using a machine learning model trained to recognize patterns associated with VF. To build the model, we extracted embeddings from an ECAPA-TDNN model that has been shown to capture changes in the voice characteristics of a speaker over time and used a Convolutional Neural Network for classification. To validate the model, the model's accuracy in detecting VF was compared with assessments from SLPs.
Results: We achieved an accuracy score of 93% on our dataset of English academic lectures and podcasts. As further validation, we asked three experienced SLPs to classify audio segments from our dataset and compared their responses to the classifications from our model, and achieved an accuracy of 86% as compared to their ratings.
Conclusion: The application of AI in the detection of VF shows a generalizable approach for the analysis of VF. Future research will incorporate patient data to validate further the models that we created.
期刊介绍:
The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.