{"title":"Comparison of Deep Learning Models for Voice Disorder Classification Using Kymographic Images.","authors":"B Panchami, S Pravin Kumar","doi":"10.1016/j.jvoice.2025.01.001","DOIUrl":null,"url":null,"abstract":"<p><p>Voice is a critical tool for communication, and diagnosing voice disorders poses significant challenges, particularly when using high-speed video (HSV) endoscopy. The primary difficulty with HSV lies in the need for clinical experts to manually analyze and interpret large volumes of HSV frames. To address these challenges, kymography has been introduced as an effective clinical decision-support tool. In this study, a deep learning-based approach for classifying kymographic images is proposed to automate the analysis by training models to detect subtle and intricate variations in pathological vibratory patterns. We used high-speed recordings from the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset to generate kymographic images, which were then used for binary and tertiary classifications employing deep learning models. We evaluated the performance of five widely used pretrained models: AlexNet, DenseNet121, Xception, Inceptionv3, and ResNet50v2. Our experimental results demonstrate that DenseNet121 can automatically classify voice disorders with higher accuracy and better performance across different model evaluation indicators, outperforming existing methods. With further research, the deep learning classifier has the potential to become a valuable diagnostic assistance tool for clinicians.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2025.01.001","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Voice is a critical tool for communication, and diagnosing voice disorders poses significant challenges, particularly when using high-speed video (HSV) endoscopy. The primary difficulty with HSV lies in the need for clinical experts to manually analyze and interpret large volumes of HSV frames. To address these challenges, kymography has been introduced as an effective clinical decision-support tool. In this study, a deep learning-based approach for classifying kymographic images is proposed to automate the analysis by training models to detect subtle and intricate variations in pathological vibratory patterns. We used high-speed recordings from the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset to generate kymographic images, which were then used for binary and tertiary classifications employing deep learning models. We evaluated the performance of five widely used pretrained models: AlexNet, DenseNet121, Xception, Inceptionv3, and ResNet50v2. Our experimental results demonstrate that DenseNet121 can automatically classify voice disorders with higher accuracy and better performance across different model evaluation indicators, outperforming existing methods. With further research, the deep learning classifier has the potential to become a valuable diagnostic assistance tool for clinicians.
期刊介绍:
The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.