AROA based Pre-trained Model of Convolutional Neural Network for Voice Pathology Detection and Classification

Manikandan J, Kayalvizhi K, Yuvaraj Nachimuthu, Jeena R
{"title":"AROA based Pre-trained Model of Convolutional Neural Network for Voice Pathology Detection and Classification","authors":"Manikandan J, Kayalvizhi K, Yuvaraj Nachimuthu, Jeena R","doi":"10.53759/7669/jmc202404044","DOIUrl":null,"url":null,"abstract":"With the demand for better, more user-friendly HMIs, voice recognition systems have risen in prominence in recent years. The use of computer-assisted vocal pathology categorization tools allows for the accurate detection of voice pathology diseases. By using these methods, vocal disorders may be diagnosed early on and treated accordingly. An effective Deep Learning-based tool for feature extraction-based vocal pathology identification is the goal of this project. This research presents the results of using EfficientNet, a pre-trained Convolutional Neural Network (CNN), on a speech pathology dataset in order to achieve the highest possible classification accuracy. An Artificial Rabbit Optimization Algorithm (AROA)-tuned set of parameters complements the model's mobNet building elements, which include a linear stack of divisible convolution and max-pooling layers activated by Swish. In order to make the suggested approach applicable to a broad variety of voice disorder problems, this study also suggests a unique training method along with several training methodologies. One speech database, the Saarbrücken voice database (SVD), has been used to test the proposed technology. Using up to 96% accuracy, the experimental findings demonstrate that the suggested CNN approach is capable of detecting speech pathologies. The suggested method demonstrates great potential for use in real-world clinical settings, where it may provide accurate classifications in as little as three seconds and expedite automated diagnosis and treatment.","PeriodicalId":516221,"journal":{"name":"Journal of Machine and Computing","volume":"6 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Machine and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.53759/7669/jmc202404044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the demand for better, more user-friendly HMIs, voice recognition systems have risen in prominence in recent years. The use of computer-assisted vocal pathology categorization tools allows for the accurate detection of voice pathology diseases. By using these methods, vocal disorders may be diagnosed early on and treated accordingly. An effective Deep Learning-based tool for feature extraction-based vocal pathology identification is the goal of this project. This research presents the results of using EfficientNet, a pre-trained Convolutional Neural Network (CNN), on a speech pathology dataset in order to achieve the highest possible classification accuracy. An Artificial Rabbit Optimization Algorithm (AROA)-tuned set of parameters complements the model's mobNet building elements, which include a linear stack of divisible convolution and max-pooling layers activated by Swish. In order to make the suggested approach applicable to a broad variety of voice disorder problems, this study also suggests a unique training method along with several training methodologies. One speech database, the Saarbrücken voice database (SVD), has been used to test the proposed technology. Using up to 96% accuracy, the experimental findings demonstrate that the suggested CNN approach is capable of detecting speech pathologies. The suggested method demonstrates great potential for use in real-world clinical settings, where it may provide accurate classifications in as little as three seconds and expedite automated diagnosis and treatment.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于 AROA 的卷积神经网络预训练模型用于语音病理检测和分类
近年来,随着对更好、更友好的人机界面的需求,语音识别系统的地位日益突出。使用计算机辅助声带病理学分类工具可以准确检测声带病理学疾病。通过使用这些方法,可以及早诊断出声带疾病并进行相应治疗。本项目的目标是开发一种基于深度学习的有效工具,用于基于特征提取的声带病理识别。本研究展示了在语音病理学数据集上使用预先训练好的卷积神经网络(CNN)EfficientNet 的结果,以达到尽可能高的分类准确率。经过人工兔优化算法(AROA)调整的参数集补充了该模型的 mobNet 构建元素,其中包括由 Swish 激活的可分割卷积层和最大池化层的线性堆叠。为了使建议的方法适用于各种语音障碍问题,本研究还提出了一种独特的训练方法和几种训练方法。萨尔布吕肯语音数据库(SVD)被用来测试所建议的技术。实验结果表明,建议的 CNN 方法能够检测语音病理,准确率高达 96%。建议的方法在实际临床环境中具有巨大的应用潜力,可在短短三秒钟内提供准确的分类,加快自动诊断和治疗。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Optimizing Building Energy Management with Deep Reinforcement Learning for Smart and Sustainable Infrastructure Enhancing Predictive Maintenance in Water Treatment Plants through Sparse Autoencoder Based Anomaly Detection Advancements and Challenges in Underwater Soft Robotics: Materials, Control and Integration IoT Based ICU Healthcare: Optimizing Patient Monitoring and Treatment with Advanced Algorithms Hybrid Optimization Model Integrating Gradient Descent and Stochastic Descent for Enhanced Osteoporosis and Osteopenia Recognition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1