利用卷积神经网络和视觉变换器从语音数据中诊断心理健康。

IF 2.5 4区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Journal of Voice Pub Date : 2024-11-15 DOI:10.1016/j.jvoice.2024.10.010
Rafiul Islam, Md Taimur Ahad, Faruk Ahmed, Bo Song, Yan Li
{"title":"利用卷积神经网络和视觉变换器从语音数据中诊断心理健康。","authors":"Rafiul Islam, Md Taimur Ahad, Faruk Ahmed, Bo Song, Yan Li","doi":"10.1016/j.jvoice.2024.10.010","DOIUrl":null,"url":null,"abstract":"<p><p>Integrating Convolutional Neural Networks and Vision Transformers in voice analysis has unveiled a new horizon in mental health identification. Human voice, a powerful indicator of mental health, was the focus of this study. Human voice data representing stable and unstable conditions were gathered from various mental health institutions in Bangladesh. The results of the experiment suggest that the proposed model achieved 91% accuracy, precision of 92% for the \"Unstable\" category and 90% for the \"Stable\" category, and recall of 91% for the \"Stable\" category and 92% for the \"Unstable\" category. In addition, a high F1 score of 91% was achieved. This study significantly contributes to computer-aided diagnosis in mental health by using deep learning (DL) to diagnose mental well-being. Our research underscores the substantial impact of DL on the advancement of mental health care, instilling hope for a brighter future in mental health care.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mental Health Diagnosis From Voice Data Using Convolutional Neural Networks and Vision Transformers.\",\"authors\":\"Rafiul Islam, Md Taimur Ahad, Faruk Ahmed, Bo Song, Yan Li\",\"doi\":\"10.1016/j.jvoice.2024.10.010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Integrating Convolutional Neural Networks and Vision Transformers in voice analysis has unveiled a new horizon in mental health identification. Human voice, a powerful indicator of mental health, was the focus of this study. Human voice data representing stable and unstable conditions were gathered from various mental health institutions in Bangladesh. The results of the experiment suggest that the proposed model achieved 91% accuracy, precision of 92% for the \\\"Unstable\\\" category and 90% for the \\\"Stable\\\" category, and recall of 91% for the \\\"Stable\\\" category and 92% for the \\\"Unstable\\\" category. In addition, a high F1 score of 91% was achieved. This study significantly contributes to computer-aided diagnosis in mental health by using deep learning (DL) to diagnose mental well-being. Our research underscores the substantial impact of DL on the advancement of mental health care, instilling hope for a brighter future in mental health care.</p>\",\"PeriodicalId\":49954,\"journal\":{\"name\":\"Journal of Voice\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Voice\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jvoice.2024.10.010\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2024.10.010","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

将卷积神经网络和视觉变换器整合到语音分析中,为精神健康识别开辟了新天地。人声是精神健康的有力指标,也是本研究的重点。研究人员从孟加拉国多家精神健康机构收集了代表稳定和不稳定状况的人声数据。实验结果表明,所提出的模型准确率达到 91%,"不稳定 "类别的准确率为 92%,"稳定 "类别的准确率为 90%,"稳定 "类别的召回率为 91%,"不稳定 "类别的召回率为 92%。此外,F1 分数也高达 91%。本研究通过使用深度学习(DL)诊断心理健康,为心理健康领域的计算机辅助诊断做出了重大贡献。我们的研究强调了深度学习对心理健康护理进步的重大影响,为心理健康护理的美好未来注入了希望。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Mental Health Diagnosis From Voice Data Using Convolutional Neural Networks and Vision Transformers.

Integrating Convolutional Neural Networks and Vision Transformers in voice analysis has unveiled a new horizon in mental health identification. Human voice, a powerful indicator of mental health, was the focus of this study. Human voice data representing stable and unstable conditions were gathered from various mental health institutions in Bangladesh. The results of the experiment suggest that the proposed model achieved 91% accuracy, precision of 92% for the "Unstable" category and 90% for the "Stable" category, and recall of 91% for the "Stable" category and 92% for the "Unstable" category. In addition, a high F1 score of 91% was achieved. This study significantly contributes to computer-aided diagnosis in mental health by using deep learning (DL) to diagnose mental well-being. Our research underscores the substantial impact of DL on the advancement of mental health care, instilling hope for a brighter future in mental health care.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Voice
Journal of Voice 医学-耳鼻喉科学
CiteScore
4.00
自引率
13.60%
发文量
395
审稿时长
59 days
期刊介绍: The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.
期刊最新文献
Optimal Brain Targets for Enhancing Vocal Performance With Transcranial Direct Current Stimulation. A Multidiscipline Practitioner Pilot Study Into the Potential Professionalization of Singing Voice Rehabilitation Specialists in the United Kingdom. Characterizing Vocal Hyperfunction Using Ecological Momentary Assessment of Relative Fundamental Frequency. Evaluation of Depression and Perceived Voice Handicap Through Self-Report in Individuals With Alzheimer's Disease. Vocal Biomarkers for Parkinson's Disease Classification Using Audio Spectrogram Transformers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1