基于双核支持向量的新型利维蜣螂算法,用于准确的语音情感检测

IF 1.8 3区 工程技术 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC Circuits, Systems and Signal Processing Pub Date : 2024-07-29 DOI:10.1007/s00034-024-02791-2
Tian Han, Zhu Zhang, Mingyuan Ren, Changchun Dong, Xiaolin Jiang
{"title":"基于双核支持向量的新型利维蜣螂算法,用于准确的语音情感检测","authors":"Tian Han, Zhu Zhang, Mingyuan Ren, Changchun Dong, Xiaolin Jiang","doi":"10.1007/s00034-024-02791-2","DOIUrl":null,"url":null,"abstract":"<p>Human emotions are easy to identify through facial expressions, body movements, and gestures. Speech carries a lot of emotional cues including variations in pitch, tone, intensity, and rhythm. In recent years, the increasing demand for human–computer interaction has spurred the development of speech recognition methods. Traditional Speech emotion detection methods are less effective in recognizing emotions, considering features like pitch, intensity, and spectral characteristics. To address these issues, this paper proposed a novel method named Dual Kernel Support Vector based Levy Dung Beetle (DKSV-LDB) Algorithm to accurately identify emotions like happiness, anger, sadness, etc. from speech patterns. In this study, the model is designed by combining a Dual Kernel Support Vector Machine (SVM) method with a Dung beetle Optimization algorithm, enriched by the Levy Flight strategy. This work conducted experiments in the datasets namely the CREMA-D, TESS, and EMO-DB (German). The performance evaluation measures such as accuracy, precision, recall, F-measure, and specificity are utilized for the evaluation of the proposed DKSV-LDB method and these results are compared with existing methods. The DKSV-LDB method achieved accuracy, precision, recall, F-measure, and specificity of 98.57%, 97.91%, 97.86%, 97.84%, and 97.78%. The experimental results depict the performance of the developed DKSV-LDB technique for speech emotion identification.</p>","PeriodicalId":10227,"journal":{"name":"Circuits, Systems and Signal Processing","volume":"74 1","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Dual Kernel Support Vector-Based Levy Dung Beetle Algorithm for Accurate Speech Emotion Detection\",\"authors\":\"Tian Han, Zhu Zhang, Mingyuan Ren, Changchun Dong, Xiaolin Jiang\",\"doi\":\"10.1007/s00034-024-02791-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Human emotions are easy to identify through facial expressions, body movements, and gestures. Speech carries a lot of emotional cues including variations in pitch, tone, intensity, and rhythm. In recent years, the increasing demand for human–computer interaction has spurred the development of speech recognition methods. Traditional Speech emotion detection methods are less effective in recognizing emotions, considering features like pitch, intensity, and spectral characteristics. To address these issues, this paper proposed a novel method named Dual Kernel Support Vector based Levy Dung Beetle (DKSV-LDB) Algorithm to accurately identify emotions like happiness, anger, sadness, etc. from speech patterns. In this study, the model is designed by combining a Dual Kernel Support Vector Machine (SVM) method with a Dung beetle Optimization algorithm, enriched by the Levy Flight strategy. This work conducted experiments in the datasets namely the CREMA-D, TESS, and EMO-DB (German). The performance evaluation measures such as accuracy, precision, recall, F-measure, and specificity are utilized for the evaluation of the proposed DKSV-LDB method and these results are compared with existing methods. The DKSV-LDB method achieved accuracy, precision, recall, F-measure, and specificity of 98.57%, 97.91%, 97.86%, 97.84%, and 97.78%. The experimental results depict the performance of the developed DKSV-LDB technique for speech emotion identification.</p>\",\"PeriodicalId\":10227,\"journal\":{\"name\":\"Circuits, Systems and Signal Processing\",\"volume\":\"74 1\",\"pages\":\"\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2024-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Circuits, Systems and Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s00034-024-02791-2\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Circuits, Systems and Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s00034-024-02791-2","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

人类的情绪很容易通过面部表情、肢体动作和手势来识别。语音则包含许多情绪线索,包括音调、语气、强度和节奏的变化。近年来,人机交互需求的不断增长推动了语音识别方法的发展。传统的语音情感检测方法考虑到音调、强度和频谱特性等特征,在识别情感方面效果较差。为解决这些问题,本文提出了一种名为基于莱维蜣螂算法(DKSV-LDB)的双核支持向量新方法,可从语音模式中准确识别喜怒哀乐等情绪。在这项研究中,模型的设计结合了双核支持向量机(SVM)方法和蜣螂优化算法,并使用了李维飞行策略。这项工作在 CREMA-D、TESS 和 EMO-DB(德语)数据集上进行了实验。采用准确度、精确度、召回率、F-measure 和特异性等性能评估指标对所提出的 DKSV-LDB 方法进行了评估,并将这些结果与现有方法进行了比较。DKSV-LDB 方法的准确度、精确度、召回率、F-measure 和特异性分别达到了 98.57%、97.91%、97.86%、97.84% 和 97.78%。实验结果表明了所开发的 DKSV-LDB 技术在语音情感识别方面的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Novel Dual Kernel Support Vector-Based Levy Dung Beetle Algorithm for Accurate Speech Emotion Detection

Human emotions are easy to identify through facial expressions, body movements, and gestures. Speech carries a lot of emotional cues including variations in pitch, tone, intensity, and rhythm. In recent years, the increasing demand for human–computer interaction has spurred the development of speech recognition methods. Traditional Speech emotion detection methods are less effective in recognizing emotions, considering features like pitch, intensity, and spectral characteristics. To address these issues, this paper proposed a novel method named Dual Kernel Support Vector based Levy Dung Beetle (DKSV-LDB) Algorithm to accurately identify emotions like happiness, anger, sadness, etc. from speech patterns. In this study, the model is designed by combining a Dual Kernel Support Vector Machine (SVM) method with a Dung beetle Optimization algorithm, enriched by the Levy Flight strategy. This work conducted experiments in the datasets namely the CREMA-D, TESS, and EMO-DB (German). The performance evaluation measures such as accuracy, precision, recall, F-measure, and specificity are utilized for the evaluation of the proposed DKSV-LDB method and these results are compared with existing methods. The DKSV-LDB method achieved accuracy, precision, recall, F-measure, and specificity of 98.57%, 97.91%, 97.86%, 97.84%, and 97.78%. The experimental results depict the performance of the developed DKSV-LDB technique for speech emotion identification.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Circuits, Systems and Signal Processing
Circuits, Systems and Signal Processing 工程技术-工程:电子与电气
CiteScore
4.80
自引率
13.00%
发文量
321
审稿时长
4.6 months
期刊介绍: Rapid developments in the analog and digital processing of signals for communication, control, and computer systems have made the theory of electrical circuits and signal processing a burgeoning area of research and design. The aim of Circuits, Systems, and Signal Processing (CSSP) is to help meet the needs of outlets for significant research papers and state-of-the-art review articles in the area. The scope of the journal is broad, ranging from mathematical foundations to practical engineering design. It encompasses, but is not limited to, such topics as linear and nonlinear networks, distributed circuits and systems, multi-dimensional signals and systems, analog filters and signal processing, digital filters and signal processing, statistical signal processing, multimedia, computer aided design, graph theory, neural systems, communication circuits and systems, and VLSI signal processing. The Editorial Board is international, and papers are welcome from throughout the world. The journal is devoted primarily to research papers, but survey, expository, and tutorial papers are also published. Circuits, Systems, and Signal Processing (CSSP) is published twelve times annually.
期刊最新文献
Squeeze-and-Excitation Self-Attention Mechanism Enhanced Digital Audio Source Recognition Based on Transfer Learning Recursive Windowed Variational Mode Decomposition Discrete-Time Delta-Sigma Modulator with Successively Approximating Register ADC Assisted Analog Feedback Technique Individually Weighted Modified Logarithmic Hyperbolic Sine Curvelet Based Recursive FLN for Nonlinear System Identification Event-Triggered $$H_{\infty }$$ Filtering for A Class of Nonlinear Systems Under DoS Attacks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1