A Review on Speech Emotion Recognition Using Machine Learning

Sk. Mohammed Jubear, D. P. K. Reddy, G. Subramanyam, Sk. Farooq, T. Sreenivasulu, N. S. Rao
{"title":"A Review on Speech Emotion Recognition Using Machine Learning","authors":"Sk. Mohammed Jubear, D. P. K. Reddy, G. Subramanyam, Sk. Farooq, T. Sreenivasulu, N. S. Rao","doi":"10.55524/ijircst.2022.10.3.65","DOIUrl":null,"url":null,"abstract":"This paper focuses on the development of a robust speech emotion recognition system using a combination of different speech features with feature optimization techniques and speech de-noising technique to acquire improved emotion classification accuracy, decreasing the system complexity and obtain noise robustness. Additionally, we create original methods for SER to merge features. We employ feature optimization methods that are based on the feature transformation and feature selection machine learning techniques in order to build SER. The following is a list of the upcoming events. A neural network can use either of these two techniques. As more feelings are taken into account, the feature fusion-acquired SER accuracy falls short of expectations, and the plague of dimensionality starts to spread due to the addition of speech features, which makes the SER system work harder to complete its task. This is due to the SER system becoming more complicated when voice elements are added. Therefore, it is crucial to create a SER system that is more trustworthy, has the most practical features, and uses the least amount of computing power possible. By using strategies that maximize current features, it is possible to streamline the feature selection process by reducing the total number of accessible choices to a more reasonable level. This piece employs a method known as Semi-Non Negative Matrix Factorization to lessen the amount of processing trash that the SER system generates. (Semi-NMF). This approach can be used to change traits that are capable of learning on their own.","PeriodicalId":218345,"journal":{"name":"International Journal of Innovative Research in Computer Science and Technology","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Innovative Research in Computer Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55524/ijircst.2022.10.3.65","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper focuses on the development of a robust speech emotion recognition system using a combination of different speech features with feature optimization techniques and speech de-noising technique to acquire improved emotion classification accuracy, decreasing the system complexity and obtain noise robustness. Additionally, we create original methods for SER to merge features. We employ feature optimization methods that are based on the feature transformation and feature selection machine learning techniques in order to build SER. The following is a list of the upcoming events. A neural network can use either of these two techniques. As more feelings are taken into account, the feature fusion-acquired SER accuracy falls short of expectations, and the plague of dimensionality starts to spread due to the addition of speech features, which makes the SER system work harder to complete its task. This is due to the SER system becoming more complicated when voice elements are added. Therefore, it is crucial to create a SER system that is more trustworthy, has the most practical features, and uses the least amount of computing power possible. By using strategies that maximize current features, it is possible to streamline the feature selection process by reducing the total number of accessible choices to a more reasonable level. This piece employs a method known as Semi-Non Negative Matrix Factorization to lessen the amount of processing trash that the SER system generates. (Semi-NMF). This approach can be used to change traits that are capable of learning on their own.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习的语音情感识别研究进展
本文重点研究了将不同语音特征结合特征优化技术和语音去噪技术开发鲁棒性语音情感识别系统,以获得更高的情感分类精度,降低系统复杂度,并获得噪声鲁棒性。此外,我们还为SER创建了合并特性的原始方法。我们采用基于特征转换和特征选择机器学习技术的特征优化方法来构建SER。以下是即将举行的活动列表。神经网络可以使用这两种技术中的任何一种。由于考虑了更多的感受,特征融合获得的SER精度达不到预期,并且由于语音特征的增加,维度的瘟疫开始蔓延,这使得SER系统更难完成任务。这是因为添加语音元素后,SER系统变得更加复杂。因此,创建一个更值得信赖、具有最实用的功能并使用尽可能少的计算能力的SER系统至关重要。通过使用最大化当前特征的策略,可以通过将可访问选项的总数减少到更合理的水平来简化特征选择过程。本文采用一种称为半非负矩阵分解的方法来减少SER系统生成的处理垃圾的数量。(Semi-NMF)。这种方法可以用来改变能够自主学习的特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Comprehensive Review on Machine Learning Applications in Cloud Computing A Comparative Study of ChatGPT, Gemini, and Perplexity Exploring the Synergy of Web Usage Data and Content Mining for Personalized Effectiveness A Comparative Study of Cat Swarm Algorithm for Graph Coloring Problem: Convergence Analysis and Performance Evaluation A Comprehensive Review- Building A Secure Social Media Environment for Kids- Automated Content Filtering with Biometric Feedback
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1