Development of an advanced ai-based model for human psychoemotional state analysis

Q3 Mathematics Eastern-European Journal of Enterprise Technologies Pub Date : 2023-12-28 DOI:10.15587/1729-4061.2023.293011

Zharas Ainakulov, Kayrat Koshekov, Alexey Savostin, R. Anayatova, B. Seidakhmetov, G. Kurmankulova

{"title":"Development of an advanced ai-based model for human psychoemotional state analysis","authors":"Zharas Ainakulov, Kayrat Koshekov, Alexey Savostin, R. Anayatova, B. Seidakhmetov, G. Kurmankulova","doi":"10.15587/1729-4061.2023.293011","DOIUrl":null,"url":null,"abstract":"The research focuses on developing a novel method for the automatic recognition of human psychoemotional states (PES) using deep learning technology. This method is centered on analyzing speech signals to classify distinct emotional states. The primary challenge addressed by this research is to accurately perform multiclass classification of seven human psychoemotional states, namely joy, fear, anger, sadness, disgust, surprise, and a neutral state. Traditional methods have struggled to accurately distinguish these complex emotional nuances in speech. The study successfully developed a model capable of extracting informative features from audio recordings, specifically mel spectrograms and mel-frequency cepstral coefficients. These features were then used to train two deep convolutional neural networks, resulting in a classifier model. The uniqueness of this research lies in its use of a dual-feature approach and the employment of deep convolutional neural networks for classification. This approach has demonstrated high accuracy in emotion recognition, with an accuracy rate of 0.93 in the validation subset. The high accuracy and effectiveness of the model can be attributed to the comprehensive and synergistic use of mel spectrograms and mel-frequency cepstral coefficients, which provide a more nuanced analysis of emotional expressions in speech. The method presented in this research has broad applicability in various domains, including enhancing human-machine interface interactions, implementation in the aviation industry, healthcare, marketing, and other fields where understanding human emotions through speech is crucial","PeriodicalId":11433,"journal":{"name":"Eastern-European Journal of Enterprise Technologies","volume":"76 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eastern-European Journal of Enterprise Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15587/1729-4061.2023.293011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

Abstract

The research focuses on developing a novel method for the automatic recognition of human psychoemotional states (PES) using deep learning technology. This method is centered on analyzing speech signals to classify distinct emotional states. The primary challenge addressed by this research is to accurately perform multiclass classification of seven human psychoemotional states, namely joy, fear, anger, sadness, disgust, surprise, and a neutral state. Traditional methods have struggled to accurately distinguish these complex emotional nuances in speech. The study successfully developed a model capable of extracting informative features from audio recordings, specifically mel spectrograms and mel-frequency cepstral coefficients. These features were then used to train two deep convolutional neural networks, resulting in a classifier model. The uniqueness of this research lies in its use of a dual-feature approach and the employment of deep convolutional neural networks for classification. This approach has demonstrated high accuracy in emotion recognition, with an accuracy rate of 0.93 in the validation subset. The high accuracy and effectiveness of the model can be attributed to the comprehensive and synergistic use of mel spectrograms and mel-frequency cepstral coefficients, which provide a more nuanced analysis of emotional expressions in speech. The method presented in this research has broad applicability in various domains, including enhancing human-machine interface interactions, implementation in the aviation industry, healthcare, marketing, and other fields where understanding human emotions through speech is crucial

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

开发基于人工智能的高级人类心理情绪状态分析模型

研究重点是利用深度学习技术开发一种自动识别人类心理情绪状态（PES）的新方法。该方法以分析语音信号为中心，对不同的情绪状态进行分类。这项研究面临的主要挑战是如何准确地对七种人类心理情绪状态（即喜悦、恐惧、愤怒、悲伤、厌恶、惊讶和中性状态）进行多类分类。传统方法很难准确区分语音中这些复杂的细微情绪变化。这项研究成功开发了一个模型，能够从录音中提取信息特征，特别是旋律频谱图和旋律频率倒频谱系数。然后，这些特征被用于训练两个深度卷积神经网络，最终形成一个分类器模型。这项研究的独特之处在于使用了双特征方法和深度卷积神经网络进行分类。这种方法在情绪识别方面表现出了很高的准确性，在验证子集中的准确率达到了 0.93。该模型的高准确率和有效性可归功于综合、协同使用了梅尔频谱图和梅尔频率倒频谱系数，从而对语音中的情感表达进行了更细致入微的分析。本研究提出的方法可广泛应用于各个领域，包括增强人机界面交互、航空业、医疗保健、市场营销以及其他通过语音理解人类情绪至关重要的领域。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Eastern-European Journal of Enterprise Technologies Mathematics-Applied Mathematics

CiteScore

2.00

自引率

0.00%

发文量

369

审稿时长

6 weeks

期刊介绍： Terminology used in the title of the "East European Journal of Enterprise Technologies" - "enterprise technologies" should be read as "industrial technologies". "Eastern-European Journal of Enterprise Technologies" publishes all those best ideas from the science, which can be introduced in the industry. Since, obtaining the high-quality, competitive industrial products is based on introducing high technologies from various independent spheres of scientific researches, but united by a common end result - a finished high-technology product. Among these scientific spheres, there are engineering, power engineering and energy saving, technologies of inorganic and organic substances and materials science, information technologies and control systems. Publishing scientific papers in these directions are the main development "vectors" of the "Eastern-European Journal of Enterprise Technologies". Since, these are those directions of scientific researches, the results of which can be directly used in modern industrial production: space and aircraft industry, instrument-making industry, mechanical engineering, power engineering, chemical industry and metallurgy.