Adding dimensional features for emotion recognition on speech

2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) Pub Date : 2020-09-01 DOI:10.1109/ATSIP49331.2020.9231766

Leila Ben Letaifa, M. I. Torres, R. Justo

引用次数: 7

Abstract

Developing accurate emotion recognition systems requires extracting suitable features of these emotions. In this paper, we propose an original approach of parameters extraction based on the strong, theoretical and empirical, correlation between the emotion categories and the dimensional emotions parameters. More precisely, acoustic features and dimensional emotion parameters are combined for better speech emotion characterisation. The procedure consists in developing arousal and valence models by regression on the training data and estimating, by classification, their values in the test data. Hence, when classifying an unknown sample into emotion categories, these estimations could be integrated into the feature vectors. It is noted that the results using this new set of parameters show a significant improvement of the speech emotion recognition performance.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

为语音情感识别添加维度特征

开发准确的情绪识别系统需要提取这些情绪的合适特征。在本文中，我们提出了一种基于情感类别与维度情感参数之间强烈的理论和经验相关性的参数提取方法。更准确地说，声学特征和维度情感参数相结合，以更好地表征语音情感。该过程包括通过对训练数据的回归和通过分类估计它们在测试数据中的值来开发唤醒和效价模型。因此，当将未知样本分类到情感类别时，这些估计可以集成到特征向量中。结果表明，使用该参数集的结果显著提高了语音情感识别的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

自引率

0.00%

发文量

期刊最新文献

Automatic Recognition of Epileptiform EEG Abnormalities Using Machine Learning Approaches Generation of fuzzy evidence numbers for the evaluation of uncertainty measures Speckle Denoising of the Multipolarization Images by Hybrid Filters Identification of the user by using a hardware device Lightweight Hardware Architectures for the Piccolo Block Cipher in FPGA