SVM-MLP-PNN Classifiers on Speech Emotion Recognition Field - A Comparative Study

2010 Fifth International Conference on Digital Telecommunications Pub Date : 2010-06-13 DOI:10.1109/ICDT.2010.8

Theodoros Iliou, C. Anagnostopoulos

引用次数: 24

Abstract

In this paper, we present a comparative analysisof three classifiers for speech signal emotion recognition.Recognition was performed on emotional Berlin Database.This work focuses on speaker and utterance (phrase)dependent and independent framework. One hundred thirtythree (133) sound/speech features were extracted from Pitch,Mel Frequency Cepstral Coefficients, Energy and Formantsand were evaluated in order to create a feature set sufficient todiscriminate between seven emotions in acted speech. A set of26 features was selected by statistical method and MultilayerPercepton, Probabilistic Neural Networks and Support VectorMachine were used for the Emotion Classification at sevenclasses: anger, happiness, anxiety/fear, sadness, boredom,disgust and neutral. In speaker dependent framework,Probabilistic Neural Network classifier reached very highaccuracy of 94%, whereas in speaker independent framework,Support Vector Machine classification reached the bestaccuracy of 80%. The results of numerical experiments aregiven and discussed in the paper.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

SVM-MLP-PNN分类器在语音情感识别领域的比较研究

本文对语音信号情感识别中的三种分类器进行了比较分析。对情感柏林数据库进行识别。本研究的重点是说话人和话语(短语)依赖和独立的框架。从音高、Mel频率倒谱系数、能量和共振峰中提取133个声音/语音特征，并对其进行评估，以创建一个足以区分七种情绪的特征集。通过统计方法选取了26个特征，采用多层感知、概率神经网络和支持向量机对情绪进行了七种分类:愤怒、快乐、焦虑/恐惧、悲伤、无聊、厌恶和中性。在说话人依赖框架下，概率神经网络分类器的准确率达到了94%，而在说话人独立框架下，支持向量机分类器的准确率达到了80%。文中给出了数值实验结果并进行了讨论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2010 Fifth International Conference on Digital Telecommunications

自引率

0.00%

发文量