SVM-MLP-PNN分类器在语音情感识别领域的比较研究

2010 Fifth International Conference on Digital Telecommunications Pub Date : 2010-06-13 DOI:10.1109/ICDT.2010.8

Theodoros Iliou, C. Anagnostopoulos

{"title":"SVM-MLP-PNN分类器在语音情感识别领域的比较研究","authors":"Theodoros Iliou, C. Anagnostopoulos","doi":"10.1109/ICDT.2010.8","DOIUrl":null,"url":null,"abstract":"In this paper, we present a comparative analysisof three classifiers for speech signal emotion recognition.Recognition was performed on emotional Berlin Database.This work focuses on speaker and utterance (phrase)dependent and independent framework. One hundred thirtythree (133) sound/speech features were extracted from Pitch,Mel Frequency Cepstral Coefficients, Energy and Formantsand were evaluated in order to create a feature set sufficient todiscriminate between seven emotions in acted speech. A set of26 features was selected by statistical method and MultilayerPercepton, Probabilistic Neural Networks and Support VectorMachine were used for the Emotion Classification at sevenclasses: anger, happiness, anxiety/fear, sadness, boredom,disgust and neutral. In speaker dependent framework,Probabilistic Neural Network classifier reached very highaccuracy of 94%, whereas in speaker independent framework,Support Vector Machine classification reached the bestaccuracy of 80%. The results of numerical experiments aregiven and discussed in the paper.","PeriodicalId":322589,"journal":{"name":"2010 Fifth International Conference on Digital Telecommunications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":"{\"title\":\"SVM-MLP-PNN Classifiers on Speech Emotion Recognition Field - A Comparative Study\",\"authors\":\"Theodoros Iliou, C. Anagnostopoulos\",\"doi\":\"10.1109/ICDT.2010.8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present a comparative analysisof three classifiers for speech signal emotion recognition.Recognition was performed on emotional Berlin Database.This work focuses on speaker and utterance (phrase)dependent and independent framework. One hundred thirtythree (133) sound/speech features were extracted from Pitch,Mel Frequency Cepstral Coefficients, Energy and Formantsand were evaluated in order to create a feature set sufficient todiscriminate between seven emotions in acted speech. A set of26 features was selected by statistical method and MultilayerPercepton, Probabilistic Neural Networks and Support VectorMachine were used for the Emotion Classification at sevenclasses: anger, happiness, anxiety/fear, sadness, boredom,disgust and neutral. In speaker dependent framework,Probabilistic Neural Network classifier reached very highaccuracy of 94%, whereas in speaker independent framework,Support Vector Machine classification reached the bestaccuracy of 80%. The results of numerical experiments aregiven and discussed in the paper.\",\"PeriodicalId\":322589,\"journal\":{\"name\":\"2010 Fifth International Conference on Digital Telecommunications\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"24\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Fifth International Conference on Digital Telecommunications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDT.2010.8\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Fifth International Conference on Digital Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDT.2010.8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 24

摘要

本文对语音信号情感识别中的三种分类器进行了比较分析。对情感柏林数据库进行识别。本研究的重点是说话人和话语(短语)依赖和独立的框架。从音高、Mel频率倒谱系数、能量和共振峰中提取133个声音/语音特征，并对其进行评估，以创建一个足以区分七种情绪的特征集。通过统计方法选取了26个特征，采用多层感知、概率神经网络和支持向量机对情绪进行了七种分类:愤怒、快乐、焦虑/恐惧、悲伤、无聊、厌恶和中性。在说话人依赖框架下，概率神经网络分类器的准确率达到了94%，而在说话人独立框架下，支持向量机分类器的准确率达到了80%。文中给出了数值实验结果并进行了讨论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

SVM-MLP-PNN Classifiers on Speech Emotion Recognition Field - A Comparative Study

In this paper, we present a comparative analysisof three classifiers for speech signal emotion recognition.Recognition was performed on emotional Berlin Database.This work focuses on speaker and utterance (phrase)dependent and independent framework. One hundred thirtythree (133) sound/speech features were extracted from Pitch,Mel Frequency Cepstral Coefficients, Energy and Formantsand were evaluated in order to create a feature set sufficient todiscriminate between seven emotions in acted speech. A set of26 features was selected by statistical method and MultilayerPercepton, Probabilistic Neural Networks and Support VectorMachine were used for the Emotion Classification at sevenclasses: anger, happiness, anxiety/fear, sadness, boredom,disgust and neutral. In speaker dependent framework,Probabilistic Neural Network classifier reached very highaccuracy of 94%, whereas in speaker independent framework,Support Vector Machine classification reached the bestaccuracy of 80%. The results of numerical experiments aregiven and discussed in the paper.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2010 Fifth International Conference on Digital Telecommunications

自引率

0.00%

发文量