Emotion Classification of Mandarin Speech Based on TEO Nonlinear Features
Authors: Gao Hui, Chen Shanguang, Su Guangchuan
Published in: Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), 2007-07-30
DOI: 10.1109/SNPD.2007.487
To identify speech features that effectively represent different emotional styles in Mandarin speech, this work investigates nonlinear features based on the Teager Energy Operator (TEO). A neutral state and three emotional states (happiness, anger, and sadness) are classified using a Mandarin speech database. MFCC extraction with HMM-based emotion recognition serves as the baseline system against which the classification performance of the TEO-based features is evaluated. In the text-dependent condition, all four nonlinear features (NFD_Mel, AF_Mel, DAF_Mel, and AM_SBCC) improve classification performance over MFCC. In the text-independent condition, NFD_Mel, AF_Mel, and DAF_Mel improve performance, whereas AM_SBCC degrades it. The classification results indicate that the TEO-based nonlinear features NFD_Mel, AF_Mel, and DAF_Mel represent different emotional styles in speech better than MFCC does.
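For readers unfamiliar with the Teager Energy Operator named in the abstract, the following is a minimal sketch of its standard discrete-time form, psi[n] = x[n]^2 - x[n-1]*x[n+1]. It illustrates only the basic operator. The abstract does not define how NFD_Mel, AF_Mel, DAF_Mel, or AM_SBCC are derived from it, so none of those features is reproduced here. The function name `teager_energy` and the test tone parameters are assumptions chosen for illustration.

```python
import math


def teager_energy(x):
    """Discrete Teager Energy Operator: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    The two endpoint samples have no neighbour on one side, so the
    result is two samples shorter than the input.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


# Hypothetical test signal: a pure tone with amplitude A and
# digital frequency omega (radians per sample).
A, omega = 0.5, 0.3
tone = [A * math.cos(omega * n) for n in range(64)]

psi = teager_energy(tone)

# For a pure tone the operator output is constant and equals
# A^2 * sin(omega)^2, so it tracks both amplitude and frequency.
expected = (A * math.sin(omega)) ** 2
```

Because the output depends on frequency as well as amplitude, the operator responds to changes in how a signal is modulated, which is the usual motivation for using it as a basis for nonlinear speech features.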