{"title":"Speech based emotion classification","authors":"T. Nwe, Foo Say Wei, Liyanage, De Silva","doi":"10.1109/TENCON.2001.949600","DOIUrl":null,"url":null,"abstract":"In this paper, a speech based emotion classification method is presented. Six basic human emotions including anger, dislike, fear, happiness, sadness and surprise are investigated. The recognizer presented in this paper is based on the discrete hidden Markov model and a novel feature vector based on mel frequency short time speech power coefficients is proposed. A universal codebook is constructed based on emotions under observation for each experiment. The databases consist of 90 emotional utterances each from two speakers. Several experiments including ungrouped emotion classification and grouped emotion classification are conducted. For the ungrouped emotion classification, an average accuracy of 72.22% and 60% are obtained respectively for utterances of the two speakers. For grouped emotion classification, higher accuracy of 94.44% and 70% are achieved.","PeriodicalId":358168,"journal":{"name":"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"94","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.2001.949600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 94
Abstract
In this paper, a speech based emotion classification method is presented. Six basic human emotions including anger, dislike, fear, happiness, sadness and surprise are investigated. The recognizer presented in this paper is based on the discrete hidden Markov model and a novel feature vector based on mel frequency short time speech power coefficients is proposed. A universal codebook is constructed based on emotions under observation for each experiment. The databases consist of 90 emotional utterances each from two speakers. Several experiments including ungrouped emotion classification and grouped emotion classification are conducted. For the ungrouped emotion classification, an average accuracy of 72.22% and 60% are obtained respectively for utterances of the two speakers. For grouped emotion classification, higher accuracy of 94.44% and 70% are achieved.