B.N.W.M.R.A. Boragolla, F. Farook, H. Herath, M. Ekanayake, G. Godaliyadda, J. Wijayakulasooriya
{"title":"Hierarchical method to classify emotions in speech signals","authors":"B.N.W.M.R.A. Boragolla, F. Farook, H. Herath, M. Ekanayake, G. Godaliyadda, J. Wijayakulasooriya","doi":"10.1109/ICIAFS.2016.7946527","DOIUrl":null,"url":null,"abstract":"Recently studies have been performed on spectral features such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Predictor Cepstral Coefficients (LPCC) for speech emotion recognition. It was found in our study that the Fourier Transform of MFCC time trajectories also play an important role in speech emotion recognition. And also a new hierarchical classification method was proposed based on K Nearest Neighbour (KNN) and Fisher Linear Discriminant Analysis (FLDA). This reduces the computational cost drastically because a lesser number of feature set is used. Also this methodology helps in resolving the non-linearity of the clustering problem. The emotions anger, sadness, disgust, neutrality, boredom, fear and happiness were recognized with the accuracies of 95%, 100%, 85%, 86%, 77%, 53% and 80% on the standard German database(EMODB).","PeriodicalId":237290,"journal":{"name":"2016 IEEE International Conference on Information and Automation for Sustainability (ICIAfS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Information and Automation for Sustainability (ICIAfS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIAFS.2016.7946527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Recently studies have been performed on spectral features such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Predictor Cepstral Coefficients (LPCC) for speech emotion recognition. It was found in our study that the Fourier Transform of MFCC time trajectories also play an important role in speech emotion recognition. And also a new hierarchical classification method was proposed based on K Nearest Neighbour (KNN) and Fisher Linear Discriminant Analysis (FLDA). This reduces the computational cost drastically because a lesser number of feature set is used. Also this methodology helps in resolving the non-linearity of the clustering problem. The emotions anger, sadness, disgust, neutrality, boredom, fear and happiness were recognized with the accuracies of 95%, 100%, 85%, 86%, 77%, 53% and 80% on the standard German database(EMODB).