Yongsen Tao, Kunxia Wang, Jing Yang, Ning An, Lian Li
{"title":"和谐搜索在语音情感识别中的特征选择","authors":"Yongsen Tao, Kunxia Wang, Jing Yang, Ning An, Lian Li","doi":"10.1109/ACII.2015.7344596","DOIUrl":null,"url":null,"abstract":"Feature selection is a significant aspect of speech emotion recognition system. How to select a small subset out of the thousands of speech data is important for accurate classification of speech emotion. In this paper we investigate heuristic algorithm Harmony search (HS) for feature selection. We extract 3 feature sets, including MFCC, Fourier Parameters (FP), and features extracted with The Munich open Speech and Music Interpretation by Large Space Extraction (openSMILE) toolkit, from Berlin German emotion database (EMODB) and Chinese Elderly emotion database (EESDB). And combine MFCC with FP as the fourth feature set. We use Harmony search to select subsets and decrease the dimension space, and employ 10-fold cross validation in LIBSVM to evaluate the change of accuracy between selected subsets and original sets. Experimental results show that each subset's size reduced by about 50%, however, there is no sharp degeneration on accuracy and the accuracy almost maintains the original ones.","PeriodicalId":6863,"journal":{"name":"2015 International Conference on Affective Computing and Intelligent Interaction (ACII)","volume":"42 1","pages":"362-367"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Harmony search for feature selection in speech emotion recognition\",\"authors\":\"Yongsen Tao, Kunxia Wang, Jing Yang, Ning An, Lian Li\",\"doi\":\"10.1109/ACII.2015.7344596\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature selection is a significant aspect of speech emotion recognition system. How to select a small subset out of the thousands of speech data is important for accurate classification of speech emotion. In this paper we investigate heuristic algorithm Harmony search (HS) for feature selection. We extract 3 feature sets, including MFCC, Fourier Parameters (FP), and features extracted with The Munich open Speech and Music Interpretation by Large Space Extraction (openSMILE) toolkit, from Berlin German emotion database (EMODB) and Chinese Elderly emotion database (EESDB). And combine MFCC with FP as the fourth feature set. We use Harmony search to select subsets and decrease the dimension space, and employ 10-fold cross validation in LIBSVM to evaluate the change of accuracy between selected subsets and original sets. Experimental results show that each subset's size reduced by about 50%, however, there is no sharp degeneration on accuracy and the accuracy almost maintains the original ones.\",\"PeriodicalId\":6863,\"journal\":{\"name\":\"2015 International Conference on Affective Computing and Intelligent Interaction (ACII)\",\"volume\":\"42 1\",\"pages\":\"362-367\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Affective Computing and Intelligent Interaction (ACII)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACII.2015.7344596\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Affective Computing and Intelligent Interaction (ACII)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACII.2015.7344596","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Harmony search for feature selection in speech emotion recognition
Feature selection is a significant aspect of speech emotion recognition system. How to select a small subset out of the thousands of speech data is important for accurate classification of speech emotion. In this paper we investigate heuristic algorithm Harmony search (HS) for feature selection. We extract 3 feature sets, including MFCC, Fourier Parameters (FP), and features extracted with The Munich open Speech and Music Interpretation by Large Space Extraction (openSMILE) toolkit, from Berlin German emotion database (EMODB) and Chinese Elderly emotion database (EESDB). And combine MFCC with FP as the fourth feature set. We use Harmony search to select subsets and decrease the dimension space, and employ 10-fold cross validation in LIBSVM to evaluate the change of accuracy between selected subsets and original sets. Experimental results show that each subset's size reduced by about 50%, however, there is no sharp degeneration on accuracy and the accuracy almost maintains the original ones.