{"title":"基于堆叠泛化集成神经网络的机器人宠物语音情感识别研究","authors":"Yongming Huang, Guobao Zhang, Xiaoli Xu","doi":"10.1109/CCPR.2009.5344020","DOIUrl":null,"url":null,"abstract":"In this paper, we present an emotion recognition system using the stacked generalization ensemble neural network for special human affective state in the speech signal. 450 short emotional sentences with different contents from 3 speakers were collected as experiment materials. The features relevant with energy, speech rate, pitch and formant are extracted from speech signals. Stacked Generalization Ensemble Neural Networks are used as the classifier for 5 emotions including anger, calmness, happiness, sadness and boredom. First, compared with the traditional BP network or wavelet neural network, the results of experiments show that the Stacked Generalization Ensemble Neural Network has faster convergence speed and higher recognition rate. Second, after discussing the advantage and disadvantage between different ensemble Neural Networks, suitable decision will be made for Robot Pet.","PeriodicalId":354468,"journal":{"name":"2009 Chinese Conference on Pattern Recognition","volume":"120 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Speech Emotion Recognition Research Based on the Stacked Generalization Ensemble Neural Network for Robot Pet\",\"authors\":\"Yongming Huang, Guobao Zhang, Xiaoli Xu\",\"doi\":\"10.1109/CCPR.2009.5344020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present an emotion recognition system using the stacked generalization ensemble neural network for special human affective state in the speech signal. 450 short emotional sentences with different contents from 3 speakers were collected as experiment materials. The features relevant with energy, speech rate, pitch and formant are extracted from speech signals. Stacked Generalization Ensemble Neural Networks are used as the classifier for 5 emotions including anger, calmness, happiness, sadness and boredom. First, compared with the traditional BP network or wavelet neural network, the results of experiments show that the Stacked Generalization Ensemble Neural Network has faster convergence speed and higher recognition rate. Second, after discussing the advantage and disadvantage between different ensemble Neural Networks, suitable decision will be made for Robot Pet.\",\"PeriodicalId\":354468,\"journal\":{\"name\":\"2009 Chinese Conference on Pattern Recognition\",\"volume\":\"120 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Chinese Conference on Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCPR.2009.5344020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Chinese Conference on Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCPR.2009.5344020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Emotion Recognition Research Based on the Stacked Generalization Ensemble Neural Network for Robot Pet
In this paper, we present an emotion recognition system using the stacked generalization ensemble neural network for special human affective state in the speech signal. 450 short emotional sentences with different contents from 3 speakers were collected as experiment materials. The features relevant with energy, speech rate, pitch and formant are extracted from speech signals. Stacked Generalization Ensemble Neural Networks are used as the classifier for 5 emotions including anger, calmness, happiness, sadness and boredom. First, compared with the traditional BP network or wavelet neural network, the results of experiments show that the Stacked Generalization Ensemble Neural Network has faster convergence speed and higher recognition rate. Second, after discussing the advantage and disadvantage between different ensemble Neural Networks, suitable decision will be made for Robot Pet.