{"title":"基于声学和词汇特征的印尼语语音情感识别","authors":"Pipin Kurniawati, D. Lestari, M. L. Khodra","doi":"10.1109/ICSDA.2017.8384467","DOIUrl":null,"url":null,"abstract":"This paper describes our works to extend the previous work on emotion recognition for Indonesian spoken language. In this research, we construct an Indonesian emotional corpus (IDEC). In constructing the corpus, we aim at natural emotional occurrences from television talk shows. IDEC is utilized to construct the emotion recognizer using two main features, acoustic and lexical features. The Support Vector Machine (SVM), Random Forest (RF), and Multinomial Naive Bayes (MNB) algorithms are employed to model the emotions. Experiment result shows that SVM outperforms the RF and MNB algorithms. It achieves an average F- measure of 0.713 for 6 emotion classes by combining both acoustic and lexical features.","PeriodicalId":255147,"journal":{"name":"2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Speech emotion recognition from Indonesian spoken language using acoustic and lexical features\",\"authors\":\"Pipin Kurniawati, D. Lestari, M. L. Khodra\",\"doi\":\"10.1109/ICSDA.2017.8384467\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes our works to extend the previous work on emotion recognition for Indonesian spoken language. In this research, we construct an Indonesian emotional corpus (IDEC). In constructing the corpus, we aim at natural emotional occurrences from television talk shows. IDEC is utilized to construct the emotion recognizer using two main features, acoustic and lexical features. The Support Vector Machine (SVM), Random Forest (RF), and Multinomial Naive Bayes (MNB) algorithms are employed to model the emotions. Experiment result shows that SVM outperforms the RF and MNB algorithms. It achieves an average F- measure of 0.713 for 6 emotion classes by combining both acoustic and lexical features.\",\"PeriodicalId\":255147,\"journal\":{\"name\":\"2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSDA.2017.8384467\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSDA.2017.8384467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech emotion recognition from Indonesian spoken language using acoustic and lexical features
This paper describes our works to extend the previous work on emotion recognition for Indonesian spoken language. In this research, we construct an Indonesian emotional corpus (IDEC). In constructing the corpus, we aim at natural emotional occurrences from television talk shows. IDEC is utilized to construct the emotion recognizer using two main features, acoustic and lexical features. The Support Vector Machine (SVM), Random Forest (RF), and Multinomial Naive Bayes (MNB) algorithms are employed to model the emotions. Experiment result shows that SVM outperforms the RF and MNB algorithms. It achieves an average F- measure of 0.713 for 6 emotion classes by combining both acoustic and lexical features.