Speech emotion recognition from Indonesian spoken language using acoustic and lexical features

2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA). Pub Date: 2017-11-01. DOI: 10.1109/ICSDA.2017.8384467

Pipin Kurniawati, D. Lestari, M. L. Khodra


Citations: 4

Abstract



This paper describes our work extending previous research on emotion recognition for spoken Indonesian. We construct an Indonesian emotional corpus (IDEC), targeting naturally occurring emotions in television talk shows. IDEC is used to build an emotion recognizer based on two main feature types: acoustic and lexical. Support Vector Machine (SVM), Random Forest (RF), and Multinomial Naive Bayes (MNB) algorithms are employed to model the emotions. Experimental results show that SVM outperforms RF and MNB, achieving an average F-measure of 0.713 over six emotion classes when acoustic and lexical features are combined.
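As a rough illustration of the reported metric, the sketch below computes per-class F-measures and their unweighted (macro) average from true and predicted labels. The abstract does not say whether the average is macro or weighted, so the macro choice is an assumption, and the function names and the three placeholder labels are hypothetical rather than taken from the paper.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: F1} for every label seen in y_true or y_pred."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging scheme)."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


# Toy data with placeholder class names (not the paper's emotion labels).
y_true = ["a", "a", "b", "b", "c", "c"]
y_pred = ["a", "b", "b", "b", "c", "a"]
scores = per_class_f1(y_true, y_pred)
average = macro_f1(y_true, y_pred)
```

With six emotion classes, the same functions would average six per-class scores; a weighted average would instead scale each class score by its share of the test utterances.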

