{"title":"基于语音的情感识别","authors":"Preeti Chawaj, S. R. Khot","doi":"10.23883/ijrter.2019.5069.se0h6","DOIUrl":null,"url":null,"abstract":"This paper presents a method to identify the emotion of an audio segment with an intention to recognize human emotional/mental status. Four features namely energy, pitch, Formants, Mel frequency cepstral coefficients (MFCC) and their derivatives are used to recognize emotions such as fear, anger, happiness and sadness. PCA is used to reduce the feature dimensionality. Support vector machine is implemented to perform the emotional state classification. The overall recognition rate obtained is 84.99% using samples of Berlin emotional speech database. Keywords—MFCC, Formants, Pitch, Energy, PCA, Support Vector Machine (SVM)","PeriodicalId":143099,"journal":{"name":"INTERNATIONAL JOURNAL OF RECENT TRENDS IN ENGINEERING & RESEARCH","volume":"110 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speech Based Emotion Recognition\",\"authors\":\"Preeti Chawaj, S. R. Khot\",\"doi\":\"10.23883/ijrter.2019.5069.se0h6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a method to identify the emotion of an audio segment with an intention to recognize human emotional/mental status. Four features namely energy, pitch, Formants, Mel frequency cepstral coefficients (MFCC) and their derivatives are used to recognize emotions such as fear, anger, happiness and sadness. PCA is used to reduce the feature dimensionality. Support vector machine is implemented to perform the emotional state classification. The overall recognition rate obtained is 84.99% using samples of Berlin emotional speech database. Keywords—MFCC, Formants, Pitch, Energy, PCA, Support Vector Machine (SVM)\",\"PeriodicalId\":143099,\"journal\":{\"name\":\"INTERNATIONAL JOURNAL OF RECENT TRENDS IN ENGINEERING & RESEARCH\",\"volume\":\"110 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"INTERNATIONAL JOURNAL OF RECENT TRENDS IN ENGINEERING & RESEARCH\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23883/ijrter.2019.5069.se0h6\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"INTERNATIONAL JOURNAL OF RECENT TRENDS IN ENGINEERING & RESEARCH","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23883/ijrter.2019.5069.se0h6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper presents a method to identify the emotion of an audio segment with an intention to recognize human emotional/mental status. Four features namely energy, pitch, Formants, Mel frequency cepstral coefficients (MFCC) and their derivatives are used to recognize emotions such as fear, anger, happiness and sadness. PCA is used to reduce the feature dimensionality. Support vector machine is implemented to perform the emotional state classification. The overall recognition rate obtained is 84.99% using samples of Berlin emotional speech database. Keywords—MFCC, Formants, Pitch, Energy, PCA, Support Vector Machine (SVM)