{"title":"利用马来语自然语音进行情绪识别","authors":"Afiq Aiman Ahmad Fairuz Rizal, N. Hashim","doi":"10.1109/ICSIPA52582.2021.9576788","DOIUrl":null,"url":null,"abstract":"In recent years, the technology of emotion speech recognition has gradually become more important to the industries. This is proven by the integration of this system into many applications such as the interface with robots, audio surveillance, web-based E-learning, commercial applications, clinical studies and so on. Generally, speech emotion recognition (SER) is developed to help humans to understand and retrieve desired emotions. In this research, the analysis of using Bahasa Malaysia Language for three basic emotions of happy, sad and angry was analyzed. A total of 30 male and 30 female audio recordings were collected. Mel-frequency cepstral coefficient, chroma and mel spectrogram features were extracted. Feature dimensions were reduced using forward, backward and exhaustive selection methods before classification. Classification was performed using K-nearest neighbors, Support Vector Machine and Random Forest. The analysis demonstrated 78% accuracy for male speech and 78% for female speech.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"41 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Emotion Recognition Using Bahasa Malaysia Natural Speech\",\"authors\":\"Afiq Aiman Ahmad Fairuz Rizal, N. Hashim\",\"doi\":\"10.1109/ICSIPA52582.2021.9576788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the technology of emotion speech recognition has gradually become more important to the industries. This is proven by the integration of this system into many applications such as the interface with robots, audio surveillance, web-based E-learning, commercial applications, clinical studies and so on. Generally, speech emotion recognition (SER) is developed to help humans to understand and retrieve desired emotions. In this research, the analysis of using Bahasa Malaysia Language for three basic emotions of happy, sad and angry was analyzed. A total of 30 male and 30 female audio recordings were collected. Mel-frequency cepstral coefficient, chroma and mel spectrogram features were extracted. Feature dimensions were reduced using forward, backward and exhaustive selection methods before classification. Classification was performed using K-nearest neighbors, Support Vector Machine and Random Forest. The analysis demonstrated 78% accuracy for male speech and 78% for female speech.\",\"PeriodicalId\":326688,\"journal\":{\"name\":\"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)\",\"volume\":\"41 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSIPA52582.2021.9576788\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA52582.2021.9576788","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Emotion Recognition Using Bahasa Malaysia Natural Speech
In recent years, the technology of emotion speech recognition has gradually become more important to the industries. This is proven by the integration of this system into many applications such as the interface with robots, audio surveillance, web-based E-learning, commercial applications, clinical studies and so on. Generally, speech emotion recognition (SER) is developed to help humans to understand and retrieve desired emotions. In this research, the analysis of using Bahasa Malaysia Language for three basic emotions of happy, sad and angry was analyzed. A total of 30 male and 30 female audio recordings were collected. Mel-frequency cepstral coefficient, chroma and mel spectrogram features were extracted. Feature dimensions were reduced using forward, backward and exhaustive selection methods before classification. Classification was performed using K-nearest neighbors, Support Vector Machine and Random Forest. The analysis demonstrated 78% accuracy for male speech and 78% for female speech.