{"title":"混响在语音情感识别中的作用","authors":"Shujie Zhao, Yan Yang, Jingdong Chen","doi":"10.1109/ICSEE.2018.8645989","DOIUrl":null,"url":null,"abstract":"In room environment, echo, reverberation, interference and additive noise cast the major challenges for emotional speech recognition due to degradation in quality and reliability of recorded speech signals. In this paper, we investigate effects of reverberation and noise on speech-based emotion recognition by comparing clean speech signal, adding simulated reverberant data, de-reverberant data and signal with added noise. First, we develop an emotional speech corpus of these four kinds of emotional speech data sources. Then we apply GMM-UBM framework to evaluate the performance of emotion recognition based on them. Results show that reverberation reduces emotion recognition accuracy by 5.87%, and a process of de-reverberation can largely cover this reduction.","PeriodicalId":254455,"journal":{"name":"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effect of Reverberation in Speech-based Emotion Recognition\",\"authors\":\"Shujie Zhao, Yan Yang, Jingdong Chen\",\"doi\":\"10.1109/ICSEE.2018.8645989\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In room environment, echo, reverberation, interference and additive noise cast the major challenges for emotional speech recognition due to degradation in quality and reliability of recorded speech signals. In this paper, we investigate effects of reverberation and noise on speech-based emotion recognition by comparing clean speech signal, adding simulated reverberant data, de-reverberant data and signal with added noise. First, we develop an emotional speech corpus of these four kinds of emotional speech data sources. Then we apply GMM-UBM framework to evaluate the performance of emotion recognition based on them. Results show that reverberation reduces emotion recognition accuracy by 5.87%, and a process of de-reverberation can largely cover this reduction.\",\"PeriodicalId\":254455,\"journal\":{\"name\":\"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)\",\"volume\":\"91 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSEE.2018.8645989\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSEE.2018.8645989","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Effect of Reverberation in Speech-based Emotion Recognition
In room environment, echo, reverberation, interference and additive noise cast the major challenges for emotional speech recognition due to degradation in quality and reliability of recorded speech signals. In this paper, we investigate effects of reverberation and noise on speech-based emotion recognition by comparing clean speech signal, adding simulated reverberant data, de-reverberant data and signal with added noise. First, we develop an emotional speech corpus of these four kinds of emotional speech data sources. Then we apply GMM-UBM framework to evaluate the performance of emotion recognition based on them. Results show that reverberation reduces emotion recognition accuracy by 5.87%, and a process of de-reverberation can largely cover this reduction.