A subjective evaluation of the effects of speech coding on the perception of emotions
Authors: Felix Labelle, R. Lefebvre, P. Gournay
Venue: 2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)
Published: 2016-10-01
DOI: 10.1109/ISPACS.2016.7824685 (https://doi.org/10.1109/ISPACS.2016.7824685)
How accurately speech coders reproduce emotions has only recently been identified as a relevant issue. Several published studies have shown that speech compression reduces the accuracy of emotion classification. These studies, however, were all conducted using objective evaluation methods that rely on an automatic classifier. The only definitive way to prove or disprove that compression degrades the emotional content of a speech signal is to test it with human subjects. This paper proposes a subjective evaluation method and applies it to emotional speech coded by the AMR-WB speech coder at 6.6 and 12.65 kbps. The results confirm that the perception of emotions by human listeners is significantly degraded at both bit rates. The proposed evaluation method, and the insight provided by the results, could be useful in developing new speech coders that better preserve the emotional content of speech signals.
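The abstract does not say how the listening-test responses were analysed, so the following is only a minimal sketch of one plausible analysis, not the authors' method. It assumes a forced-choice test in which each listener labels the same utterance once uncoded and once after coding, and it compares the two conditions with an exact McNemar test. The function names, the emotion labels, and the response lists are all hypothetical placeholders, not data from the paper.

```python
from math import comb


def recognition_rate(responses, intended):
    """Fraction of trials where the listener's label matches the intended emotion."""
    return sum(r == t for r, t in zip(responses, intended)) / len(intended)


def mcnemar_exact(correct_ref, correct_coded):
    """Two-sided exact McNemar test on paired correct/incorrect outcomes.

    correct_ref[i] and correct_coded[i] are booleans for the same trial
    heard uncoded and coded, respectively. Returns the p-value.
    """
    pairs = list(zip(correct_ref, correct_coded))
    only_ref = sum(r and not c for r, c in pairs)    # right on uncoded only
    only_coded = sum(c and not r for r, c in pairs)  # right on coded only
    n = only_ref + only_coded
    if n == 0:
        return 1.0  # no discordant pairs, so no evidence of a difference
    k = min(only_ref, only_coded)
    # Binomial(n, 0.5) lower tail, doubled for a two-sided test.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


if __name__ == "__main__":
    # Hypothetical placeholder responses, for illustration only.
    intended = ["anger", "joy", "sadness", "fear", "neutral", "anger"]
    uncoded = ["anger", "joy", "sadness", "fear", "neutral", "anger"]
    coded = ["anger", "neutral", "sadness", "neutral", "neutral", "joy"]

    ref_ok = [r == t for r, t in zip(uncoded, intended)]
    cod_ok = [r == t for r, t in zip(coded, intended)]
    print("uncoded rate:", recognition_rate(uncoded, intended))
    print("coded rate:  ", recognition_rate(coded, intended))
    print("p-value:     ", mcnemar_exact(ref_ok, cod_ok))
```

A paired test is used here because the same utterances are heard in both conditions. With real data, the comparison would be run once per bit rate (6.6 and 12.65 kbps) against the uncoded reference.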