{"title":"Analysis and modification of spectral energy for neutral to sad emotion conversion","authors":"Arijul Haque, K. S. Rao","doi":"10.1109/IC3.2015.7346690","DOIUrl":null,"url":null,"abstract":"This work explores the spectral energies of neutral, sad and angry speech, and analyzes the potential of spectral energy modification to convert neutral speech to sad/angry speech. A method of modifying the spectral energy of neutral speech signals based on a filter bank implementation is proposed for the purpose of converting a given neutral speech to a target emotional speech. Since pitch plays a vital role in emotion expression, we modify the pitch contour first by using the method of Gaussian normalization. This is followed by modification of spectral energy using a method proposed in this paper. The expressiveness of the resultant speech is compared with speech obtained by modifying only the pitch contour, and we have observed improvements in expressiveness due to incorporation of proposed spectral energy modification. The method is found to be quite good for neutral to sad conversion. However, the quality of conversion to anger is not good, and the reasons behind this are analyzed.","PeriodicalId":217950,"journal":{"name":"2015 Eighth International Conference on Contemporary Computing (IC3)","volume":"148 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Eighth International Conference on Contemporary Computing (IC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3.2015.7346690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
This work explores the spectral energies of neutral, sad and angry speech, and analyzes the potential of spectral energy modification to convert neutral speech to sad/angry speech. A method of modifying the spectral energy of neutral speech signals based on a filter bank implementation is proposed for the purpose of converting a given neutral speech to a target emotional speech. Since pitch plays a vital role in emotion expression, we modify the pitch contour first by using the method of Gaussian normalization. This is followed by modification of spectral energy using a method proposed in this paper. The expressiveness of the resultant speech is compared with speech obtained by modifying only the pitch contour, and we have observed improvements in expressiveness due to incorporation of proposed spectral energy modification. The method is found to be quite good for neutral to sad conversion. However, the quality of conversion to anger is not good, and the reasons behind this are analyzed.