{"title":"基于连续F0模型的跨流依赖建模用于基于hmm的语音合成","authors":"Xin Wang, Zhenhua Ling, Lirong Dai","doi":"10.1109/ISCSLP.2012.6423457","DOIUrl":null,"url":null,"abstract":"In our previous work, we have presented a cross-stream dependency modeling method for hidden Markov model (HMM) based parametric speech synthesis. In this method, multi-space probability distribution (MSD) was adopted for F0 modeling and the voicing decision error influenced the accuracy of generated spectral features severely. Therefore, a cross-stream dependency modeling method using continuous F0 HMM (CF-HMM) is proposed in this paper to circumvent voicing decision during the generation of spectral features. Besides, in order to prevent over-fitting problem in model training, regression class is introduced to tie the transform matrices in dependency models. Experiments on proposed methods show both improvement on the accuracy of the generated spectral features and effectiveness of introducing regression class in dependency model training.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis\",\"authors\":\"Xin Wang, Zhenhua Ling, Lirong Dai\",\"doi\":\"10.1109/ISCSLP.2012.6423457\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In our previous work, we have presented a cross-stream dependency modeling method for hidden Markov model (HMM) based parametric speech synthesis. In this method, multi-space probability distribution (MSD) was adopted for F0 modeling and the voicing decision error influenced the accuracy of generated spectral features severely. Therefore, a cross-stream dependency modeling method using continuous F0 HMM (CF-HMM) is proposed in this paper to circumvent voicing decision during the generation of spectral features. Besides, in order to prevent over-fitting problem in model training, regression class is introduced to tie the transform matrices in dependency models. Experiments on proposed methods show both improvement on the accuracy of the generated spectral features and effectiveness of introducing regression class in dependency model training.\",\"PeriodicalId\":186099,\"journal\":{\"name\":\"2012 8th International Symposium on Chinese Spoken Language Processing\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 8th International Symposium on Chinese Spoken Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCSLP.2012.6423457\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 8th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2012.6423457","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis
In our previous work, we have presented a cross-stream dependency modeling method for hidden Markov model (HMM) based parametric speech synthesis. In this method, multi-space probability distribution (MSD) was adopted for F0 modeling and the voicing decision error influenced the accuracy of generated spectral features severely. Therefore, a cross-stream dependency modeling method using continuous F0 HMM (CF-HMM) is proposed in this paper to circumvent voicing decision during the generation of spectral features. Besides, in order to prevent over-fitting problem in model training, regression class is introduced to tie the transform matrices in dependency models. Experiments on proposed methods show both improvement on the accuracy of the generated spectral features and effectiveness of introducing regression class in dependency model training.