{"title":"Cross gender voice morphing using Canonical Correlation Analysis","authors":"I. Baseer, Rabeea Basir","doi":"10.1109/C-CODE.2017.7918947","DOIUrl":null,"url":null,"abstract":"Voice morphing one of the speech synthesis frameworks, in simplest term aim to transforms speaker's identity from source to target speaker while preserving the original content of message. This paper presents a novel spectral envelope mapping algorithm based on Canonical Correlation Analysis(CCA) that find the association between spectral envelope characteristics of source speaker and target speaker in terms of correlation as a similarity metric. Moreover, the speech also undergoes to prosodic modification using PSOLA as pitch frequency is also an important parameter for varying identity. This morphing algorithm is evaluated by taking the utterances from freely available CMU-ARCTIC speech dataset. The subjective experiment shows that the proposed method successfully transforms speaker identity and produced high-quality morphed signal.","PeriodicalId":344222,"journal":{"name":"2017 International Conference on Communication, Computing and Digital Systems (C-CODE)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Communication, Computing and Digital Systems (C-CODE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/C-CODE.2017.7918947","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Voice morphing one of the speech synthesis frameworks, in simplest term aim to transforms speaker's identity from source to target speaker while preserving the original content of message. This paper presents a novel spectral envelope mapping algorithm based on Canonical Correlation Analysis(CCA) that find the association between spectral envelope characteristics of source speaker and target speaker in terms of correlation as a similarity metric. Moreover, the speech also undergoes to prosodic modification using PSOLA as pitch frequency is also an important parameter for varying identity. This morphing algorithm is evaluated by taking the utterances from freely available CMU-ARCTIC speech dataset. The subjective experiment shows that the proposed method successfully transforms speaker identity and produced high-quality morphed signal.