Chung-Hsien Wu, C. Hsia, Jiun-Fu Chen, Te-Hsien Liu
{"title":"Variable-length unit selection using LSA-based syntactic structure cost","authors":"Chung-Hsien Wu, C. Hsia, Jiun-Fu Chen, Te-Hsien Liu","doi":"10.1109/CHINSL.2004.1409621","DOIUrl":null,"url":null,"abstract":"The paper introduces a variable-length unit selection method for concatenative speech synthesis based on a syntactic structure based on latent semantic analysis (LSA). First, a probabilistic context free grammar (PCFG) based parser is used to construct the syntactic structure of the input text sentence. Second, the synthesizer selects the candidate units for each node of the syntactic structure. LSA is then adopted to estimate the syntactic cost between the target unit and the candidate units in the database. Finally, the concatenation of units with minimum cost is selected using a dynamic programming algorithm. Experimental results show that variable-length unit selection based on syntactic structure outperforms the synthesizer that does not consider syntactic structure. Also, the LSA-based syntactic cost provides a better estimation of substitution cost than that calculated only from acoustic features.","PeriodicalId":212562,"journal":{"name":"2004 International Symposium on Chinese Spoken Language Processing","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2004 International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CHINSL.2004.1409621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The paper introduces a variable-length unit selection method for concatenative speech synthesis based on a syntactic structure based on latent semantic analysis (LSA). First, a probabilistic context free grammar (PCFG) based parser is used to construct the syntactic structure of the input text sentence. Second, the synthesizer selects the candidate units for each node of the syntactic structure. LSA is then adopted to estimate the syntactic cost between the target unit and the candidate units in the database. Finally, the concatenation of units with minimum cost is selected using a dynamic programming algorithm. Experimental results show that variable-length unit selection based on syntactic structure outperforms the synthesizer that does not consider syntactic structure. Also, the LSA-based syntactic cost provides a better estimation of substitution cost than that calculated only from acoustic features.