Prosody and style controls in CU VOCAL using SSML and SAPI XML tags
T. Fung, Yuk-Chi Li, H. Meng, P. Ching
2004 International Symposium on Chinese Spoken Language Processing, 2004-12-15
DOI: 10.1109/CHINSL.2004.1409623
Citations: 1
Abstract
CU VOCAL is a Cantonese text-to-speech (TTS) engine. We use a syllable-based concatenative synthesis approach to generate intelligible and natural synthetic speech in Cantonese. This paper reports on recent enhancements to CU VOCAL that let users adjust prosody and style by embedding Speech Synthesis Markup Language (SSML) tags in the input text. CU VOCAL was previously developed as a SAPI-compliant engine to enable easy integration with other applications. The paper also reports on enhancements to the CU VOCAL SAPI (Speech API) engine to support the SAPI 5 XML tags.
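To make the two markup styles concrete, the following is a minimal sketch of what prosody-annotated input might look like in each. It is not taken from the paper: the abstract does not say which SSML elements or SAPI 5 tags CU VOCAL supports, so the choice of the SSML `prosody` element and the SAPI 5 `rate`, `pitch` and `volume` tags is an assumption, as are the helper names `build_ssml` and `build_sapi_xml`, the sample text, and the `zh-HK` language tag.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C SSML 1.0 recommendation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml(text, rate="slow", pitch="high", volume="loud", lang="zh-HK"):
    """Wrap text in an SSML <prosody> element (hypothetical helper).

    The attribute values are SSML 1.0 prosody labels. Whether CU VOCAL
    accepts these particular values is an assumption, not a documented fact.
    """
    speak = ET.Element(
        "speak",
        {"version": "1.0", "xmlns": SSML_NS, "xml:lang": lang},
    )
    prosody = ET.SubElement(
        speak, "prosody", {"rate": rate, "pitch": pitch, "volume": volume}
    )
    prosody.text = text  # ElementTree escapes &, < and > on serialization
    return ET.tostring(speak, encoding="unicode")


def build_sapi_xml(text, rate=-3, pitch=5, volume=80):
    """Wrap text in nested SAPI 5 XML TTS tags (hypothetical helper).

    SAPI 5 uses numeric controls: absspeed and absmiddle are typically
    in the range -10..10, and volume level in 0..100.
    """
    rate_el = ET.Element("rate", {"absspeed": str(rate)})
    pitch_el = ET.SubElement(rate_el, "pitch", {"absmiddle": str(pitch)})
    volume_el = ET.SubElement(pitch_el, "volume", {"level": str(volume)})
    volume_el.text = text
    return ET.tostring(rate_el, encoding="unicode")


if __name__ == "__main__":
    sample = "你好"  # illustrative Cantonese input, not from the paper
    print(build_ssml(sample))
    print(build_sapi_xml(sample))
```

The sketch only builds the marked-up strings; passing them to an engine (for example through the SAPI `Speak` call) is left out because the integration details are not given in the abstract. The contrast it is meant to show is that SSML expresses prosody through attributes on one `prosody` element, while SAPI 5 XML uses a separate tag per control.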