大半音共振过渡的分析与合成结论

Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on Pub Date : 2003-07-10 DOI:10.1109/SCS.2003.1226977

D. Jitca, V. Apopei

{"title":"大半音共振过渡的分析与合成结论","authors":"D. Jitca, V. Apopei","doi":"10.1109/SCS.2003.1226977","DOIUrl":null,"url":null,"abstract":"A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.","PeriodicalId":375963,"journal":{"name":"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Conclusions on analysis and synthesis of large semivocalic formantic transitions\",\"authors\":\"D. Jitca, V. Apopei\",\"doi\":\"10.1109/SCS.2003.1226977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.\",\"PeriodicalId\":375963,\"journal\":{\"name\":\"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCS.2003.1226977\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCS.2003.1226977","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在文本到语音的共振系统中，一个非常重要的问题是单词上下文中不同连续语音之间的共振转换。虽然小规模的转变可以被视为线性转变，但为了模拟等效的自然转变，必须仔细考虑大规模的转变。当声门兴奋在这种转换过程中活跃时，就会产生特定的声音，需要明确的处理。我们在klatt型合成器中模拟大规模形成峰转变的方法是将大转变周期划分为减小斜率(相对稳态)和大斜率(稳态之间的过渡)段。短的恒频段是通过插入相应的音素来创建的，由各自的共振集描述。建模过程以半元音i为例。作为合成的起点，我们分析包含半元音i的单词，将其作为双元音的一部分，前面有不发音或发音的腭辅音。用Klatt合成参数描述被分析词的语音合成。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Conclusions on analysis and synthesis of large semivocalic formantic transitions

A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on

自引率

0.00%

发文量