{"title":"Conclusions on analysis and synthesis of large semivocalic formantic transitions","authors":"D. Jitca, V. Apopei","doi":"10.1109/SCS.2003.1226977","DOIUrl":null,"url":null,"abstract":"A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.","PeriodicalId":375963,"journal":{"name":"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCS.2003.1226977","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.