大半音共振过渡的分析与合成结论

D. Jitca, V. Apopei
{"title":"大半音共振过渡的分析与合成结论","authors":"D. Jitca, V. Apopei","doi":"10.1109/SCS.2003.1226977","DOIUrl":null,"url":null,"abstract":"A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.","PeriodicalId":375963,"journal":{"name":"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Conclusions on analysis and synthesis of large semivocalic formantic transitions\",\"authors\":\"D. Jitca, V. Apopei\",\"doi\":\"10.1109/SCS.2003.1226977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.\",\"PeriodicalId\":375963,\"journal\":{\"name\":\"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCS.2003.1226977\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCS.2003.1226977","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在文本到语音的共振系统中,一个非常重要的问题是单词上下文中不同连续语音之间的共振转换。虽然小规模的转变可以被视为线性转变,但为了模拟等效的自然转变,必须仔细考虑大规模的转变。当声门兴奋在这种转换过程中活跃时,就会产生特定的声音,需要明确的处理。我们在klatt型合成器中模拟大规模形成峰转变的方法是将大转变周期划分为减小斜率(相对稳态)和大斜率(稳态之间的过渡)段。短的恒频段是通过插入相应的音素来创建的,由各自的共振集描述。建模过程以半元音i为例。作为合成的起点,我们分析包含半元音i的单词,将其作为双元音的一部分,前面有不发音或发音的腭辅音。用Klatt合成参数描述被分析词的语音合成。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Conclusions on analysis and synthesis of large semivocalic formantic transitions
A very important issue in formantic Text-to-Speech (TtS) systems refers to formants transitions between various successive sounds in the word context. While the small-scale transitions can be treated as linear transitions, the large ones must be carefully considered in order to model the equivalent natural transition. When glottal excitation is active during such transitions, specific sounds are produced that require explicit treatment. Our approach to modeling large-scale formant transitions in Klatt-type synthesizers refers to dividing the large transition period into segments of reduced slope (relative steady state) and large slope (transitions between steady states). The short constant frequency segments are created by inserting the corresponding allophones, described by the respective formantic set. The modeling process is exemplified by the case of semivowel i. As a starting point for the synthesis, we analyze words containing semivowel i as part of diphthongs preceded by unvoiced or voiced palatal consonants. The speech synthesis of the analyzed words is depicted in terms of Klatt synthesis parameters.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Genetic algorithm based dynamic channel assignment for cellular radio networks Voltage controlled integrators/differentiators using current feedback amplifier A low noise-high counting rate readout system for X-ray imaging applications Implementation of 3D-DCT based video encoder/decoder system Periodic chaotic spreading sequences with better correlation properties than conventional sequences - BER performances analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1