用参数模型描述语调

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI:10.21437/ICSLP.1998-59

G. Möhler

{"title":"用参数模型描述语调","authors":"G. Möhler","doi":"10.21437/ICSLP.1998-59","DOIUrl":null,"url":null,"abstract":"In this study a data-based approach to intonation modeling is presented. The model incorporates knowledge from intonation theories like the expected types of F 0 movements and syllable anchoring. The knowledge is integrated into the model using an appropriate approximation function for F 0 parametrization. The F 0 parameters that result from the parametrization are predicted from a set of features using neural nets. The quality of the generated contours is assessed by means of numerical measures and perception tests. They show that the basic hypotheses about intonation description and modeling are in principle correct and that they have the potential to be successfully applied to speech synthesis. We argue for a clear interface with a linguistic description (using pitch-accent and boundary labels as input) and discourse structure (using pitch-range normalized F 0 parameters), even though current text-to-speech systems usually still do not have the capability to predict most of the appropriate information.","PeriodicalId":117113,"journal":{"name":"5th International Conference on Spoken Language Processing (ICSLP 1998)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Describing intonation with a parametric model\",\"authors\":\"G. Möhler\",\"doi\":\"10.21437/ICSLP.1998-59\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this study a data-based approach to intonation modeling is presented. The model incorporates knowledge from intonation theories like the expected types of F 0 movements and syllable anchoring. The knowledge is integrated into the model using an appropriate approximation function for F 0 parametrization. The F 0 parameters that result from the parametrization are predicted from a set of features using neural nets. The quality of the generated contours is assessed by means of numerical measures and perception tests. They show that the basic hypotheses about intonation description and modeling are in principle correct and that they have the potential to be successfully applied to speech synthesis. We argue for a clear interface with a linguistic description (using pitch-accent and boundary labels as input) and discourse structure (using pitch-range normalized F 0 parameters), even though current text-to-speech systems usually still do not have the capability to predict most of the appropriate information.\",\"PeriodicalId\":117113,\"journal\":{\"name\":\"5th International Conference on Spoken Language Processing (ICSLP 1998)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"5th International Conference on Spoken Language Processing (ICSLP 1998)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21437/ICSLP.1998-59\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"5th International Conference on Spoken Language Processing (ICSLP 1998)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/ICSLP.1998-59","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

摘要

本文提出了一种基于数据的语调建模方法。该模型结合了来自语调理论的知识，如f0动作的预期类型和音节锚定。知识是集成到模型使用适当的近似函数为f0参数化。由参数化产生的f0参数使用神经网络从一组特征中预测。通过数值测量和感知测试来评估生成轮廓的质量。结果表明，关于语调描述和建模的基本假设在原则上是正确的，并且具有成功应用于语音合成的潜力。我们主张使用语言描述(使用音调重音和边界标签作为输入)和话语结构(使用音调范围归一化f0参数)的清晰界面，即使当前的文本到语音系统通常仍然没有能力预测大多数适当的信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Describing intonation with a parametric model

In this study a data-based approach to intonation modeling is presented. The model incorporates knowledge from intonation theories like the expected types of F 0 movements and syllable anchoring. The knowledge is integrated into the model using an appropriate approximation function for F 0 parametrization. The F 0 parameters that result from the parametrization are predicted from a set of features using neural nets. The quality of the generated contours is assessed by means of numerical measures and perception tests. They show that the basic hypotheses about intonation description and modeling are in principle correct and that they have the potential to be successfully applied to speech synthesis. We argue for a clear interface with a linguistic description (using pitch-accent and boundary labels as input) and discourse structure (using pitch-range normalized F 0 parameters), even though current text-to-speech systems usually still do not have the capability to predict most of the appropriate information.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

5th International Conference on Spoken Language Processing (ICSLP 1998)

自引率

0.00%

发文量

期刊最新文献

Assimilation of place in Japanese and dutch Articulatory analysis using a codebook for articulatory based low bit-rate speech coding Phonetic and phonological characteristics of paralinguistic information in spoken Japanese HMM-based visual speech recognition using intensity and location normalization Speech recognition via phonetically featured syllables