The Information Structure–prosody interface in text-to-speech technologies. An empirical perspective

Corpus Linguistics and Linguistic Theory (IF 1; Zone 2, Literature; category: LANGUAGE & LINGUISTICS). Pub Date: 2021-01-28. DOI: 10.1515/CLLT-2020-0008
Mónica Domínguez, M. Farrús, L. Wanner
Corpus Linguistics and Linguistic Theory, vol. 18, pp. 419-445. Journal Article. https://doi.org/10.1515/CLLT-2020-0008
Citations: 1

Abstract

The correspondence between the communicative intention of a speaker in terms of Information Structure and the way this speaker reflects communicative aspects by means of prosody has been a fruitful field of study in Linguistics. However, text-to-speech applications still lack the variability and richness found in human speech in terms of how humans display their communication skills. Some attempts were made in the past to model one aspect of Information Structure, namely thematicity, for its application to intonation generation in text-to-speech technologies. Yet these applications suffer from two limitations: (i) they draw upon a small number of made-up simple question-answer pairs rather than on real (spoken or written) corpus material; and (ii) they do not explore whether any other interpretation would better suit a wider range of textual genres beyond dialogs. In this paper, two different interpretations of thematicity in the field of speech technologies are examined: the state-of-the-art binary (and flat) theme-rheme, and the hierarchical thematicity defined by Igor Mel’čuk within the Meaning-Text Theory. The outcome of the experiments on a corpus of native speakers of US English suggests that the latter interpretation of thematicity has a versatile implementation potential for text-to-speech applications of the Information Structure–prosody interface.
Source journal
CiteScore: 4.20
Self-citation rate: 12.50%
Articles published: 15
About the journal: Corpus Linguistics and Linguistic Theory (CLLT) is a peer-reviewed journal publishing high-quality original corpus-based research focusing on theoretically relevant issues in all core areas of linguistic research, or other recognized topic areas. It provides a forum for researchers from different theoretical backgrounds and different areas of interest who share a commitment to the systematic and exhaustive analysis of naturally occurring language. Contributions from all theoretical frameworks are welcome, but they should be addressed to a general audience and thus be explicit about their assumptions and discovery procedures and provide sufficient theoretical background to be accessible to researchers from different frameworks.
Topics: Corpus Linguistics, Quantitative Linguistics, Phonology, Morphology, Semantics, Syntax, Pragmatics.
Latest articles from this journal
The red dress is cute: why subjective adjectives are more often predicative
A corpus-based study on semantic and cognitive features of bei sentences in Mandarin Chinese
Verb influence on French wh-placement: a parallel corpus study
Idiosyncratic entrenchment: tracing change in constructional schematicity with nested random effects
Transfer five ways: applications of multiple distinctive collexeme analysis to the dative alternation in Mandarin Chinese