Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML

Adriana Souza, D. Freitas
{"title":"Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML","authors":"Adriana Souza, D. Freitas","doi":"10.1145/3439231.3440617","DOIUrl":null,"url":null,"abstract":"The use of the MathML language made possible to improve the accessibility of mathematics for blind or low-vision persons in digital media. Synthetic speech technologies have advanced significantly using MathML, however, the speech synthesizers' standard reading style is still not suitable for mathematics. Making mathematical reading of the speech synthesizers more natural and expressive is still a challenge. The creation of models to produce the appropriate prosody in the synthesized speech of math content is therefore necessary, as shown in previous research. This article presents a proposal for a model to improve prosody in the synthesized speech of mathematical expressions based on MathML. A corpus of mathematical expressions spoken by Mathematics teachers was created to support the model's development. The Fujisaki intonation model was adopted for intonation control, accent and phrase commands have been extracted from the corpus, and some adjustments have been made to manipulate prosodic parameters in the speech of mathematical expression in correlation with the MathML tree; additionally, a pattern of pauses control is being created.","PeriodicalId":210400,"journal":{"name":"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3439231.3440617","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The use of the MathML language made possible to improve the accessibility of mathematics for blind or low-vision persons in digital media. Synthetic speech technologies have advanced significantly using MathML, however, the speech synthesizers' standard reading style is still not suitable for mathematics. Making mathematical reading of the speech synthesizers more natural and expressive is still a challenge. The creation of models to produce the appropriate prosody in the synthesized speech of math content is therefore necessary, as shown in previous research. This article presents a proposal for a model to improve prosody in the synthesized speech of mathematical expressions based on MathML. A corpus of mathematical expressions spoken by Mathematics teachers was created to support the model's development. The Fujisaki intonation model was adopted for intonation control, accent and phrase commands have been extracted from the corpus, and some adjustments have been made to manipulate prosodic parameters in the speech of mathematical expression in correlation with the MathML tree; additionally, a pattern of pauses control is being created.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MathML中数学表达式合成语音的韵律模型
MathML语言的使用使盲人或弱视人士在数字媒体上更容易获得数学知识成为可能。使用MathML,合成语音技术有了很大的进步,然而,语音合成器的标准阅读风格仍然不适合数学。使语音合成器的数学阅读更加自然和富有表现力仍然是一个挑战。因此,如以往的研究所示,在数学内容的合成语音中创建产生适当韵律的模型是必要的。本文提出了一种基于MathML的数学表达式合成语音韵律改善模型。建立了数学教师的数学表达语料库,以支持模型的发展。采用Fujisaki语调模型进行语调控制,从语料库中提取重音和短语命令,并根据MathML树对数学表达式语音中的韵律参数进行调整;此外,还创建了一个暂停控制模式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Can children of typical development benefit from inclusion intervention with Daisy Robot - a socially assistive robot? Pedagogical Triangulations: from the online forum to the e-magazine: a praxiological experience about school and its actor during COVID19 confinement CovidSense: A Smartphone-based Initiative for Fighting COVID-19 Spreading Apple Siri (input) + Voice Over (output) = a de facto marriage: An exploratory case study with blind people Is Simple English Wikipedia As Simple And Easy-to-Understand As We Expect It To Be?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1