A hybrid neural network/rule based system for bilingual text-to-phoneme mapping

E. B. Bilcu, J. Astola, J. Saarinen
{"title":"A hybrid neural network/rule based system for bilingual text-to-phoneme mapping","authors":"E. B. Bilcu, J. Astola, J. Saarinen","doi":"10.1109/MLSP.2004.1422992","DOIUrl":null,"url":null,"abstract":"Text-to-phoneme (TTP) mapping is a preliminary step in text-to-speech synthesis and it affects the naturalness and understandability of synthetic speech. In this paper, we propose a hybrid neural network/rule based system for bilingual text-to-phoneme mapping. Our system uses three neural networks and a simple rule to perform the phoneme transcription. The first network is trained to convert the letters from the first language into their corresponding phonemes, the second one is used to obtain the phonemes for the second language whereas the third neural network together with a simple rule is responsible of the language recognition. The proposed approach can be easily extended for multilingual applications when more neural networks are introduced. Simulations performed on a bilingual dictionary (English+French) show the improvements in terms of phoneme accuracy of our method against the approach that uses a single neural network for multilingual TTP","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"109 1","pages":"345-354"},"PeriodicalIF":0.0000,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"信号处理","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.1109/MLSP.2004.1422992","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Text-to-phoneme (TTP) mapping is a preliminary step in text-to-speech synthesis and it affects the naturalness and understandability of synthetic speech. In this paper, we propose a hybrid neural network/rule based system for bilingual text-to-phoneme mapping. Our system uses three neural networks and a simple rule to perform the phoneme transcription. The first network is trained to convert the letters from the first language into their corresponding phonemes, the second one is used to obtain the phonemes for the second language whereas the third neural network together with a simple rule is responsible of the language recognition. The proposed approach can be easily extended for multilingual applications when more neural networks are introduced. Simulations performed on a bilingual dictionary (English+French) show the improvements in terms of phoneme accuracy of our method against the approach that uses a single neural network for multilingual TTP
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于混合神经网络/规则的双语文本到音素映射系统
文本-音素映射是文本-语音合成的第一步,它直接影响到合成语音的自然度和可理解性。本文提出了一种基于神经网络/规则的双语文本-音素映射混合系统。我们的系统使用三个神经网络和一个简单的规则来执行音素转录。第一个神经网络用于将第一语言中的字母转换为对应的音素,第二个神经网络用于获取第二语言的音素,第三个神经网络与一个简单的规则一起负责语言识别。当引入更多的神经网络时,该方法可以很容易地扩展到多语言应用中。在双语词典(英语+法语)上进行的模拟表明,与使用单一神经网络进行多语言TTP的方法相比,我们的方法在音素准确性方面有所提高
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
5812
期刊介绍: Journal of Signal Processing is an academic journal supervised by China Association for Science and Technology and sponsored by China Institute of Electronics. The journal is an academic journal that reflects the latest research results and technological progress in the field of signal processing and related disciplines. It covers academic papers and review articles on new theories, new ideas, and new technologies in the field of signal processing. The journal aims to provide a platform for academic exchanges for scientific researchers and engineering and technical personnel engaged in basic research and applied research in signal processing, thereby promoting the development of information science and technology. At present, the journal has been included in the three major domestic core journal databases "China Science Citation Database (CSCD), China Science and Technology Core Journals (CSTPCD), Chinese Core Journals Overview" and Coaj. It is also included in many foreign databases such as Scopus, CSA, EBSCO host, INSPEC, JST, etc.
期刊最新文献
A 4/sup N/-QAM adaptive decision device to mitigate I/Q imbalance and impairments caused by time-varying flat fading channels GMM and kernel-based speaker recognition with the ISIP toolkit Approximate leave-one-out error estimation for learning with smooth, strictly convex margin loss functions Speech enhancement by lateral inhibition and binaural masking A hybrid neural network/rule based system for bilingual text-to-phoneme mapping
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1