Accent level adjustment in bilingual Thai-English text-to-speech synthesis

2011 IEEE Workshop on Automatic Speech Recognition & Understanding Pub Date : 2011-12-01 DOI:10.1109/ASRU.2011.6163947

C. Wutiwiwatchai, A. Thangthai, A. Chotimongkol, C. Hansakunbuntheung, N. Thatphithakkul

引用次数: 9

Abstract

This paper introduces an accent level adjustment mechanism for Thai-English text-to-speech synthesis (TTS). English words often appearing in modern Thai writing can be speech synthesized by either Thai TTS using corresponding Thai phones or by separated English TTS using English phones. As many Thai native listeners may not prefer any of such extreme accent styles, a mechanism that allows selecting accent level preference is proposed. In HMM-based TTS, adjusting the accent level is done by interpolating HMMs of purely Thai and purely English sounds. Solutions for cross-language phone alignment and HMM state mapping are addressed. Evaluations are performed by a listening test on sounds synthesized with varied accent levels. Experimental results show that the proposed method is acceptable by the majority of human listeners.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

泰英双语文本-语音合成中的口音水平调整

本文介绍了一种用于泰英文本语音合成(TTS)的重音水平调整机制。现代泰语写作中经常出现的英语单词可以由泰语TTS使用相应的泰语电话合成，也可以由分开的英语TTS使用英语电话合成。由于许多泰语母语听众可能不喜欢这种极端的口音风格，因此提出了一种允许选择口音级别偏好的机制。在基于hmm的TTS中，通过插入纯泰语和纯英语语音的hmm来调整口音级别。解决了跨语言电话对齐和HMM状态映射。评估是通过对不同口音水平合成的声音进行听力测试来完成的。实验结果表明，所提出的方法被大多数人类听众所接受。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2011 IEEE Workshop on Automatic Speech Recognition & Understanding

自引率

0.00%

发文量

期刊最新文献

Applying feature bagging for more accurate and robust automated speaking assessment Towards choosing better primes for spoken dialog systems Accent level adjustment in bilingual Thai-English text-to-speech synthesis Fast speaker diarization using a high-level scripting language Evaluating prosodic features for automated scoring of non-native read speech