Control concepts for articulatory speech synthesis

Speech Synthesis Workshop Pub Date : 1900-01-01 DOI:10.22028/D291-23506

P. Birkholz, I. Steiner, S. Breuer

引用次数: 14

Abstract

We present two concepts for the generation of gestural scores to control an articulatory speech synthesizer. Gestural scores are the common input to the synthesizer and constitute an organized pattern of articulatory gestures. The first concept generates the gestures for an utterance using the phonetic transcriptions, phone durations, and intonation commands predicted by the Bonn Open Synthesis System (BOSS) from an arbitrary input text. This concept extends the synthesizer to a text-to-speech synthesis system. The idea of the second concept is to use timing information extracted from Electromagnetic Articulography signals to generate the articulatory gestures. Therefore, it is a concept for the re-synthesis of natural utterances. Finally, application prospects for the presented synthesizer are discussed.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

发音语音合成的控制概念

我们提出了两个概念，用于生成手势分数来控制发音语音合成器。手势分数是合成器的共同输入，构成了发音手势的有组织的模式。第一个概念使用波恩开放合成系统(BOSS)从任意输入文本中预测的语音转录、电话持续时间和语调命令来生成话语的手势。这个概念将合成器扩展为文本到语音的合成系统。第二个概念的思想是利用从电磁发音信号中提取的定时信息来生成发音手势。因此，它是一个对自然话语进行再合成的概念。最后，对该合成器的应用前景进行了展望。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Speech Synthesis Workshop

自引率

0.00%

发文量