WordNet Based Sindhi Text to Speech Synthesis System

2010 Second International Conference on Computer Research and Development Pub Date : 2010-05-07 DOI:10.1109/ICCRD.2010.31

J. Mahar, G. Q. Memon, Syed Hyder Abbass Shah

{"title":"WordNet Based Sindhi Text to Speech Synthesis System","authors":"J. Mahar, G. Q. Memon, Syed Hyder Abbass Shah","doi":"10.1109/ICCRD.2010.31","DOIUrl":null,"url":null,"abstract":"The text-to-speech (TTS) synthesis technology enables machine to convert text into audible speech and used throughout the world to enhance the accessibility of the information. The important component of any TTS synthesis system is the database of sounds. In this study, three types of sound units i.e., phonemes, diphones and syllables are concatenated to produce natural sound for good quality Sindhi text to speech (STTS) system. The object of this paper consists in treating the phonemes, diphones and syllables under the aspect of the lexicon. The methodology used in STTS is to exploit acoustic representations of speech for synthesis, together with linguistic analyses of text. Sindhi is highly homographic language, the text is written without diacritics in real life applications, that creates lexical and morphological ambiguity. The problem of understating non-diacritic words can be solved using semantic knowledge. This paper describes a Sindhi TTS synthesis system that relies on a WordNet to identify the analogical relations between words in the text. The proposed approach is focused on the use of WordNet structures for the task of synthesis. The architecture and novel algorithm for STTS is proposed. The experiments using WordNet that show promising results and the accuracy of our proposed approach is acceptable.","PeriodicalId":158568,"journal":{"name":"2010 Second International Conference on Computer Research and Development","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Second International Conference on Computer Research and Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCRD.2010.31","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

The text-to-speech (TTS) synthesis technology enables machine to convert text into audible speech and used throughout the world to enhance the accessibility of the information. The important component of any TTS synthesis system is the database of sounds. In this study, three types of sound units i.e., phonemes, diphones and syllables are concatenated to produce natural sound for good quality Sindhi text to speech (STTS) system. The object of this paper consists in treating the phonemes, diphones and syllables under the aspect of the lexicon. The methodology used in STTS is to exploit acoustic representations of speech for synthesis, together with linguistic analyses of text. Sindhi is highly homographic language, the text is written without diacritics in real life applications, that creates lexical and morphological ambiguity. The problem of understating non-diacritic words can be solved using semantic knowledge. This paper describes a Sindhi TTS synthesis system that relies on a WordNet to identify the analogical relations between words in the text. The proposed approach is focused on the use of WordNet structures for the task of synthesis. The architecture and novel algorithm for STTS is proposed. The experiments using WordNet that show promising results and the accuracy of our proposed approach is acceptable.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于WordNet的信德语文本语音合成系统

文本到语音(TTS)合成技术使机器能够将文本转换为可听的语音，并在世界范围内使用，以提高信息的可及性。任何TTS合成系统的重要组成部分是声音数据库。在本研究中，三种类型的声音单位，即音素，双音和音节连接产生自然的声音，为优质的信德语文本到语音(STTS)系统。本文的目的在于从词汇的角度对音素、双音和音节进行分析。STTS中使用的方法是利用语音的声学表示进行合成，同时对文本进行语言分析。信德语是一种高度同源的语言，在现实生活中没有变音符，这就造成了词汇和形态上的歧义。非变音符词的理解问题可以用语义知识来解决。本文描述了一个基于WordNet的Sindhi TTS合成系统，该系统可以识别文本中单词之间的类比关系。所提出的方法侧重于使用WordNet结构来完成合成任务。提出了STTS的结构和新算法。使用WordNet进行的实验显示了令人满意的结果，我们提出的方法的准确性是可以接受的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2010 Second International Conference on Computer Research and Development

自引率

0.00%

发文量

期刊最新文献

Preparation and Characterization of NiCoFerrite Nanoparticles by Thermal Method Implementing the TLS Protocol on a Bare PC A Design of Grid Supported Services for Mobile Learning System Performance Comparison of Bow-Tie and Slot Antenna Based on RWG Edge Elements Simulation for RFID-Based Red Light Violation Detection: Violation Detection and Flow Prediction