印度文本到语音系统:一个简短的调查

2022 International Conference on Connected Systems & Intelligence (CSI) Pub Date : 2022-08-31 DOI:10.1109/CSI54720.2022.9924085

Jayashree Nair, Akhila Krishnan, Vrinda S

{"title":"印度文本到语音系统:一个简短的调查","authors":"Jayashree Nair, Akhila Krishnan, Vrinda S","doi":"10.1109/CSI54720.2022.9924085","DOIUrl":null,"url":null,"abstract":"Speech and spoken words have always played a key role in everyday life. Speech synthesis is a means of artificially synthesizing speech, whereas text-to-speech (TTS) is a technology that converts written text in a human language into an analogous spoken waveform [speech form].The written form is represented by the text, a sequence of characters, whereas the verbal form is represented by the speech. TTS synthesizers are computer-based systems that read text out loud. The TTS system is divided into two phases: text processing and speech creation. Despite the availability of several TTS systems in various languages, Indian languages continue to lag behind in terms of producing high-quality speech. Acceptability and intelligibility are used to rate the quality of speech. The main objective of this paper is to perform a study on available text-to-speech technologies in Indian languages.","PeriodicalId":221137,"journal":{"name":"2022 International Conference on Connected Systems & Intelligence (CSI)","volume":"2009 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Indian Text to Speech Systems: A Short Survey\",\"authors\":\"Jayashree Nair, Akhila Krishnan, Vrinda S\",\"doi\":\"10.1109/CSI54720.2022.9924085\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech and spoken words have always played a key role in everyday life. Speech synthesis is a means of artificially synthesizing speech, whereas text-to-speech (TTS) is a technology that converts written text in a human language into an analogous spoken waveform [speech form].The written form is represented by the text, a sequence of characters, whereas the verbal form is represented by the speech. TTS synthesizers are computer-based systems that read text out loud. The TTS system is divided into two phases: text processing and speech creation. Despite the availability of several TTS systems in various languages, Indian languages continue to lag behind in terms of producing high-quality speech. Acceptability and intelligibility are used to rate the quality of speech. The main objective of this paper is to perform a study on available text-to-speech technologies in Indian languages.\",\"PeriodicalId\":221137,\"journal\":{\"name\":\"2022 International Conference on Connected Systems & Intelligence (CSI)\",\"volume\":\"2009 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Connected Systems & Intelligence (CSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSI54720.2022.9924085\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Connected Systems & Intelligence (CSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSI54720.2022.9924085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

演讲和口语在日常生活中一直扮演着关键的角色。语音合成是一种人工合成语音的手段，而文本到语音(TTS)是一种将人类语言中的书面文本转换为类似的口头波形[语音形式]的技术。书面形式由文本(一串字符)表示，而口头形式由言语表示。TTS合成器是基于计算机的系统，可以大声朗读文本。TTS系统分为两个阶段:文本处理和语音生成。尽管有几种不同语言的TTS系统，但印度语言在产生高质量语音方面仍然落后。可接受性和可理解性是用来评价语音质量的。本文的主要目的是对印度语言中可用的文本到语音技术进行研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Indian Text to Speech Systems: A Short Survey

Speech and spoken words have always played a key role in everyday life. Speech synthesis is a means of artificially synthesizing speech, whereas text-to-speech (TTS) is a technology that converts written text in a human language into an analogous spoken waveform [speech form].The written form is represented by the text, a sequence of characters, whereas the verbal form is represented by the speech. TTS synthesizers are computer-based systems that read text out loud. The TTS system is divided into two phases: text processing and speech creation. Despite the availability of several TTS systems in various languages, Indian languages continue to lag behind in terms of producing high-quality speech. Acceptability and intelligibility are used to rate the quality of speech. The main objective of this paper is to perform a study on available text-to-speech technologies in Indian languages.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Conference on Connected Systems & Intelligence (CSI)

自引率

0.00%

发文量