{"title":"Diphone编写的孟加拉语文本到语音的合成","authors":"M. Rashid, Md. Akter Hussain, M. S. Rahman","doi":"10.1109/ICCIT.2009.5407135","DOIUrl":null,"url":null,"abstract":"This paper presents methodologies involved in diphone preparation for Bangla text to speech synthesis. A concatenation based synthesis system comprises basically two modules- one is natural language processing and other is digital signal processing (DSP). Natural language processing implies converting text to its pronounceable text, called text normalization and the diphone selection method based on the normalized text is called Graphene to Phoneme (G2P) conversion. We developed a speech synthesizer for Bangla using diphone based concatenative approach. Diphone preparation, labeling and selection techniques are described in this paper.","PeriodicalId":443258,"journal":{"name":"2009 12th International Conference on Computers and Information Technology","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Diphone preparation for Bangla text to speech synthesis\",\"authors\":\"M. Rashid, Md. Akter Hussain, M. S. Rahman\",\"doi\":\"10.1109/ICCIT.2009.5407135\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents methodologies involved in diphone preparation for Bangla text to speech synthesis. A concatenation based synthesis system comprises basically two modules- one is natural language processing and other is digital signal processing (DSP). Natural language processing implies converting text to its pronounceable text, called text normalization and the diphone selection method based on the normalized text is called Graphene to Phoneme (G2P) conversion. We developed a speech synthesizer for Bangla using diphone based concatenative approach. Diphone preparation, labeling and selection techniques are described in this paper.\",\"PeriodicalId\":443258,\"journal\":{\"name\":\"2009 12th International Conference on Computers and Information Technology\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 12th International Conference on Computers and Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIT.2009.5407135\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 12th International Conference on Computers and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT.2009.5407135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Diphone preparation for Bangla text to speech synthesis
This paper presents methodologies involved in diphone preparation for Bangla text to speech synthesis. A concatenation based synthesis system comprises basically two modules- one is natural language processing and other is digital signal processing (DSP). Natural language processing implies converting text to its pronounceable text, called text normalization and the diphone selection method based on the normalized text is called Graphene to Phoneme (G2P) conversion. We developed a speech synthesizer for Bangla using diphone based concatenative approach. Diphone preparation, labeling and selection techniques are described in this paper.