Sarah Flora Samson Juan, V. Edwin, Chai Yeen Cheong, Jun Choi Lee, A. Yeo
{"title":"马来语音节结构在伊班语和比达尤语音节语音合成器中的应用","authors":"Sarah Flora Samson Juan, V. Edwin, Chai Yeen Cheong, Jun Choi Lee, A. Yeo","doi":"10.1109/IALP.2011.21","DOIUrl":null,"url":null,"abstract":"Sarawak, Malaysia, has many under-resourced languages, which stands to become extinct if measures are not taken to preserve and maintain them. These languages are mostly spoken by the indigenous groups and not all of the languages are documented or studied. As an initiative to preserve, a Text to Speech (TTS) system has been built for Iban and Bidayuh languages, two out of 44 living languages in Sarawak. To expedite the development, we employed knowledge of closely-related language, i.e. Malay, which is the first language in Malaysia. In this paper, we employed a syllabification algorithm based on Malay syllable structure to build the Iban and Bidayuh syllable list and speech corpus. An accuracy test for the algorithm was conducted to determine the quality of the output from the TTS system using Categorical Estimation (CE). Test showed high percentage in accuracy and quality has a mean score of 3.07 out of 5, suggesting the approach works.","PeriodicalId":297167,"journal":{"name":"2011 International Conference on Asian Language Processing","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Adopting Malay Syllable Structure for Syllable Based Speech Synthesizer for Iban and Bidayuh Languages\",\"authors\":\"Sarah Flora Samson Juan, V. Edwin, Chai Yeen Cheong, Jun Choi Lee, A. Yeo\",\"doi\":\"10.1109/IALP.2011.21\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sarawak, Malaysia, has many under-resourced languages, which stands to become extinct if measures are not taken to preserve and maintain them. These languages are mostly spoken by the indigenous groups and not all of the languages are documented or studied. As an initiative to preserve, a Text to Speech (TTS) system has been built for Iban and Bidayuh languages, two out of 44 living languages in Sarawak. To expedite the development, we employed knowledge of closely-related language, i.e. Malay, which is the first language in Malaysia. In this paper, we employed a syllabification algorithm based on Malay syllable structure to build the Iban and Bidayuh syllable list and speech corpus. An accuracy test for the algorithm was conducted to determine the quality of the output from the TTS system using Categorical Estimation (CE). Test showed high percentage in accuracy and quality has a mean score of 3.07 out of 5, suggesting the approach works.\",\"PeriodicalId\":297167,\"journal\":{\"name\":\"2011 International Conference on Asian Language Processing\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 International Conference on Asian Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2011.21\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Asian Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2011.21","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adopting Malay Syllable Structure for Syllable Based Speech Synthesizer for Iban and Bidayuh Languages
Sarawak, Malaysia, has many under-resourced languages, which stands to become extinct if measures are not taken to preserve and maintain them. These languages are mostly spoken by the indigenous groups and not all of the languages are documented or studied. As an initiative to preserve, a Text to Speech (TTS) system has been built for Iban and Bidayuh languages, two out of 44 living languages in Sarawak. To expedite the development, we employed knowledge of closely-related language, i.e. Malay, which is the first language in Malaysia. In this paper, we employed a syllabification algorithm based on Malay syllable structure to build the Iban and Bidayuh syllable list and speech corpus. An accuracy test for the algorithm was conducted to determine the quality of the output from the TTS system using Categorical Estimation (CE). Test showed high percentage in accuracy and quality has a mean score of 3.07 out of 5, suggesting the approach works.