{"title":"使用灵活语境和范畴矩阵的上下文依赖的字素-音素评价语料库","authors":"C. Hansakunbuntheung, Sumonmas Thatphithakkul","doi":"10.1109/ICSDA.2015.7357884","DOIUrl":null,"url":null,"abstract":"Context-dependent pronunciation, e.g. homographs, is a difficult grapheme-to-phoneme conversion (G2P) issue. It causes accuracy downgrade in speech synthesis and speech recognition. However, the context-dependent pronunciation issue is rarely considered in collecting pronunciation corpus for evaluating accuracy of G2P. Thus, this paper proposes a context-dependent pronunciation corpus using grapheme-phoneme pairs with their context information for G2P assessment. The context information includes 1) Categorial Matrix for representing orthographic types and usage domains of orthographic groups (OG). Categorial Matrix is designed to investigate problem categories in the G2P. 2) regular-expression-based flexible context for representing context variation. 3) OG Classes for representing interchangeable OGs in the flexible context. The flexible context and the word classes are designed to remove redundant contexts while covering context variation with minimal sets of patterns. By using the proposed corpus, automatic context generation for G2P evaluation can be implemented.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Context-dependent grapheme-to-phoneme evaluation corpus using flexible contexts and Categorial Matrix\",\"authors\":\"C. Hansakunbuntheung, Sumonmas Thatphithakkul\",\"doi\":\"10.1109/ICSDA.2015.7357884\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Context-dependent pronunciation, e.g. homographs, is a difficult grapheme-to-phoneme conversion (G2P) issue. It causes accuracy downgrade in speech synthesis and speech recognition. However, the context-dependent pronunciation issue is rarely considered in collecting pronunciation corpus for evaluating accuracy of G2P. Thus, this paper proposes a context-dependent pronunciation corpus using grapheme-phoneme pairs with their context information for G2P assessment. The context information includes 1) Categorial Matrix for representing orthographic types and usage domains of orthographic groups (OG). Categorial Matrix is designed to investigate problem categories in the G2P. 2) regular-expression-based flexible context for representing context variation. 3) OG Classes for representing interchangeable OGs in the flexible context. The flexible context and the word classes are designed to remove redundant contexts while covering context variation with minimal sets of patterns. By using the proposed corpus, automatic context generation for G2P evaluation can be implemented.\",\"PeriodicalId\":290790,\"journal\":{\"name\":\"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSDA.2015.7357884\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSDA.2015.7357884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Context-dependent grapheme-to-phoneme evaluation corpus using flexible contexts and Categorial Matrix
Context-dependent pronunciation, e.g. homographs, is a difficult grapheme-to-phoneme conversion (G2P) issue. It causes accuracy downgrade in speech synthesis and speech recognition. However, the context-dependent pronunciation issue is rarely considered in collecting pronunciation corpus for evaluating accuracy of G2P. Thus, this paper proposes a context-dependent pronunciation corpus using grapheme-phoneme pairs with their context information for G2P assessment. The context information includes 1) Categorial Matrix for representing orthographic types and usage domains of orthographic groups (OG). Categorial Matrix is designed to investigate problem categories in the G2P. 2) regular-expression-based flexible context for representing context variation. 3) OG Classes for representing interchangeable OGs in the flexible context. The flexible context and the word classes are designed to remove redundant contexts while covering context variation with minimal sets of patterns. By using the proposed corpus, automatic context generation for G2P evaluation can be implemented.