{"title":"文本-音素映射中双向输入上下文依赖的递归神经网络","authors":"E. B. Bilcu, J. Astola, J. Saarinen","doi":"10.1109/ISCCSP.2004.1296463","DOIUrl":null,"url":null,"abstract":"Among many neural network architectures that exist in the literature, the recurrent neural networks (RNN's) are of special interest due to their ability to deal with spatial temporal problems. However, in an earlier published paper, the authors shown that RNN's have poor performance in terms of phoneme accuracy when applied to the specific problem of converting text streams into their phonetic transcriptions. This is due to the fact that RNN's contains a weak left side context dependence between letters and the right side context dependence is not included. In this paper, we study the behavior of RNN that includes the context information between adjacent letters at the input. The results in terms of phoneme accuracy, for the RNN with both side input context dependence, multilayer perception and RNN, in the context of text-to-phoneme mapping, are shown.","PeriodicalId":146713,"journal":{"name":"First International Symposium on Control, Communications and Signal Processing, 2004.","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Recurrent neural network with both side input context dependence for text-to-phoneme mapping\",\"authors\":\"E. B. Bilcu, J. Astola, J. Saarinen\",\"doi\":\"10.1109/ISCCSP.2004.1296463\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Among many neural network architectures that exist in the literature, the recurrent neural networks (RNN's) are of special interest due to their ability to deal with spatial temporal problems. However, in an earlier published paper, the authors shown that RNN's have poor performance in terms of phoneme accuracy when applied to the specific problem of converting text streams into their phonetic transcriptions. This is due to the fact that RNN's contains a weak left side context dependence between letters and the right side context dependence is not included. In this paper, we study the behavior of RNN that includes the context information between adjacent letters at the input. The results in terms of phoneme accuracy, for the RNN with both side input context dependence, multilayer perception and RNN, in the context of text-to-phoneme mapping, are shown.\",\"PeriodicalId\":146713,\"journal\":{\"name\":\"First International Symposium on Control, Communications and Signal Processing, 2004.\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"First International Symposium on Control, Communications and Signal Processing, 2004.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCCSP.2004.1296463\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"First International Symposium on Control, Communications and Signal Processing, 2004.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCCSP.2004.1296463","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recurrent neural network with both side input context dependence for text-to-phoneme mapping
Among many neural network architectures that exist in the literature, the recurrent neural networks (RNN's) are of special interest due to their ability to deal with spatial temporal problems. However, in an earlier published paper, the authors shown that RNN's have poor performance in terms of phoneme accuracy when applied to the specific problem of converting text streams into their phonetic transcriptions. This is due to the fact that RNN's contains a weak left side context dependence between letters and the right side context dependence is not included. In this paper, we study the behavior of RNN that includes the context information between adjacent letters at the input. The results in terms of phoneme accuracy, for the RNN with both side input context dependence, multilayer perception and RNN, in the context of text-to-phoneme mapping, are shown.