{"title":"基于知识和倒谱特征的音素识别混合组合","authors":"Rudolph van der Merwe, J. du Preez","doi":"10.1109/COMSIG.1998.736923","DOIUrl":null,"url":null,"abstract":"A new, general, mathematically sound technique is developed to integrate knowledge-based information with standard cepstral features into the formal HMM framework for phoneme recognition. By using these hybrid features, the maximum amount of information contained in the speech signal can be utilised. It is shown that a trivial extension of the statistical models used to model the cepstral features, cannot be used to model the hybrid feature vectors, as this results in a decrease in phoneme recognition accuracy. By using the proposed hybrid technique though, a statistically significant increase in phoneme recognition accuracy is achieved.","PeriodicalId":294473,"journal":{"name":"Proceedings of the 1998 South African Symposium on Communications and Signal Processing-COMSIG '98 (Cat. No. 98EX214)","volume":"26 21","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Hybrid combination of knowledge- and cepstral-based features for phoneme recognition\",\"authors\":\"Rudolph van der Merwe, J. du Preez\",\"doi\":\"10.1109/COMSIG.1998.736923\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new, general, mathematically sound technique is developed to integrate knowledge-based information with standard cepstral features into the formal HMM framework for phoneme recognition. By using these hybrid features, the maximum amount of information contained in the speech signal can be utilised. It is shown that a trivial extension of the statistical models used to model the cepstral features, cannot be used to model the hybrid feature vectors, as this results in a decrease in phoneme recognition accuracy. By using the proposed hybrid technique though, a statistically significant increase in phoneme recognition accuracy is achieved.\",\"PeriodicalId\":294473,\"journal\":{\"name\":\"Proceedings of the 1998 South African Symposium on Communications and Signal Processing-COMSIG '98 (Cat. No. 98EX214)\",\"volume\":\"26 21\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1998 South African Symposium on Communications and Signal Processing-COMSIG '98 (Cat. No. 98EX214)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COMSIG.1998.736923\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1998 South African Symposium on Communications and Signal Processing-COMSIG '98 (Cat. No. 98EX214)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COMSIG.1998.736923","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hybrid combination of knowledge- and cepstral-based features for phoneme recognition
A new, general, mathematically sound technique is developed to integrate knowledge-based information with standard cepstral features into the formal HMM framework for phoneme recognition. By using these hybrid features, the maximum amount of information contained in the speech signal can be utilised. It is shown that a trivial extension of the statistical models used to model the cepstral features, cannot be used to model the hybrid feature vectors, as this results in a decrease in phoneme recognition accuracy. By using the proposed hybrid technique though, a statistically significant increase in phoneme recognition accuracy is achieved.