{"title":"斯洛伐克LVCSR的高级语言建模技术评估","authors":"D. Zlacký, J. Staš, J. Juhár, A. Cizmár","doi":"10.1109/RADIOELEK.2015.7129007","DOIUrl":null,"url":null,"abstract":"In this paper we compare several advanced language modeling techniques for the Slovak continuous speech recognition. Five different language modeling techniques were analyzed, considering their model size and perplexity, speech recognition performance and complexity of their usage in real conditions of speech recognition in Slovak. The preliminary experimental results show that the convenient n-gram models smoothed by the Witten-Bell back-off algorithm produce the best performance according to the model perplexity and recognition accuracy. Other modeling techniques including Maximum Entropy, Power Law Discounting, Hierarchical Pitman-Yor process, or Variable-order Kneser-Ney smoothed models achieved better results only in the model perplexity. However, the increased computational requirements and worse recognition performance limit their usage in the real speech recognition tasks in Slovak.","PeriodicalId":193275,"journal":{"name":"2015 25th International Conference Radioelektronika (RADIOELEKTRONIKA)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluation of advanced language modeling techniques for the Slovak LVCSR\",\"authors\":\"D. Zlacký, J. Staš, J. Juhár, A. Cizmár\",\"doi\":\"10.1109/RADIOELEK.2015.7129007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we compare several advanced language modeling techniques for the Slovak continuous speech recognition. Five different language modeling techniques were analyzed, considering their model size and perplexity, speech recognition performance and complexity of their usage in real conditions of speech recognition in Slovak. The preliminary experimental results show that the convenient n-gram models smoothed by the Witten-Bell back-off algorithm produce the best performance according to the model perplexity and recognition accuracy. Other modeling techniques including Maximum Entropy, Power Law Discounting, Hierarchical Pitman-Yor process, or Variable-order Kneser-Ney smoothed models achieved better results only in the model perplexity. However, the increased computational requirements and worse recognition performance limit their usage in the real speech recognition tasks in Slovak.\",\"PeriodicalId\":193275,\"journal\":{\"name\":\"2015 25th International Conference Radioelektronika (RADIOELEKTRONIKA)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 25th International Conference Radioelektronika (RADIOELEKTRONIKA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RADIOELEK.2015.7129007\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 25th International Conference Radioelektronika (RADIOELEKTRONIKA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RADIOELEK.2015.7129007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Evaluation of advanced language modeling techniques for the Slovak LVCSR
In this paper we compare several advanced language modeling techniques for the Slovak continuous speech recognition. Five different language modeling techniques were analyzed, considering their model size and perplexity, speech recognition performance and complexity of their usage in real conditions of speech recognition in Slovak. The preliminary experimental results show that the convenient n-gram models smoothed by the Witten-Bell back-off algorithm produce the best performance according to the model perplexity and recognition accuracy. Other modeling techniques including Maximum Entropy, Power Law Discounting, Hierarchical Pitman-Yor process, or Variable-order Kneser-Ney smoothed models achieved better results only in the model perplexity. However, the increased computational requirements and worse recognition performance limit their usage in the real speech recognition tasks in Slovak.