{"title":"The training of Slovak speech recognition system based on Sphinx 4 for GSM networks","authors":"J. Vojtko, J. Kacur, G. Rozinaj","doi":"10.1109/ELMAR.2007.4418818","DOIUrl":null,"url":null,"abstract":"In the submitted paper we present the training process of HMM models that are designed to be used in ASR systems employed in GSM networks. First a brief overview regarding the current problems and applications of ASR systems is given, followed by the description of MOBILDAT-SK speech database and the SPHINX 4 and SphitixTrain capabilities. Then the process of HMM models training is presented utilizing the facility of the SphinxTrain system adjusted for the structure of MOBILDAT database and the Slovak language. The article is concluded by presenting the achieved results using the tools of the SHINX 4 by the means of 3 types of tests: application words, isolated digits, and looped digits. The WER for the looped digits and CD phoneme models is 1.8% which is roughly comparable to the performance of other systems.","PeriodicalId":170000,"journal":{"name":"ELMAR 2007","volume":"515 1-2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ELMAR 2007","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELMAR.2007.4418818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
In the submitted paper we present the training process of HMM models that are designed to be used in ASR systems employed in GSM networks. First a brief overview regarding the current problems and applications of ASR systems is given, followed by the description of MOBILDAT-SK speech database and the SPHINX 4 and SphitixTrain capabilities. Then the process of HMM models training is presented utilizing the facility of the SphinxTrain system adjusted for the structure of MOBILDAT database and the Slovak language. The article is concluded by presenting the achieved results using the tools of the SHINX 4 by the means of 3 types of tests: application words, isolated digits, and looped digits. The WER for the looped digits and CD phoneme models is 1.8% which is roughly comparable to the performance of other systems.