{"title":"Developing a hybrid language model for open vocabulary automatic speech recognition in a lecture speech task","authors":"Marc-Antoine Rondeau, R. Rose","doi":"10.1109/ISSPA.2012.6310464","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem of open vocabulary automatic speech recognition (ASR) using hybrid statistical language models (LMs). Hybrid LMs differ from closed vocabulary LMs in that the word level lexicon is augmented with an inventory of sub-lexical units (SLUs). The procedures used for selecting these SLUs and expanding out-of-vocabulary (OOV) words according to the SLUs is presented in the paper. The open-vocabulary ASR performance obtained using these techniques is presented for a lecture speech task domain.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310464","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper addresses the problem of open vocabulary automatic speech recognition (ASR) using hybrid statistical language models (LMs). Hybrid LMs differ from closed vocabulary LMs in that the word level lexicon is augmented with an inventory of sub-lexical units (SLUs). The procedures used for selecting these SLUs and expanding out-of-vocabulary (OOV) words according to the SLUs is presented in the paper. The open-vocabulary ASR performance obtained using these techniques is presented for a lecture speech task domain.