S. Leow, T. S. Lau, Alvina Goh, Han Meng Peh, Teck Khim Ng, S. Siniscalchi, Chin-Hui Lee
{"title":"A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting","authors":"S. Leow, T. S. Lau, Alvina Goh, Han Meng Peh, Teck Khim Ng, S. Siniscalchi, Chin-Hui Lee","doi":"10.1109/ISCSLP.2012.6423455","DOIUrl":null,"url":null,"abstract":"In this paper, we present an acoustic keyword spotter that operates in two stages, detection and verification. In the detection stage, keywords are detected in the utterances, and in the verification stage, confidence measures are used to verify the detected keywords and reject false alarms. A new confidence measure, based on phoneme models trained on an Artificial Neural Network, is used in the verification stage to reduce false alarms. We have found that this ANN-based confidence, together with existing HMM-based confidence measures, is very effective in rejecting false alarms. Experiments are performed on two Mandarin databases and our results show that the proposed method is able to significantly reduce the number of false alarms.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"119 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 8th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2012.6423455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, we present an acoustic keyword spotter that operates in two stages, detection and verification. In the detection stage, keywords are detected in the utterances, and in the verification stage, confidence measures are used to verify the detected keywords and reject false alarms. A new confidence measure, based on phoneme models trained on an Artificial Neural Network, is used in the verification stage to reduce false alarms. We have found that this ANN-based confidence, together with existing HMM-based confidence measures, is very effective in rejecting false alarms. Experiments are performed on two Mandarin databases and our results show that the proposed method is able to significantly reduce the number of false alarms.