Feature masking in an embedded Mandarin speech recognition system
Authors: Yuezhong Tang, Xia Wang, Yang Cao, Feng Ding
Venue: 2004 International Symposium on Chinese Spoken Language Processing
Published: 2004-12-15
DOI: 10.1109/CHINSL.2004.1409632 (https://doi.org/10.1109/CHINSL.2004.1409632)
Citation count: 2
Abstract
In this paper, we explored a feature component masking scheme for embedded tonal language recognition systems, with the goal of reducing computational complexity while keeping the loss in recognition accuracy to a minimum. We ran extensive experiments on a Mandarin isolated word recognition task with a tone-confusable vocabulary. Taking both clean and noisy conditions into account, we found a masking scheme that filtered out 31 of the 54 feature components, leaving 23, and still outperformed the 54-component baseline while requiring substantially less computation and memory. The results showed that feature masking is a promising approach to complexity reduction in embedded tonal language recognition systems. They also supported the effectiveness of higher-order cepstral coefficients for tonal language recognition, since most of these coefficients were preserved in the feature masking experiments.
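To make the idea of component masking concrete, the following is a minimal Python sketch, not the authors' implementation. Only the dimensions (54 components, 31 masked) come from the abstract. Which components are dropped is not stated there, so the mask below is chosen at random purely for illustration. The diagonal-covariance Gaussian scorer is likewise an assumption about the acoustic model, and all function names are hypothetical.

```python
import math
import random

NUM_COMPONENTS = 54  # full feature dimension (from the abstract)
NUM_MASKED = 31      # components filtered out (from the abstract)


def make_mask(masked_indices, dim=NUM_COMPONENTS):
    """Return a list of booleans, True where a component is kept."""
    masked = set(masked_indices)
    return [i not in masked for i in range(dim)]


def apply_mask(vector, mask):
    """Drop the masked components from a feature or parameter vector."""
    return [x for x, keep in zip(vector, mask) if keep]


def diag_gaussian_loglik(frame, mean, var):
    """Log-likelihood under a diagonal-covariance Gaussian (assumed model).

    The cost is linear in the number of components, so masking
    components shrinks both the arithmetic and the stored parameters.
    """
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (x - m) ** 2 / v
        for x, m, v in zip(frame, mean, var)
    )


# Hypothetical mask: the abstract does not say which components were
# removed, so 31 indices are picked at random for illustration only.
rng = random.Random(0)
mask = make_mask(rng.sample(range(NUM_COMPONENTS), NUM_MASKED))

# Synthetic frame and model parameters, standing in for real features.
frame = [rng.gauss(0.0, 1.0) for _ in range(NUM_COMPONENTS)]
mean = [0.0] * NUM_COMPONENTS
var = [1.0] * NUM_COMPONENTS

reduced_frame = apply_mask(frame, mask)
reduced_mean = apply_mask(mean, mask)
reduced_var = apply_mask(var, mask)

full_score = diag_gaussian_loglik(frame, mean, var)
reduced_score = diag_gaussian_loglik(reduced_frame, reduced_mean, reduced_var)

kept = len(reduced_frame)             # components that survive the mask
saving = 1.0 - kept / NUM_COMPONENTS  # fraction of per-Gaussian work removed
```

Under these assumptions, 23 components survive the mask, and the per-frame, per-Gaussian scoring cost and the stored mean and variance parameters shrink by 31/54, roughly 57%. Whether accuracy holds up depends entirely on which components are masked, which is the question the experiments in the paper address.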