{"title":"Improving recognition of syallabic units of Hindi languagae using combined features of Throat Microphone and Normal Microphone speech","authors":"N. Radha, A. Shahina, G. Vinoth, A. N. Khan","doi":"10.1109/ICCICCT.2014.6993171","DOIUrl":null,"url":null,"abstract":"The performance of Automatic Speech recognition system (ASR) built using close talk microphones degrades in noisy environments. AS R built using Throat Microphone (TM) speech shows relatively better performance under such adverse situations. However, some of the sounds are not well captured in TM. In this work we explore the combined use of Normal Microphone (NM) and TM features to improve the recognition rate of AS R. In the proposed work, the combined Mel-Frequency Cepstral Coefficients (MFCC) derived from the two signals are used to built an AS R in the HMM framework to recognize the 145 syllabic units of Indian language Hindi. The performance of this combined AS R system shows a significant improvement in performance when compared with individual AS R systems built using NM and TM features, respectively.","PeriodicalId":6615,"journal":{"name":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","volume":"9 1","pages":"1343-1348"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCICCT.2014.6993171","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
The performance of Automatic Speech recognition system (ASR) built using close talk microphones degrades in noisy environments. AS R built using Throat Microphone (TM) speech shows relatively better performance under such adverse situations. However, some of the sounds are not well captured in TM. In this work we explore the combined use of Normal Microphone (NM) and TM features to improve the recognition rate of AS R. In the proposed work, the combined Mel-Frequency Cepstral Coefficients (MFCC) derived from the two signals are used to built an AS R in the HMM framework to recognize the 145 syllabic units of Indian language Hindi. The performance of this combined AS R system shows a significant improvement in performance when compared with individual AS R systems built using NM and TM features, respectively.