TDOA information based VAD for robust speech recognition in directional and diffuse noise field
Authors: Kuan-Lang Huang, T. Chi
Published in: 2012 8th International Symposium on Chinese Spoken Language Processing (2012-12-01)
DOI: 10.1109/ISCSLP.2012.6423514 (https://doi.org/10.1109/ISCSLP.2012.6423514)
Citations: 1
Abstract
A two-microphone algorithm is proposed to improve automatic speech recognition (ASR) rates when target speech is corrupted by directional interference and diffuse noise simultaneously. The algorithm uses the time difference of arrival (TDOA) to suppress directional interference and a TDOA-information-based voice activity detector (VAD) to suppress diffuse noise. Simulation results show that the proposed algorithm improves ASR rates in a sound field containing both a directional interference and diffuse noise. Compared with the phase difference (PD) algorithm, the proposed method gives comparable recognition rates when facing a directional interference, and much higher and more robust recognition rates once diffuse noise is present.
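
The abstract does not say how the TDOA is estimated or how the VAD turns it into a speech/non-speech decision. As a rough illustration only, the sketch below assumes GCC-PHAT (a common two-microphone delay estimator, not necessarily the one used in the paper) and a hypothetical decision rule that flags a frame as target speech when its estimated TDOA lies close to an assumed target TDOA. The function names, `target_tau`, `tol`, and `max_tau` are all illustrative assumptions, not taken from the paper.

```python
import numpy as np


def gcc_phat_tdoa(x1, x2, fs, max_tau=None):
    """Estimate the delay of x2 relative to x1 in seconds using GCC-PHAT.

    Assumption: GCC-PHAT stands in for whatever TDOA estimator the paper uses.
    A positive result means the sound reached microphone 2 later.
    """
    n = len(x1) + len(x2)  # zero-pad to avoid circular wrap-around
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)
    cross = X2 * np.conj(X1)
    cross /= np.abs(cross) + 1e-12  # PHAT weighting: keep phase, drop magnitude
    cc = np.fft.irfft(cross, n=n)

    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(round(fs * max_tau)), max_shift))
    # Reorder so that lags run from -max_shift to +max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(cc)) - max_shift
    return shift / fs


def tdoa_vad(frames1, frames2, fs, target_tau=0.0, tol=1.0 / 16000, max_tau=0.001):
    """Hypothetical TDOA-based VAD: one boolean flag per frame pair.

    Assumption: a frame counts as target speech when its estimated TDOA is
    within `tol` seconds of `target_tau`. The paper's actual rule may differ.
    """
    flags = []
    for f1, f2 in zip(frames1, frames2):
        tau = gcc_phat_tdoa(f1, f2, fs, max_tau=max_tau)
        flags.append(abs(tau - target_tau) <= tol)
    return np.array(flags)
```

Under this rule, a diffuse-noise frame has no stable inter-microphone delay, so its estimated TDOA tends to scatter and fall outside the tolerance window, while a frame dominated by a directional interferer is rejected because its delay differs from `target_tau`. The tolerance and the target delay would need tuning to the microphone spacing and sampling rate.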