Audio signal delay estimation using partial whitening
K. D. Donohue, A. Agrinsoni, J. Hannemann
Proceedings 2007 IEEE SoutheastCon, 2007-03-22
DOI: 10.1109/SECON.2007.342946 (https://doi.org/10.1109/SECON.2007.342946)
Citation count: 10
Abstract
This work examines time-domain and frequency-domain implementations for estimating delays between acoustic signals arriving at spatially distributed microphones. A parametric variant of the phase-only transform (PHAT) is introduced for partially whitening the signal before estimating the delay. The variant is referred to as the PHAT-beta and is shown to be advantageous when processing signals corrupted by both independent noise and reverberation. Simulations show superior performance for the time-domain implementation under conditions of independent noise for time-limited broadband signals: it achieves low estimation errors at signal-to-noise ratios 8 to 13 dB lower than those required by the frequency-domain implementation. Extensive Monte Carlo simulations are also performed for the time-domain delay estimator using the PHAT-beta on speech signals corrupted by reverberation and independent noise. Performance metrics include the percentage of anomalous detections and the root mean square estimation error. Results show that partial whitening leads to significant improvements over both no whitening and total whitening (the standard PHAT). Simulations indicate that robust performance can be achieved for beta values near 0.4 when both reverberation and independent noise are present.
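
To make the idea of partial whitening concrete, here is a minimal sketch of a delay estimator with a tunable whitening exponent. The abstract does not give the PHAT-beta formula, so this assumes each spectrum is divided by its magnitude raised to the power beta. The function name `phat_beta_delay`, the `eps` guard, and the test signal are all hypothetical. The sketch whitens in the frequency domain with an FFT for brevity; it is not the authors' time-domain implementation and makes no attempt to reproduce their results.

```python
import numpy as np


def phat_beta_delay(x1, x2, fs, beta=0.4, eps=1e-12):
    """Return the estimated delay of x2 relative to x1, in seconds.

    Hypothetical helper for illustration; not taken from the paper.
    beta = 0 gives plain cross-correlation, beta = 1 gives the standard PHAT.
    """
    len1, len2 = len(x1), len(x2)
    n = len1 + len2 - 1                 # length needed for linear correlation
    nfft = 1 << (n - 1).bit_length()    # next power of two >= n

    X1 = np.fft.rfft(x1, nfft)
    X2 = np.fft.rfft(x2, nfft)

    # Assumed PHAT-beta form: divide each spectrum by |X|**beta.
    # eps guards against division by zero in empty bins.
    W1 = X1 / (np.abs(X1) + eps) ** beta
    W2 = X2 / (np.abs(X2) + eps) ** beta

    # Cross-correlation of the partially whitened signals.
    cc = np.fft.irfft(W2 * np.conj(W1), nfft)

    # Reorder so that lags run from -(len1 - 1) to +(len2 - 1).
    cc = np.concatenate((cc[nfft - (len1 - 1):], cc[:len2]))
    lags = np.arange(-(len1 - 1), len2)

    return lags[np.argmax(cc)] / fs


# Synthetic example: a broadband signal and a delayed, noisy copy.
rng = np.random.default_rng(0)
fs = 16000                       # sample rate in Hz (arbitrary choice)
true_lag = 7                     # delay in samples (arbitrary choice)
s = rng.standard_normal(1024)
x1 = s + 0.05 * rng.standard_normal(s.size)
x2 = np.concatenate((np.zeros(true_lag), s[:-true_lag]))
x2 = x2 + 0.05 * rng.standard_normal(s.size)

print(phat_beta_delay(x1, x2, fs) * fs)   # estimated delay in samples
```

Setting `beta` to 0 or 1 in this sketch recovers the two extremes that the abstract compares against: no whitening and total whitening.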