{"title":"Real-time microphone array processing for sound source separation and localization","authors":"Longji Sun, Qi Cheng","doi":"10.1109/CISS.2013.6552257","DOIUrl":null,"url":null,"abstract":"In this paper, the problem of sound source separation and localization is studied using a microphone array. A pure delay mixture model which is typical in outdoor environments is adopted. Our proposed approach utilizes the subspace method to estimate the directions of arrival (DOAs) of the sources from the collected mixtures. Since sound signals are generally considered broadband, the DOA estimates for a source at different frequencies are used to approximate the probability density function of the DOA. The maximum likelihood criterion is used to determine the final DOA estimate for the source. Using the estimated DOAs, the corresponding mixing and demixing matrices in the frequency domain are computed, and the source signals are recovered using the inverse short time Fourier transform (STFT). Our algorithm inherits the robustness to noise of the subspace method and also supports real-time implementation. Comprehensive simulations and experiments have been conducted to examine various aspects of the algorithm.","PeriodicalId":268095,"journal":{"name":"2013 47th Annual Conference on Information Sciences and Systems (CISS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 47th Annual Conference on Information Sciences and Systems (CISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISS.2013.6552257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In this paper, the problem of sound source separation and localization is studied using a microphone array. A pure delay mixture model which is typical in outdoor environments is adopted. Our proposed approach utilizes the subspace method to estimate the directions of arrival (DOAs) of the sources from the collected mixtures. Since sound signals are generally considered broadband, the DOA estimates for a source at different frequencies are used to approximate the probability density function of the DOA. The maximum likelihood criterion is used to determine the final DOA estimate for the source. Using the estimated DOAs, the corresponding mixing and demixing matrices in the frequency domain are computed, and the source signals are recovered using the inverse short time Fourier transform (STFT). Our algorithm inherits the robustness to noise of the subspace method and also supports real-time implementation. Comprehensive simulations and experiments have been conducted to examine various aspects of the algorithm.