{"title":"音乐播放过程中多麦克风设备声源跟踪的在线期望最大化算法","authors":"D. Giacobello","doi":"10.23919/EUSIPCO.2018.8553331","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an expectation-maximization algorithm to perform online tracking of moving sources around multi-microphone devices. We are particularly targeting the application scenario of distant-talking control of a music playback device. The goal is to perform spatial tracking of the moving sources and to estimate the probability that each of these sources is active. In particular, we use the expectation-maximization algorithm to capture the statistical behavior of the feature space representing the ensemble of sources as a Gaussian mixture model, assigning each Gaussian component to an individual acoustic source. The features used exploit a wide range of information on the sources behavior making the system robust to noise, reverberation, and music playback. We then differentiate between desired and interfering sources. The spatial information and activity level is then determined for each desired source. Experimental evaluation of a real acoustic source tracking problem with and without music playback shows promising results for the proposed approach.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Online Expectation-Maximization Algorithm for Tracking Acoustic Sources in Multi-Microphone Devices During Music Playback\",\"authors\":\"D. Giacobello\",\"doi\":\"10.23919/EUSIPCO.2018.8553331\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose an expectation-maximization algorithm to perform online tracking of moving sources around multi-microphone devices. We are particularly targeting the application scenario of distant-talking control of a music playback device. The goal is to perform spatial tracking of the moving sources and to estimate the probability that each of these sources is active. In particular, we use the expectation-maximization algorithm to capture the statistical behavior of the feature space representing the ensemble of sources as a Gaussian mixture model, assigning each Gaussian component to an individual acoustic source. The features used exploit a wide range of information on the sources behavior making the system robust to noise, reverberation, and music playback. We then differentiate between desired and interfering sources. The spatial information and activity level is then determined for each desired source. Experimental evaluation of a real acoustic source tracking problem with and without music playback shows promising results for the proposed approach.\",\"PeriodicalId\":303069,\"journal\":{\"name\":\"2018 26th European Signal Processing Conference (EUSIPCO)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 26th European Signal Processing Conference (EUSIPCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/EUSIPCO.2018.8553331\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 26th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/EUSIPCO.2018.8553331","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Online Expectation-Maximization Algorithm for Tracking Acoustic Sources in Multi-Microphone Devices During Music Playback
In this paper, we propose an expectation-maximization algorithm to perform online tracking of moving sources around multi-microphone devices. We are particularly targeting the application scenario of distant-talking control of a music playback device. The goal is to perform spatial tracking of the moving sources and to estimate the probability that each of these sources is active. In particular, we use the expectation-maximization algorithm to capture the statistical behavior of the feature space representing the ensemble of sources as a Gaussian mixture model, assigning each Gaussian component to an individual acoustic source. The features used exploit a wide range of information on the sources behavior making the system robust to noise, reverberation, and music playback. We then differentiate between desired and interfering sources. The spatial information and activity level is then determined for each desired source. Experimental evaluation of a real acoustic source tracking problem with and without music playback shows promising results for the proposed approach.