{"title":"A microphone array beamforming-based system for multi-talker speech separation","authors":"Adel Hidri, H. Amiri","doi":"10.1504/IJSISE.2016.078257","DOIUrl":null,"url":null,"abstract":"This paper presents a Multichannel Speech Separation System (MCSS) based on new beamforming frequency domain method. The beamformer exploits the spatial properties of the source signals using a microphone array. Therefore, it is based on a prior knowledge of the position of the speakers relative to the array. The proposed beamformer is defined with two processing steps: the first one is to keep a unit gain of the desired signal and the other blocks the wanted signal and minimises the output power of the interferences within only one step. In order to separate multiple speakers, multiple beamformers are used simultaneously, where a beamformer is computed for each source considering the remaining sources as interferers. We test and evaluate the proposed MCSS on real recording mixtures extracted from 'Multichannel In-Car Speech Database'. The experimental results proved the effectiveness of the proposed system in terms of speech separation. The quality of speech will be improved compared to the state-of-the-art.","PeriodicalId":56359,"journal":{"name":"International Journal of Signal and Imaging Systems Engineering","volume":"9 1","pages":"209"},"PeriodicalIF":0.6000,"publicationDate":"2016-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJSISE.2016.078257","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Signal and Imaging Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJSISE.2016.078257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents a Multichannel Speech Separation System (MCSS) based on new beamforming frequency domain method. The beamformer exploits the spatial properties of the source signals using a microphone array. Therefore, it is based on a prior knowledge of the position of the speakers relative to the array. The proposed beamformer is defined with two processing steps: the first one is to keep a unit gain of the desired signal and the other blocks the wanted signal and minimises the output power of the interferences within only one step. In order to separate multiple speakers, multiple beamformers are used simultaneously, where a beamformer is computed for each source considering the remaining sources as interferers. We test and evaluate the proposed MCSS on real recording mixtures extracted from 'Multichannel In-Car Speech Database'. The experimental results proved the effectiveness of the proposed system in terms of speech separation. The quality of speech will be improved compared to the state-of-the-art.