{"title":"Joint system for speech separation from speaking and non-speaking background, and de-reverberation: Application on real-world recordings","authors":"Belhedi Wiem, M. B. Messaoud, A. Bouzid","doi":"10.1109/ICFSP.2017.8097055","DOIUrl":null,"url":null,"abstract":"In real-life environment, the speech of interest is often correlated with different kinds of perturbation. Perturbation can be caused by speaking or non-speaking noise, or even by reverberation. This could make the speech signal auditable but not intelligible. In this case, speech cannot be exploited by other automated applications such as voice-command or speech/speaker identification and identification. Extracting a meaningful signal of good quality is a bigger challenge in monaural case. In this paper, we propose an extensible full joint system that deals with real-environment perturbations that include speaking and non-speaking background as well as reverberation. After introducing the input signal, a decision is taken on the process to opt for. The main chain blocks of the proposed system are speech denoising, speech separation and speech de-reverberation. The system operates in single channel case in a fully unsupervised manner. Furthermore, it requires minimal information about the reference signal. As the system is targeting real-time enhancement, results evaluation is conduct in terms of non-intrusive metrics in addition to intrusive metrics. The evaluation results prove the effectiveness of the proposed system in cancelling difficult noise types, in extracting desired speaker from speaking background, and in enhancing reverberated speech.","PeriodicalId":382413,"journal":{"name":"2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFSP.2017.8097055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In real-life environment, the speech of interest is often correlated with different kinds of perturbation. Perturbation can be caused by speaking or non-speaking noise, or even by reverberation. This could make the speech signal auditable but not intelligible. In this case, speech cannot be exploited by other automated applications such as voice-command or speech/speaker identification and identification. Extracting a meaningful signal of good quality is a bigger challenge in monaural case. In this paper, we propose an extensible full joint system that deals with real-environment perturbations that include speaking and non-speaking background as well as reverberation. After introducing the input signal, a decision is taken on the process to opt for. The main chain blocks of the proposed system are speech denoising, speech separation and speech de-reverberation. The system operates in single channel case in a fully unsupervised manner. Furthermore, it requires minimal information about the reference signal. As the system is targeting real-time enhancement, results evaluation is conduct in terms of non-intrusive metrics in addition to intrusive metrics. The evaluation results prove the effectiveness of the proposed system in cancelling difficult noise types, in extracting desired speaker from speaking background, and in enhancing reverberated speech.