Title: Time-frequency masks for monaural speech separation: A comparative review
Authors: Belhedi Wiem, M. B. Ben Messaoud, Bouzid Aicha
Venue: 2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)
Publication date: 2016-12-01
DOI: 10.1109/SETIT.2016.7939911 (https://doi.org/10.1109/SETIT.2016.7939911)
Citation count: 0
Abstract
In this paper, we present a comparative analysis of the time-frequency (T-F) masking techniques used for single-channel speech separation (SCSS). We survey the T-F masking concept and compare different types of masks against several criteria. The comparison is conducted theoretically, through mathematical study, and numerically, through objective and subjective assessment. We also study the effect of the masking techniques on the perceptual quality of speech and their ability to separate a target speech signal from a monaural mixture.
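The abstract does not say which masks the paper compares. As a purely illustrative sketch of the T-F masking idea, the Python snippet below computes two masks that are widely discussed in the separation literature, the ideal binary mask (IBM) and the ideal ratio mask (IRM), on toy magnitude spectrograms. The function names, the 0 dB threshold, the random data, and the additive mixture model are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero and log(0)


def ideal_binary_mask(target_mag, interferer_mag, threshold_db=0.0):
    """Hard mask: 1 where the local target-to-interferer ratio (in dB)
    exceeds threshold_db, 0 elsewhere. threshold_db=0.0 is an assumed default."""
    local_ratio_db = 20.0 * np.log10((target_mag + EPS) / (interferer_mag + EPS))
    return (local_ratio_db > threshold_db).astype(float)


def ideal_ratio_mask(target_mag, interferer_mag):
    """Soft mask in [0, 1]: share of the energy in each T-F unit
    that belongs to the target."""
    target_energy = target_mag ** 2
    interferer_energy = interferer_mag ** 2
    return target_energy / (target_energy + interferer_energy + EPS)


# Toy magnitude spectrograms (frequency bins x time frames), not real speech.
rng = np.random.default_rng(0)
target_mag = rng.random((4, 5))
interferer_mag = rng.random((4, 5))

ibm = ideal_binary_mask(target_mag, interferer_mag)
irm = ideal_ratio_mask(target_mag, interferer_mag)

# Applying a mask means scaling the mixture magnitude unit by unit.
# Summing magnitudes is a crude stand-in for a real mixture spectrogram.
mixture_mag = target_mag + interferer_mag
estimate_ibm = ibm * mixture_mag
estimate_irm = irm * mixture_mag
```

Both masks here are "ideal" in the sense that they are computed from the separate sources, which are unavailable in practice; a real single-channel system would have to estimate the mask from the mixture alone.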