A Method for Speech Dereverberation Based on an Image Deblurring Algorithm Using the Prior of Speech Magnitude Gradient Distribution in the Time-Frequency Domain
{"title":"A Method for Speech Dereverberation Based on an Image Deblurring Algorithm Using the Prior of Speech Magnitude Gradient Distribution in the Time-Frequency Domain","authors":"W. Jo, Ji-Won Cho, Changsoo Je, Hyung-Min Park","doi":"10.1145/2814940.2814992","DOIUrl":null,"url":null,"abstract":"We propose a speech dereverberation method in the time-frequency domain, based on an image deblurring algorithm. A reverberant speech magnitude can be modeled as a convolution of a clean speech with a reverberation filter in time-frequency domain. Then, dereverberation problem can be regarded as that of image deblurring. Therefore, the proposed method estimates the clean speech magnitude in the time-frequency domain by using the fast image deconvolution method with priors on sparsity of the clean speech magnitude gradient and exponentially decaying property of reverberation filters along the time axis. Then, scaling the reverberation speech magnitude by a mask obtained from the estimated clean one performs dereverberation. Experimental results show that the described method can enhance speech.","PeriodicalId":427567,"journal":{"name":"Proceedings of the 3rd International Conference on Human-Agent Interaction","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Conference on Human-Agent Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2814940.2814992","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a speech dereverberation method in the time-frequency domain, based on an image deblurring algorithm. A reverberant speech magnitude can be modeled as a convolution of a clean speech with a reverberation filter in time-frequency domain. Then, dereverberation problem can be regarded as that of image deblurring. Therefore, the proposed method estimates the clean speech magnitude in the time-frequency domain by using the fast image deconvolution method with priors on sparsity of the clean speech magnitude gradient and exponentially decaying property of reverberation filters along the time axis. Then, scaling the reverberation speech magnitude by a mask obtained from the estimated clean one performs dereverberation. Experimental results show that the described method can enhance speech.