Monaural speech separation based on linear regression optimized using gradient descent

2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) Pub Date : 2020-09-01 DOI:10.1109/ATSIP49331.2020.9231542

Belhedi Wiem, M. B. Messaoud, A. Bouzid

引用次数: 0

Abstract

Monaural speech separation (MSS) is useful for many real-world applications. In this work, we propose a novel method for MSS based on the observation that a composite speech signals can be modeled as the linear summation of each speaker with respect to participation coefficients. Hence, speech signals are separated using linear regression. Partial derivative with respect to each variable is then used to perform gradient descent in order to optimize the estimation and therefore the separation. The proposed speech separation method for is applicable to known speakers.The proposed method was assessed using metrics characterized by good correlation coefficients with subjective listening tests. Evaluation results reveal the effectiveness of the proposed approach.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于梯度下降优化线性回归的单耳语音分离

单耳语音分离(MSS)在许多实际应用中都很有用。在这项工作中，我们提出了一种新的MSS方法，该方法基于观察到复合语音信号可以建模为每个说话者关于参与系数的线性求和。因此，使用线性回归分离语音信号。然后使用对每个变量的偏导数来执行梯度下降，以优化估计，从而优化分离。所提出的语音分离方法适用于已知说话人。使用与主观听力测试具有良好相关系数的指标来评估所提出的方法。评价结果表明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 5th International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

自引率

0.00%

发文量

期刊最新文献

Automatic Recognition of Epileptiform EEG Abnormalities Using Machine Learning Approaches Generation of fuzzy evidence numbers for the evaluation of uncertainty measures Speckle Denoising of the Multipolarization Images by Hybrid Filters Identification of the user by using a hardware device Lightweight Hardware Architectures for the Piccolo Block Cipher in FPGA