{"title":"Multiscale collaborative speech denoising based on deep stacking network","authors":"Wei Jiang, Hao Zheng, Shuai Nie, Wenju Liu","doi":"10.1109/IJCNN.2015.7280604","DOIUrl":null,"url":null,"abstract":"A growing number of noise reduction algorithms based on supervised learning have begun to emerge in recent years and show great promise. In this study, we focus on the problem of speech denoising at very low signal-to-noise ratio (SNR) conditions using artificial neural networks. The overall objective is to increase speech intelligibility in the presence of noise. Inspired by multitask learning (MTL), a novel framework based on deep stacking network (DSN) is proposed to do speech denoising at three different time-frequency scales simultaneously and collaboratively. Experiment results show that our algorithm outperforms a state-of-the-art method that is based on traditional deep neural network (DNN).","PeriodicalId":6539,"journal":{"name":"2015 International Joint Conference on Neural Networks (IJCNN)","volume":"60 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN.2015.7280604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
A growing number of noise reduction algorithms based on supervised learning have begun to emerge in recent years and show great promise. In this study, we focus on the problem of speech denoising at very low signal-to-noise ratio (SNR) conditions using artificial neural networks. The overall objective is to increase speech intelligibility in the presence of noise. Inspired by multitask learning (MTL), a novel framework based on deep stacking network (DSN) is proposed to do speech denoising at three different time-frequency scales simultaneously and collaboratively. Experiment results show that our algorithm outperforms a state-of-the-art method that is based on traditional deep neural network (DNN).