{"title":"Encoder Re-training with Mixture Signals on FastMVAE Method","authors":"Shuhei Yamaji, Taishi Nakashima, Nobutaka Ono, Li Li, H. Kameoka","doi":"10.23919/APSIPAASC55919.2022.9979843","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a new network training to improve the source separation performance of the fast multichannel variational autoencoder (FastMVAE) method. The FastMVAE method is very effective for supervised source separation. It also significantly reduces the processing time by replacing the backpropagation steps in the MVAE method with a single forward propagation of the encoder for estimating latent variables. In previous studies, the encoder is trained together with the decoder using clean speech. In contrast, in this study, we re-train only the encoder by using the mixed signals with the decoder fixed. More specifically, using the imperfectly separated signals obtained in the process of the source separation algorithm, we train the encoder to find the optimal latent variables that minimize the objective function for source separation. Experimental results show that the proposed method reduces the objective function at almost every iteration and achieves higher separation performance than the conventional method.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/APSIPAASC55919.2022.9979843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose a new network training to improve the source separation performance of the fast multichannel variational autoencoder (FastMVAE) method. The FastMVAE method is very effective for supervised source separation. It also significantly reduces the processing time by replacing the backpropagation steps in the MVAE method with a single forward propagation of the encoder for estimating latent variables. In previous studies, the encoder is trained together with the decoder using clean speech. In contrast, in this study, we re-train only the encoder by using the mixed signals with the decoder fixed. More specifically, using the imperfectly separated signals obtained in the process of the source separation algorithm, we train the encoder to find the optimal latent variables that minimize the objective function for source separation. Experimental results show that the proposed method reduces the objective function at almost every iteration and achieves higher separation performance than the conventional method.