Ranny Ranny, D. Lestari, Tati Latifah Erawati Rajab, I. Suwardi
{"title":"Separation of Overlapping Sound using Nonnegative Matrix Factorization","authors":"Ranny Ranny, D. Lestari, Tati Latifah Erawati Rajab, I. Suwardi","doi":"10.1109/ISRITI48646.2019.9034580","DOIUrl":null,"url":null,"abstract":"One of the most common problems in sound recognition is the overlapping sound. This phenomena requires sound separation beforehand in order to be recognized. Most studies related to sound separation used artificial data in their research, i.e. using experiment sound data from a controlled environment which is augmented with one or more sound types, and achieve good results. However, when it is implemented in the real condition, it’s performance has dropped dramatically. Thus, in this research we use overlapping data recorded in real environments. The purpose of this research is to separate the speech and non-speech, and noise by using the Non-negative Matrix Factorization (NMF). Our experimental results show that the NMF works well when separating sound and non-sound, and has helped the performance of sound recognition.","PeriodicalId":367363,"journal":{"name":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI48646.2019.9034580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
One of the most common problems in sound recognition is the overlapping sound. This phenomena requires sound separation beforehand in order to be recognized. Most studies related to sound separation used artificial data in their research, i.e. using experiment sound data from a controlled environment which is augmented with one or more sound types, and achieve good results. However, when it is implemented in the real condition, it’s performance has dropped dramatically. Thus, in this research we use overlapping data recorded in real environments. The purpose of this research is to separate the speech and non-speech, and noise by using the Non-negative Matrix Factorization (NMF). Our experimental results show that the NMF works well when separating sound and non-sound, and has helped the performance of sound recognition.