Ranny Ranny, D. Lestari, Tati Latifah Erawati Rajab, I. Suwardi
{"title":"基于非负矩阵分解的重叠声分离","authors":"Ranny Ranny, D. Lestari, Tati Latifah Erawati Rajab, I. Suwardi","doi":"10.1109/ISRITI48646.2019.9034580","DOIUrl":null,"url":null,"abstract":"One of the most common problems in sound recognition is the overlapping sound. This phenomena requires sound separation beforehand in order to be recognized. Most studies related to sound separation used artificial data in their research, i.e. using experiment sound data from a controlled environment which is augmented with one or more sound types, and achieve good results. However, when it is implemented in the real condition, it’s performance has dropped dramatically. Thus, in this research we use overlapping data recorded in real environments. The purpose of this research is to separate the speech and non-speech, and noise by using the Non-negative Matrix Factorization (NMF). Our experimental results show that the NMF works well when separating sound and non-sound, and has helped the performance of sound recognition.","PeriodicalId":367363,"journal":{"name":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Separation of Overlapping Sound using Nonnegative Matrix Factorization\",\"authors\":\"Ranny Ranny, D. Lestari, Tati Latifah Erawati Rajab, I. Suwardi\",\"doi\":\"10.1109/ISRITI48646.2019.9034580\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the most common problems in sound recognition is the overlapping sound. This phenomena requires sound separation beforehand in order to be recognized. Most studies related to sound separation used artificial data in their research, i.e. using experiment sound data from a controlled environment which is augmented with one or more sound types, and achieve good results. However, when it is implemented in the real condition, it’s performance has dropped dramatically. Thus, in this research we use overlapping data recorded in real environments. The purpose of this research is to separate the speech and non-speech, and noise by using the Non-negative Matrix Factorization (NMF). Our experimental results show that the NMF works well when separating sound and non-sound, and has helped the performance of sound recognition.\",\"PeriodicalId\":367363,\"journal\":{\"name\":\"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISRITI48646.2019.9034580\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI48646.2019.9034580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Separation of Overlapping Sound using Nonnegative Matrix Factorization
One of the most common problems in sound recognition is the overlapping sound. This phenomena requires sound separation beforehand in order to be recognized. Most studies related to sound separation used artificial data in their research, i.e. using experiment sound data from a controlled environment which is augmented with one or more sound types, and achieve good results. However, when it is implemented in the real condition, it’s performance has dropped dramatically. Thus, in this research we use overlapping data recorded in real environments. The purpose of this research is to separate the speech and non-speech, and noise by using the Non-negative Matrix Factorization (NMF). Our experimental results show that the NMF works well when separating sound and non-sound, and has helped the performance of sound recognition.