{"title":"Exploring the Effectiveness of Linear Matrix Factorizations After Nonlinear Processing","authors":"Bradley M. Whitaker, David V. Anderson","doi":"10.1109/IETC47856.2020.9249107","DOIUrl":null,"url":null,"abstract":"In this paper, we explore the weaknesses of using sparse coding and nonnegative matrix factorization (NMF) on data that has already been processed in a nonlinear manner. The underlying assumption of matrix factorization techniques is that an input signal can be represented as a linear combination of some set of features. This is a valid assumption in many feature extraction tasks, including several audio applications such as source separation and sound scene analysis. However, sparse coding and NMF are often used on data known to be composed of a nonlinear combination of features, such as the magnitude of an audio spectrum. This paper uses two synthetic datasets to probe the ability of linear sparse coding and NMF to discover known features that are combined nonlinearly. Even in a small dataset, common nonlinearities cause interference that prevents the algorithms from recovering the known features. However, we validate the use of NMF and sparse coding in audio applications by demonstrating that the factorization process is more effective at recovering hidden features in a dataset with harmonic structure. Finally, we show that the reconstruction error associated with modeling either dataset can be reduced by taking into account the behavior of a known nonlinearity.","PeriodicalId":186446,"journal":{"name":"2020 Intermountain Engineering, Technology and Computing (IETC)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Intermountain Engineering, Technology and Computing (IETC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IETC47856.2020.9249107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we explore the weaknesses of using sparse coding and nonnegative matrix factorization (NMF) on data that has already been processed in a nonlinear manner. The underlying assumption of matrix factorization techniques is that an input signal can be represented as a linear combination of some set of features. This is a valid assumption in many feature extraction tasks, including several audio applications such as source separation and sound scene analysis. However, sparse coding and NMF are often used on data known to be composed of a nonlinear combination of features, such as the magnitude of an audio spectrum. This paper uses two synthetic datasets to probe the ability of linear sparse coding and NMF to discover known features that are combined nonlinearly. Even in a small dataset, common nonlinearities cause interference that prevents the algorithms from recovering the known features. However, we validate the use of NMF and sparse coding in audio applications by demonstrating that the factorization process is more effective at recovering hidden features in a dataset with harmonic structure. Finally, we show that the reconstruction error associated with modeling either dataset can be reduced by taking into account the behavior of a known nonlinearity.