{"title":"Support Recovery in Mixture Models With Sparse Parameters","authors":"Arya Mazumdar;Soumyabrata Pal","doi":"10.1109/TIT.2024.3462937","DOIUrl":null,"url":null,"abstract":"Mixture models are widely used to fit complex and multimodal datasets. In this paper we study mixtures with high dimensional sparse parameter vectors and consider the problem of support recovery of those vectors. While parameter learning in mixture models is well-studied, the sparsity constraint remains relatively unexplored. Sparsity of parameter vectors is a natural assumption in high dimensional settings, and support recovery is a major step towards parameter estimation. We provide efficient algorithms for support recovery that have a logarithmic sample complexity dependence on the dimensionality of the latent space, and also poly-logarithmic dependence on sparsity. Our algorithms, applicable to mixtures of many different canonical distributions including high dimensional Uniform, Poisson, Laplace, Gaussians, etc., are based on the <italic>method of moments</i>. In most of these settings, our results are the first guarantees on the problem while in the rest, our results provide improvements on or are competitive with existing works.","PeriodicalId":13494,"journal":{"name":"IEEE Transactions on Information Theory","volume":"71 2","pages":"1184-1199"},"PeriodicalIF":2.2000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Information Theory","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10683776/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Mixture models are widely used to fit complex and multimodal datasets. In this paper we study mixtures with high dimensional sparse parameter vectors and consider the problem of support recovery of those vectors. While parameter learning in mixture models is well-studied, the sparsity constraint remains relatively unexplored. Sparsity of parameter vectors is a natural assumption in high dimensional settings, and support recovery is a major step towards parameter estimation. We provide efficient algorithms for support recovery that have a logarithmic sample complexity dependence on the dimensionality of the latent space, and also poly-logarithmic dependence on sparsity. Our algorithms, applicable to mixtures of many different canonical distributions including high dimensional Uniform, Poisson, Laplace, Gaussians, etc., are based on the method of moments. In most of these settings, our results are the first guarantees on the problem while in the rest, our results provide improvements on or are competitive with existing works.
期刊介绍:
The IEEE Transactions on Information Theory is a journal that publishes theoretical and experimental papers concerned with the transmission, processing, and utilization of information. The boundaries of acceptable subject matter are intentionally not sharply delimited. Rather, it is hoped that as the focus of research activity changes, a flexible policy will permit this Transactions to follow suit. Current appropriate topics are best reflected by recent Tables of Contents; they are summarized in the titles of editorial areas that appear on the inside front cover.