混合模型中基于分类的高效重标注。

IF 2.1 4区数学 Q1 STATISTICS & PROBABILITY American Statistician Pub Date : 2011-02-01 DOI:10.1198/tast.2011.10170

Andrew J Cron, Mike West

{"title":"混合模型中基于分类的高效重标注。","authors":"Andrew J Cron, Mike West","doi":"10.1198/tast.2011.10170","DOIUrl":null,"url":null,"abstract":"Effective component relabeling in Bayesian analyses of mixture models is critical to the routine use of mixtures in classification with analysis based on Markov chain Monte Carlo methods. The classification-based relabeling approach here is computationally attractive and statistically effective, and scales well with sample size and number of mixture components concordant with enabling routine analyses of increasingly large data sets. Building on the best of existing methods, practical relabeling aims to match data:component classification indicators in MCMC iterates with those of a defined reference mixture distribution. The method performs as well as or better than existing methods in small dimensional problems, while being practically superior in problems with larger data sets as the approach is scalable. We describe examples and computational benchmarks, and provide supporting code with efficient computational implementation of the algorithm that will be of use to others in practical applications of mixture models.","PeriodicalId":50801,"journal":{"name":"American Statistician","volume":"65 1","pages":"16-20"},"PeriodicalIF":2.1000,"publicationDate":"2011-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3110018/pdf/nihms269892.pdf","citationCount":"0","resultStr":"{\"title\":\"Efficient Classification-Based Relabeling in Mixture Models.\",\"authors\":\"Andrew J Cron, Mike West\",\"doi\":\"10.1198/tast.2011.10170\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Effective component relabeling in Bayesian analyses of mixture models is critical to the routine use of mixtures in classification with analysis based on Markov chain Monte Carlo methods. The classification-based relabeling approach here is computationally attractive and statistically effective, and scales well with sample size and number of mixture components concordant with enabling routine analyses of increasingly large data sets. Building on the best of existing methods, practical relabeling aims to match data:component classification indicators in MCMC iterates with those of a defined reference mixture distribution. The method performs as well as or better than existing methods in small dimensional problems, while being practically superior in problems with larger data sets as the approach is scalable. We describe examples and computational benchmarks, and provide supporting code with efficient computational implementation of the algorithm that will be of use to others in practical applications of mixture models.\",\"PeriodicalId\":50801,\"journal\":{\"name\":\"American Statistician\",\"volume\":\"65 1\",\"pages\":\"16-20\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2011-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3110018/pdf/nihms269892.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American Statistician\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1198/tast.2011.10170\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Statistician","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1198/tast.2011.10170","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

摘要

混合模型贝叶斯分析中有效的成分重标记对于混合模型在基于马尔可夫链蒙特卡罗方法的分类分析中的常规应用至关重要。基于分类的重新标记方法在计算上很有吸引力，在统计上也很有效，并且可以很好地扩展样本量和混合成分的数量，从而能够对越来越大的数据集进行常规分析。在现有最佳方法的基础上，实际的重标注旨在匹配数据：MCMC中的成分分类指标与定义的参考混合分布的成分分类指标进行迭代。该方法在小维度问题上的表现与现有方法一样好，甚至更好，同时由于该方法具有可扩展性，因此在具有较大数据集的问题上实际上更优越。我们描述了示例和计算基准，并提供了算法的有效计算实现的支持代码，这些代码将用于混合模型的实际应用中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Efficient Classification-Based Relabeling in Mixture Models.

Effective component relabeling in Bayesian analyses of mixture models is critical to the routine use of mixtures in classification with analysis based on Markov chain Monte Carlo methods. The classification-based relabeling approach here is computationally attractive and statistically effective, and scales well with sample size and number of mixture components concordant with enabling routine analyses of increasingly large data sets. Building on the best of existing methods, practical relabeling aims to match data:component classification indicators in MCMC iterates with those of a defined reference mixture distribution. The method performs as well as or better than existing methods in small dimensional problems, while being practically superior in problems with larger data sets as the approach is scalable. We describe examples and computational benchmarks, and provide supporting code with efficient computational implementation of the algorithm that will be of use to others in practical applications of mixture models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

American Statistician 数学-统计学与概率论

CiteScore

3.50

自引率

5.60%

发文量

审稿时长

>12 weeks

期刊介绍： Are you looking for general-interest articles about current national and international statistical problems and programs; interesting and fun articles of a general nature about statistics and its applications; or the teaching of statistics? Then you are looking for The American Statistician (TAS), published quarterly by the American Statistical Association. TAS contains timely articles organized into the following sections: Statistical Practice, General, Teacher''s Corner, History Corner, Interdisciplinary, Statistical Computing and Graphics, Reviews of Books and Teaching Materials, and Letters to the Editor.