Efficient multiple change point detection for high-dimensional generalized linear models

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY Canadian Journal of Statistics-Revue Canadienne De Statistique Pub Date : 2022-09-16 DOI:10.1002/cjs.11721

Xianru Wang, Bin Liu, Xinsheng Zhang, Yufeng Liu, for the Alzheimer's Disease Neuroimaging Initiative

{"title":"Efficient multiple change point detection for high-dimensional generalized linear models","authors":"Xianru Wang, Bin Liu, Xinsheng Zhang, Yufeng Liu, for the Alzheimer's Disease Neuroimaging Initiative","doi":"10.1002/cjs.11721","DOIUrl":null,"url":null,"abstract":"<p>Change point detection for high-dimensional data is an important yet challenging problem for many applications. In this article, we consider multiple change point detection in the context of high-dimensional generalized linear models, allowing the covariate dimension <math>\n <mrow>\n <mi>p</mi>\n </mrow></math> to grow exponentially with the sample size <math>\n <mrow>\n <mi>n</mi>\n </mrow></math>. The model considered is general and flexible in the sense that it covers various specific models as special cases. It can automatically account for the underlying data generation mechanism without specifying any prior knowledge about the number of change points. Based on dynamic programming and binary segmentation techniques, two algorithms are proposed to detect multiple change points, allowing the number of change points to grow with <math>\n <mrow>\n <mi>n</mi>\n </mrow></math>. To further improve the computational efficiency, a more efficient algorithm designed for the case of a single change point is proposed. We present theoretical properties of our proposed algorithms, including estimation consistency for the number and locations of change points as well as consistency and asymptotic distributions for the underlying regression coefficients. Finally, extensive simulation studies and application to the Alzheimer's Disease Neuroimaging Initiative data further demonstrate the competitive performance of our proposed methods.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"51 2","pages":"596-629"},"PeriodicalIF":0.8000,"publicationDate":"2022-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11721","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Journal of Statistics-Revue Canadienne De Statistique","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cjs.11721","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 2

Abstract

Change point detection for high-dimensional data is an important yet challenging problem for many applications. In this article, we consider multiple change point detection in the context of high-dimensional generalized linear models, allowing the covariate dimension $p$ to grow exponentially with the sample size $n$ . The model considered is general and flexible in the sense that it covers various specific models as special cases. It can automatically account for the underlying data generation mechanism without specifying any prior knowledge about the number of change points. Based on dynamic programming and binary segmentation techniques, two algorithms are proposed to detect multiple change points, allowing the number of change points to grow with $n$ . To further improve the computational efficiency, a more efficient algorithm designed for the case of a single change point is proposed. We present theoretical properties of our proposed algorithms, including estimation consistency for the number and locations of change points as well as consistency and asymptotic distributions for the underlying regression coefficients. Finally, extensive simulation studies and application to the Alzheimer's Disease Neuroimaging Initiative data further demonstrate the competitive performance of our proposed methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

高维广义线性模型的高效多变点检测

对于许多应用来说，高维数据的变化点检测是一个重要而又具有挑战性的问题。在本文中，我们考虑在高维广义线性模型背景下的多变化点检测，允许协变量维p随样本量n呈指数增长。所考虑的模型是通用的和灵活的，因为它将各种特定模型作为特殊情况涵盖。它可以自动解释底层数据生成机制，而无需指定任何关于更改点数量的先验知识。基于动态规划和二值分割技术，提出了两种检测多个变化点的算法，允许变化点的数量随n增长。为了进一步提高计算效率，提出了一种针对单变化点情况的高效算法。我们提出了我们提出的算法的理论性质，包括对变化点的数量和位置的估计一致性以及底层回归系数的一致性和渐近分布。最后，广泛的模拟研究和阿尔茨海默病神经成像倡议数据的应用进一步证明了我们提出的方法的竞争性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Canadian Journal of Statistics-Revue Canadienne De Statistique 数学-统计学与概率论

CiteScore

1.40

自引率

0.00%

发文量

审稿时长

>12 weeks

期刊介绍： The Canadian Journal of Statistics is the official journal of the Statistical Society of Canada. It has a reputation internationally as an excellent journal. The editorial board is comprised of statistical scientists with applied, computational, methodological, theoretical and probabilistic interests. Their role is to ensure that the journal continues to provide an international forum for the discipline of Statistics. The journal seeks papers making broad points of interest to many readers, whereas papers making important points of more specific interest are better placed in more specialized journals. The levels of innovation and impact are key in the evaluation of submitted manuscripts.

期刊最新文献

Issue Information True and false discoveries with independent and sequential e-values Issue Information Multiple change-point detection for regression curves Robust estimation of loss-based measures of model performance under covariate shift