Efficient multiple change point detection for high-dimensional generalized linear models

Pub Date : 2022-09-16 DOI:10.1002/cjs.11721
Xianru Wang, Bin Liu, Xinsheng Zhang, Yufeng Liu, for the Alzheimer's Disease Neuroimaging Initiative
{"title":"Efficient multiple change point detection for high-dimensional generalized linear models","authors":"Xianru Wang,&nbsp;Bin Liu,&nbsp;Xinsheng Zhang,&nbsp;Yufeng Liu,&nbsp;for the Alzheimer's Disease Neuroimaging Initiative","doi":"10.1002/cjs.11721","DOIUrl":null,"url":null,"abstract":"<p>Change point detection for high-dimensional data is an important yet challenging problem for many applications. In this article, we consider multiple change point detection in the context of high-dimensional generalized linear models, allowing the covariate dimension <math>\n <mrow>\n <mi>p</mi>\n </mrow></math> to grow exponentially with the sample size <math>\n <mrow>\n <mi>n</mi>\n </mrow></math>. The model considered is general and flexible in the sense that it covers various specific models as special cases. It can automatically account for the underlying data generation mechanism without specifying any prior knowledge about the number of change points. Based on dynamic programming and binary segmentation techniques, two algorithms are proposed to detect multiple change points, allowing the number of change points to grow with <math>\n <mrow>\n <mi>n</mi>\n </mrow></math>. To further improve the computational efficiency, a more efficient algorithm designed for the case of a single change point is proposed. We present theoretical properties of our proposed algorithms, including estimation consistency for the number and locations of change points as well as consistency and asymptotic distributions for the underlying regression coefficients. Finally, extensive simulation studies and application to the Alzheimer's Disease Neuroimaging Initiative data further demonstrate the competitive performance of our proposed methods.</p>","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11721","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cjs.11721","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Change point detection for high-dimensional data is an important yet challenging problem for many applications. In this article, we consider multiple change point detection in the context of high-dimensional generalized linear models, allowing the covariate dimension p to grow exponentially with the sample size n . The model considered is general and flexible in the sense that it covers various specific models as special cases. It can automatically account for the underlying data generation mechanism without specifying any prior knowledge about the number of change points. Based on dynamic programming and binary segmentation techniques, two algorithms are proposed to detect multiple change points, allowing the number of change points to grow with n . To further improve the computational efficiency, a more efficient algorithm designed for the case of a single change point is proposed. We present theoretical properties of our proposed algorithms, including estimation consistency for the number and locations of change points as well as consistency and asymptotic distributions for the underlying regression coefficients. Finally, extensive simulation studies and application to the Alzheimer's Disease Neuroimaging Initiative data further demonstrate the competitive performance of our proposed methods.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
高维广义线性模型的高效多变点检测
对于许多应用来说,高维数据的变化点检测是一个重要而又具有挑战性的问题。在本文中,我们考虑在高维广义线性模型背景下的多变化点检测,允许协变量维p随样本量n呈指数增长。所考虑的模型是通用的和灵活的,因为它将各种特定模型作为特殊情况涵盖。它可以自动解释底层数据生成机制,而无需指定任何关于更改点数量的先验知识。基于动态规划和二值分割技术,提出了两种检测多个变化点的算法,允许变化点的数量随n增长。为了进一步提高计算效率,提出了一种针对单变化点情况的高效算法。我们提出了我们提出的算法的理论性质,包括对变化点的数量和位置的估计一致性以及底层回归系数的一致性和渐近分布。最后,广泛的模拟研究和阿尔茨海默病神经成像倡议数据的应用进一步证明了我们提出的方法的竞争性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1