高维数据的多变化点检测

IF 1.2 4区 数学 Q2 STATISTICS & PROBABILITY Test Pub Date : 2024-03-25 DOI:10.1007/s11749-024-00926-w
Wenbiao Zhao, Lixing Zhu, Falong Tan
{"title":"高维数据的多变化点检测","authors":"Wenbiao Zhao, Lixing Zhu, Falong Tan","doi":"10.1007/s11749-024-00926-w","DOIUrl":null,"url":null,"abstract":"<p>This research investigates the detection of multiple change points in high-dimensional data without particular sparse or dense structure, where the dimension can be of exponential order in relation to the sample size. The estimation approach proposed employs a signal statistic based on a sequence of signal screening-based local U-statistics. This technique avoids costly computations that exhaustive search algorithms require and mitigates false positives, which hypothesis testing-based methods need to control. Consistency of estimation can be achieved for both the locations and number of change points, even when the number of change points diverges at a certain rate as the sample size increases. Additionally, the visualization nature of the proposed approach makes plotting the signal statistic a useful tool to identify locations of change points, which distinguishes it from existing methods in the literature. Numerical studies are performed to evaluate the effectiveness of the proposed technique in finite sample scenarios, and a real data analysis is presented to illustrate its application.\n</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"19 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiple change point detection for high-dimensional data\",\"authors\":\"Wenbiao Zhao, Lixing Zhu, Falong Tan\",\"doi\":\"10.1007/s11749-024-00926-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This research investigates the detection of multiple change points in high-dimensional data without particular sparse or dense structure, where the dimension can be of exponential order in relation to the sample size. The estimation approach proposed employs a signal statistic based on a sequence of signal screening-based local U-statistics. This technique avoids costly computations that exhaustive search algorithms require and mitigates false positives, which hypothesis testing-based methods need to control. Consistency of estimation can be achieved for both the locations and number of change points, even when the number of change points diverges at a certain rate as the sample size increases. Additionally, the visualization nature of the proposed approach makes plotting the signal statistic a useful tool to identify locations of change points, which distinguishes it from existing methods in the literature. Numerical studies are performed to evaluate the effectiveness of the proposed technique in finite sample scenarios, and a real data analysis is presented to illustrate its application.\\n</p>\",\"PeriodicalId\":51189,\"journal\":{\"name\":\"Test\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Test\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1007/s11749-024-00926-w\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Test","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s11749-024-00926-w","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

摘要

本研究探讨了在无特定稀疏或密集结构的高维数据中检测多个变化点的问题,其中维数可能是与样本大小相关的指数阶。所提出的估计方法采用了基于信号筛选的局部 U 统计序列的信号统计。这种技术避免了穷举搜索算法所需的昂贵计算,并减少了基于假设检验的方法需要控制的假阳性。即使随着样本量的增加,变化点的数量以一定的速度发生变化,也能实现对变化点位置和数量的一致性估计。此外,所提方法的可视化特性使绘制信号统计图成为识别变化点位置的有用工具,这使其有别于文献中的现有方法。我们进行了数值研究,以评估拟议技术在有限样本情况下的有效性,并通过实际数据分析来说明其应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Multiple change point detection for high-dimensional data

This research investigates the detection of multiple change points in high-dimensional data without particular sparse or dense structure, where the dimension can be of exponential order in relation to the sample size. The estimation approach proposed employs a signal statistic based on a sequence of signal screening-based local U-statistics. This technique avoids costly computations that exhaustive search algorithms require and mitigates false positives, which hypothesis testing-based methods need to control. Consistency of estimation can be achieved for both the locations and number of change points, even when the number of change points diverges at a certain rate as the sample size increases. Additionally, the visualization nature of the proposed approach makes plotting the signal statistic a useful tool to identify locations of change points, which distinguishes it from existing methods in the literature. Numerical studies are performed to evaluate the effectiveness of the proposed technique in finite sample scenarios, and a real data analysis is presented to illustrate its application.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Test
Test 数学-统计学与概率论
CiteScore
2.20
自引率
7.70%
发文量
41
审稿时长
>12 weeks
期刊介绍: TEST is an international journal of Statistics and Probability, sponsored by the Spanish Society of Statistics and Operations Research. English is the official language of the journal. The emphasis of TEST is placed on papers containing original theoretical contributions of direct or potential value in applications. In this respect, the methodological contents are considered to be crucial for the papers published in TEST, but the practical implications of the methodological aspects are also relevant. Original sound manuscripts on either well-established or emerging areas in the scope of the journal are welcome. One volume is published annually in four issues. In addition to the regular contributions, each issue of TEST contains an invited paper from a world-wide recognized outstanding statistician on an up-to-date challenging topic, including discussions.
期刊最新文献
Jackknife empirical likelihood for the correlation coefficient with additive distortion measurement errors Nonparametric conditional survival function estimation and plug-in bandwidth selection with multiple covariates Higher-order spatial autoregressive varying coefficient model: estimation and specification test Composite quantile estimation in partially functional linear regression model with randomly censored responses Bayesian inference and cure rate modeling for event history data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1