Some clustering-based change-point detection methods applicable to high dimension, low sample size data

IF 0.8 4区数学 Q3 STATISTICS & PROBABILITY Journal of Statistical Planning and Inference Pub Date : 2024-07-16 DOI:10.1016/j.jspi.2024.106212

Trisha Dawn , Angshuman Roy , Alokesh Manna , Anil K. Ghosh

引用次数: 0

Abstract

Detection of change-points in a sequence of high dimensional observations is a challenging problem, and this becomes even more challenging when the sample size (i.e., the sequence length) is small. In this article, we propose some change-point detection methods based on clustering, which can be conveniently used in such high dimension, low sample size situations. First, we consider the single change-point problem. Using $k$ -means clustering based on a suitable dissimilarity measures, we propose some methods for testing the existence of a change-point and estimating its location. High dimensional behavior of these proposed methods are investigated under appropriate regularity conditions. Next, we extend our methods for detection of multiple change-points. We carry out extensive numerical studies and analyze a real data set to compare the performance of our proposed methods with some state-of-the-art methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一些适用于高维度、低样本量数据的基于聚类的变化点检测方法

在高维观测序列中检测变化点是一个具有挑战性的问题，而当样本量（即序列长度）较小时，这个问题就变得更具挑战性。在本文中，我们提出了一些基于聚类的变化点检测方法，可以方便地用于这种高维度、低样本量的情况。首先，我们考虑单个变化点问题。利用基于合适的异或度量的均值聚类，我们提出了一些检测变化点是否存在并估计其位置的方法。在适当的正则条件下，我们对这些方法的高维行为进行了研究。接下来，我们扩展了检测多个变化点的方法。我们进行了大量的数值研究，并分析了一个真实数据集，将我们提出的方法与一些最先进的方法进行了性能比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Statistical Planning and Inference 数学-统计学与概率论

CiteScore

2.10

自引率

11.10%

发文量

审稿时长

3-6 weeks

期刊介绍： The Journal of Statistical Planning and Inference offers itself as a multifaceted and all-inclusive bridge between classical aspects of statistics and probability, and the emerging interdisciplinary aspects that have a potential of revolutionizing the subject. While we maintain our traditional strength in statistical inference, design, classical probability, and large sample methods, we also have a far more inclusive and broadened scope to keep up with the new problems that confront us as statisticians, mathematicians, and scientists. We publish high quality articles in all branches of statistics, probability, discrete mathematics, machine learning, and bioinformatics. We also especially welcome well written and up to date review articles on fundamental themes of statistics, probability, machine learning, and general biostatistics. Thoughtful letters to the editors, interesting problems in need of a solution, and short notes carrying an element of elegance or beauty are equally welcome.