标量时间序列聚类的一种测试方法

IF 1.2 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Journal of Time Series Analysis Pub Date : 2023-06-17 DOI:10.1111/jtsa.12706
Daniel Peña, Ruey S. Tsay
{"title":"标量时间序列聚类的一种测试方法","authors":"Daniel Peña,&nbsp;Ruey S. Tsay","doi":"10.1111/jtsa.12706","DOIUrl":null,"url":null,"abstract":"<p>This article considers clustering stationary scalar time series using their marginal properties and a hierarchical method. Two major issues involved are to detect the existence of clusters and to determine their number. We propose a new test statistic for detecting whether a data set consists of multiple clusters and a new procedure to determine the number of clusters. The proposed method is based on the jumps, that is, the increments, in the heights of the dendrogram when a hierarchical clustering is applied to the data. We use autoregressive sieve bootstrap to obtain a reference distribution of the test statistics and propose an iterative procedure to find the number of clusters. The clusters found are internally homogeneous according to the test statistics used in the analysis. The performance of the proposed procedure in finite samples is investigated by Monte Carlo simulations and illustrated by some empirical examples. Comparisons with some existing methods for selecting the number of clusters are also investigated.</p>","PeriodicalId":49973,"journal":{"name":"Journal of Time Series Analysis","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2023-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jtsa.12706","citationCount":"1","resultStr":"{\"title\":\"A testing approach to clustering scalar time series\",\"authors\":\"Daniel Peña,&nbsp;Ruey S. Tsay\",\"doi\":\"10.1111/jtsa.12706\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This article considers clustering stationary scalar time series using their marginal properties and a hierarchical method. Two major issues involved are to detect the existence of clusters and to determine their number. We propose a new test statistic for detecting whether a data set consists of multiple clusters and a new procedure to determine the number of clusters. The proposed method is based on the jumps, that is, the increments, in the heights of the dendrogram when a hierarchical clustering is applied to the data. We use autoregressive sieve bootstrap to obtain a reference distribution of the test statistics and propose an iterative procedure to find the number of clusters. The clusters found are internally homogeneous according to the test statistics used in the analysis. The performance of the proposed procedure in finite samples is investigated by Monte Carlo simulations and illustrated by some empirical examples. Comparisons with some existing methods for selecting the number of clusters are also investigated.</p>\",\"PeriodicalId\":49973,\"journal\":{\"name\":\"Journal of Time Series Analysis\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jtsa.12706\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Time Series Analysis\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/jtsa.12706\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Time Series Analysis","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jtsa.12706","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 1

摘要

本文利用平稳标量时间序列的边际性质和层次方法对其进行聚类。涉及的两个主要问题是检测集群的存在和确定集群的数量。我们提出了一种新的测试统计量来检测数据集是否由多个聚类组成,并提出了一个确定聚类数量的新过程。所提出的方法是基于对数据应用分层聚类时树状图高度的跳跃,即增量。我们使用自回归筛自举来获得测试统计的参考分布,并提出了一个迭代程序来找到聚类的数量。根据分析中使用的测试统计数据,发现的聚类在内部是同质的。通过蒙特卡洛模拟研究了所提出的程序在有限样本中的性能,并通过一些经验示例进行了说明。还研究了与一些现有的聚类数量选择方法的比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A testing approach to clustering scalar time series

This article considers clustering stationary scalar time series using their marginal properties and a hierarchical method. Two major issues involved are to detect the existence of clusters and to determine their number. We propose a new test statistic for detecting whether a data set consists of multiple clusters and a new procedure to determine the number of clusters. The proposed method is based on the jumps, that is, the increments, in the heights of the dendrogram when a hierarchical clustering is applied to the data. We use autoregressive sieve bootstrap to obtain a reference distribution of the test statistics and propose an iterative procedure to find the number of clusters. The clusters found are internally homogeneous according to the test statistics used in the analysis. The performance of the proposed procedure in finite samples is investigated by Monte Carlo simulations and illustrated by some empirical examples. Comparisons with some existing methods for selecting the number of clusters are also investigated.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Time Series Analysis
Journal of Time Series Analysis 数学-数学跨学科应用
CiteScore
2.00
自引率
0.00%
发文量
39
审稿时长
6-12 weeks
期刊介绍: During the last 30 years Time Series Analysis has become one of the most important and widely used branches of Mathematical Statistics. Its fields of application range from neurophysiology to astrophysics and it covers such well-known areas as economic forecasting, study of biological data, control systems, signal processing and communications and vibrations engineering. The Journal of Time Series Analysis started in 1980, has since become the leading journal in its field, publishing papers on both fundamental theory and applications, as well as review papers dealing with recent advances in major areas of the subject and short communications on theoretical developments. The editorial board consists of many of the world''s leading experts in Time Series Analysis.
期刊最新文献
Non‐causal and non‐invertible ARMA models: Identification, estimation and application in equity portfolios Mixing properties of non‐stationary multi‐variate count processes Mean‐preserving rounding integer‐valued ARMA models Forecasting the yield curve: the role of additional and time‐varying decay parameters, conditional heteroscedasticity, and macro‐economic factors Weighted discrete ARMA models for categorical time series
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1