{"title":"极值指数估计的阈值选择","authors":"Natalia M. Markovich, Igor V. Rodionov","doi":"10.1080/10485252.2023.2266050","DOIUrl":null,"url":null,"abstract":"ABSTRACTWe propose a new threshold selection method for nonparametric estimation of the extremal index of stochastic processes. The discrepancy method was proposed as a data-driven smoothing tool for estimation of a probability density function. Now it is modified to select a threshold parameter of an extremal index estimator. A modification of the discrepancy statistic based on the Cramér–von Mises–Smirnov statistic ω2 is calculated by k largest order statistics instead of an entire sample. Its asymptotic distribution as k→∞ is proved to coincide with the ω2-distribution. Its quantiles are used as discrepancy values. The convergence rate of an extremal index estimate coupled with the discrepancy method is derived. The discrepancy method is used as an automatic threshold selection for the intervals and K-gaps estimators. It may be applied to other estimators of the extremal index. The performance of our method is evaluated by simulated and real data examples.KEYWORDS: Cramér–von Mises–Smirnov statisticdiscrepancy methodextremal indexnonparametric estimationthreshold selectionAMS SUBJECT CLASSIFICATION:: 62G32 Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1 The connection between (Equation1(1) ωn2=n∫−∞∞(Fn(x)−F(x))2dF(x)(1) ) and (Equation2(2) ω^n2(h)=∑i=1n(F^h(Xi,n)−i−0.5n)2+112n(2) ) can be found in Markovich (Citation2007, p. 81).2 Theoretically, events {Ti=1} are allowed. In practice, such cases related to single inter-arrival times between consecutive exceedances are meaningless.3 The modification (ω^n2−0.4/n+0.6/n2)(1+1/n) of classical statistic (Equation2(2) ω^n2(h)=∑i=1n(F^h(Xi,n)−i−0.5n)2+112n(2) ) eliminates the dependence of the percentage points of the C–M–S statistic on the sample size (Stephens Citation1974). For n>40 it changes the statistic on less than one percent. One can use the modification with regard to ω~L2(θ^) for finite L due to the closeness of its distribution to the limit distribution of the C–M–S statistic by Theorem 3.2.Additional informationFundingThe work of N.M. Markovich in Sections 1, 2, 4 and 5 was supported by the Russian Science Foundation [grant number 22-21-00177]. The work of I. V. Rodionov in Section 3 and proofs in Markovich and Rodionov (Citation2022) was performed at the Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of Sciences with the support of the Russian Science Foundation (grant No. 21-71-00035).","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"27 1","pages":"0"},"PeriodicalIF":0.8000,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Threshold selection for extremal index estimation\",\"authors\":\"Natalia M. Markovich, Igor V. Rodionov\",\"doi\":\"10.1080/10485252.2023.2266050\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACTWe propose a new threshold selection method for nonparametric estimation of the extremal index of stochastic processes. The discrepancy method was proposed as a data-driven smoothing tool for estimation of a probability density function. Now it is modified to select a threshold parameter of an extremal index estimator. A modification of the discrepancy statistic based on the Cramér–von Mises–Smirnov statistic ω2 is calculated by k largest order statistics instead of an entire sample. Its asymptotic distribution as k→∞ is proved to coincide with the ω2-distribution. Its quantiles are used as discrepancy values. The convergence rate of an extremal index estimate coupled with the discrepancy method is derived. The discrepancy method is used as an automatic threshold selection for the intervals and K-gaps estimators. It may be applied to other estimators of the extremal index. The performance of our method is evaluated by simulated and real data examples.KEYWORDS: Cramér–von Mises–Smirnov statisticdiscrepancy methodextremal indexnonparametric estimationthreshold selectionAMS SUBJECT CLASSIFICATION:: 62G32 Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1 The connection between (Equation1(1) ωn2=n∫−∞∞(Fn(x)−F(x))2dF(x)(1) ) and (Equation2(2) ω^n2(h)=∑i=1n(F^h(Xi,n)−i−0.5n)2+112n(2) ) can be found in Markovich (Citation2007, p. 81).2 Theoretically, events {Ti=1} are allowed. In practice, such cases related to single inter-arrival times between consecutive exceedances are meaningless.3 The modification (ω^n2−0.4/n+0.6/n2)(1+1/n) of classical statistic (Equation2(2) ω^n2(h)=∑i=1n(F^h(Xi,n)−i−0.5n)2+112n(2) ) eliminates the dependence of the percentage points of the C–M–S statistic on the sample size (Stephens Citation1974). For n>40 it changes the statistic on less than one percent. One can use the modification with regard to ω~L2(θ^) for finite L due to the closeness of its distribution to the limit distribution of the C–M–S statistic by Theorem 3.2.Additional informationFundingThe work of N.M. Markovich in Sections 1, 2, 4 and 5 was supported by the Russian Science Foundation [grant number 22-21-00177]. The work of I. V. Rodionov in Section 3 and proofs in Markovich and Rodionov (Citation2022) was performed at the Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of Sciences with the support of the Russian Science Foundation (grant No. 21-71-00035).\",\"PeriodicalId\":50112,\"journal\":{\"name\":\"Journal of Nonparametric Statistics\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2023-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Nonparametric Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/10485252.2023.2266050\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Nonparametric Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/10485252.2023.2266050","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 3
摘要
摘要针对随机过程极值指标的非参数估计,提出了一种新的阈值选择方法。提出了一种数据驱动的平滑方法,用于估计概率密度函数。现在将其修改为选择极值索引估计器的阈值参数。基于cram - von Mises-Smirnov统计量ω2的差异统计量的修正是通过k个最大阶统计量而不是整个样本来计算的。证明了它在k→∞时的渐近分布与ω - 2分布一致。其分位数用作差异值。导出了与差值法相结合的极值指数估计的收敛速度。差异方法被用作区间和k -间隙估计器的自动阈值选择。它可以应用于极值指数的其他估计。通过仿真和实际数据实例对该方法的性能进行了评价。关键词:克拉姆萨姆-冯·米斯-斯米尔诺夫统计差异法极值指数非参数估计阈值选择ams主题分类::62G32披露声明作者未报告潜在利益冲突。理论上,事件{Ti=1}是允许的。在实践中,这类与连续超标之间的单一到达间隔时间有关的情况是没有意义的对于n>40,它对统计量的改变小于1%。对于有限的L,由于它的分布与定理3.2中C-M-S统计量的极限分布很接近,我们可以使用关于ω~L2(θ^)的修正。N.M. Markovich在第1,2,4和5部分的工作得到了俄罗斯科学基金会的支持[资助号22-21-00177]。I. V. Rodionov在第3节中的工作以及Markovich和Rodionov的证明(Citation2022)由俄罗斯科学院信息传输问题研究所(Kharkevich研究所)在俄罗斯科学基金会(资助号21-71-00035)的支持下完成。
ABSTRACTWe propose a new threshold selection method for nonparametric estimation of the extremal index of stochastic processes. The discrepancy method was proposed as a data-driven smoothing tool for estimation of a probability density function. Now it is modified to select a threshold parameter of an extremal index estimator. A modification of the discrepancy statistic based on the Cramér–von Mises–Smirnov statistic ω2 is calculated by k largest order statistics instead of an entire sample. Its asymptotic distribution as k→∞ is proved to coincide with the ω2-distribution. Its quantiles are used as discrepancy values. The convergence rate of an extremal index estimate coupled with the discrepancy method is derived. The discrepancy method is used as an automatic threshold selection for the intervals and K-gaps estimators. It may be applied to other estimators of the extremal index. The performance of our method is evaluated by simulated and real data examples.KEYWORDS: Cramér–von Mises–Smirnov statisticdiscrepancy methodextremal indexnonparametric estimationthreshold selectionAMS SUBJECT CLASSIFICATION:: 62G32 Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1 The connection between (Equation1(1) ωn2=n∫−∞∞(Fn(x)−F(x))2dF(x)(1) ) and (Equation2(2) ω^n2(h)=∑i=1n(F^h(Xi,n)−i−0.5n)2+112n(2) ) can be found in Markovich (Citation2007, p. 81).2 Theoretically, events {Ti=1} are allowed. In practice, such cases related to single inter-arrival times between consecutive exceedances are meaningless.3 The modification (ω^n2−0.4/n+0.6/n2)(1+1/n) of classical statistic (Equation2(2) ω^n2(h)=∑i=1n(F^h(Xi,n)−i−0.5n)2+112n(2) ) eliminates the dependence of the percentage points of the C–M–S statistic on the sample size (Stephens Citation1974). For n>40 it changes the statistic on less than one percent. One can use the modification with regard to ω~L2(θ^) for finite L due to the closeness of its distribution to the limit distribution of the C–M–S statistic by Theorem 3.2.Additional informationFundingThe work of N.M. Markovich in Sections 1, 2, 4 and 5 was supported by the Russian Science Foundation [grant number 22-21-00177]. The work of I. V. Rodionov in Section 3 and proofs in Markovich and Rodionov (Citation2022) was performed at the Institute for Information Transmission Problems (Kharkevich Institute) of the Russian Academy of Sciences with the support of the Russian Science Foundation (grant No. 21-71-00035).
期刊介绍:
Journal of Nonparametric Statistics provides a medium for the publication of research and survey work in nonparametric statistics and related areas. The scope includes, but is not limited to the following topics:
Nonparametric modeling,
Nonparametric function estimation,
Rank and other robust and distribution-free procedures,
Resampling methods,
Lack-of-fit testing,
Multivariate analysis,
Inference with high-dimensional data,
Dimension reduction and variable selection,
Methods for errors in variables, missing, censored, and other incomplete data structures,
Inference of stochastic processes,
Sample surveys,
Time series analysis,
Longitudinal and functional data analysis,
Nonparametric Bayes methods and decision procedures,
Semiparametric models and procedures,
Statistical methods for imaging and tomography,
Statistical inverse problems,
Financial statistics and econometrics,
Bioinformatics and comparative genomics,
Statistical algorithms and machine learning.
Both the theory and applications of nonparametric statistics are covered in the journal. Research applying nonparametric methods to medicine, engineering, technology, science and humanities is welcomed, provided the novelty and quality level are of the highest order.
Authors are encouraged to submit supplementary technical arguments, computer code, data analysed in the paper or any additional information for online publication along with the published paper.