首页 > 最新文献

Canadian Journal of Statistics-Revue Canadienne De Statistique最新文献

英文 中文
New highly efficient high-breakdown estimator of multivariate scatter and location for elliptical distributions 椭圆分布多变量散射和定位的高效高分解估计器
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-04-16 DOI: 10.1002/cjs.11770
Justin Fishbone, Lamine Mili

High-breakdown-point estimators of multivariate location and shape matrices, such as the MM-estimator with smoothed hard rejection and the Rocke S-estimator, are generally designed to have high efficiency for Gaussian data. However, many phenomena are non-Gaussian, and these estimators can therefore have poor efficiency. This article proposes a new tunable S-estimator, termed the Sq-estimator, for the general class of symmetric elliptical distributions, a class containing many common families such as the multivariate Gaussian, t-, Cauchy, Laplace, hyperbolic, and normal inverse Gaussian distributions. Across this class, the Sq-estimator is shown to generally provide higher maximum efficiency than other leading high-breakdown estimators while maintaining the maximum breakdown point. Furthermore, the Sq-estimator is demonstrated to be distributionally robust, and its robustness to outliers is demonstrated to be on par with these leading estimators while also being more stable with respect to initial conditions. From a practical viewpoint, these properties make the Sq-estimator broadly applicable for practitioners. These advantages are demonstrated with an example application—the minimum-variance optimal allocation of financial portfolio investments.

多变量位置和形状矩阵的高崩溃点估计器,如光滑硬抑制mm估计器和rock s估计器,通常在高斯分布下具有很高的效率。然而,许多现象是非高斯的,因此这些估计器的效率很低。对于一般的对称椭圆分布,本文提出了一种新的可调s估计量,称为S-q估计量,这类分布包含许多常见的族,如多元高斯分布、t-分布、柯西分布、拉普拉斯分布、双曲分布和正态逆高斯分布。在这个类别中,S-q估计器通常比其他领先的高击穿估计器提供更高的最大效率,同时保持最大击穿点。此外,它的鲁棒性被证明与这些领先的估计相当,同时相对于初始条件也更稳定。从实际的角度来看,这些特性使S-q广泛适用于从业者。这通过一个示例应用程序来演示——金融组合投资的最小方差最优分配。
{"title":"New highly efficient high-breakdown estimator of multivariate scatter and location for elliptical distributions","authors":"Justin Fishbone,&nbsp;Lamine Mili","doi":"10.1002/cjs.11770","DOIUrl":"10.1002/cjs.11770","url":null,"abstract":"<p>High-breakdown-point estimators of multivariate location and shape matrices, such as the <span></span><math>\u0000 <mrow>\u0000 <mtext>MM</mtext>\u0000 </mrow></math>-<i>estimator</i> with smoothed hard rejection and the Rocke <span></span><math>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow></math>-estimator, are generally designed to have high efficiency for Gaussian data. However, many phenomena are non-Gaussian, and these estimators can therefore have poor efficiency. This article proposes a new tunable <span></span><math>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow></math>-estimator, termed the <span></span><math>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>q</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow></math>-estimator, for the general class of symmetric elliptical distributions, a class containing many common families such as the multivariate Gaussian, <span></span><math>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow></math>-, Cauchy, Laplace, hyperbolic, and normal inverse Gaussian distributions. Across this class, the <span></span><math>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>q</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow></math>-estimator is shown to generally provide higher maximum efficiency than other leading high-breakdown estimators while maintaining the maximum breakdown point. Furthermore, the <span></span><math>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>q</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow></math>-estimator is demonstrated to be distributionally robust, and its robustness to outliers is demonstrated to be on par with these leading estimators while also being more stable with respect to initial conditions. From a practical viewpoint, these properties make the <span></span><math>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>q</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow></math>-estimator broadly applicable for practitioners. These advantages are demonstrated with an example application—the minimum-variance optimal allocation of financial portfolio investments.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 2","pages":"437-460"},"PeriodicalIF":0.6,"publicationDate":"2023-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11770","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44556703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A zero-modified geometric INAR(1) model for analyzing count time series with multiple features 一个零修正的几何INAR(1)模型用于分析具有多特征的计数时间序列
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-04-04 DOI: 10.1002/cjs.11774
Yao Kang, Fukang Zhu, Dehui Wang, Shuhui Wang

Zero inflation, zero deflation, overdispersion, and underdispersion are commonly encountered in count time series. To better describe these characteristics of counts, this article introduces a zero-modified geometric first-order integer-valued autoregressive (INAR(1)) model based on the generalized negative binomial thinning operator, which contains dependent zero-inflated geometric counting series. The new model contains the NGINAR(1) model, ZMGINAR(1) model, and GNBINAR(1) model with geometric marginals as special cases. Some statistical properties are studied, and estimates of the model parameters are derived by the Yule–Walker, conditional least squares, and maximum likelihood methods. Asymptotic properties and numerical results of the estimators are also studied. In addition, some test and forecasting problems are addressed. Three real-data examples are given to show the flexibility and practicability of the new model.

零膨胀、零紧缩、过度分散和分散不足是计数时间序列中经常遇到的问题。为了更好地描述计数的这些特征,本文介绍了基于广义负二叉稀疏算子的零修正几何一阶整数值自回归(INAR(1))模型,该模型包含依赖的零膨胀几何计数序列。新模型包含作为特例的 NGINAR(1) 模型、ZMGINAR(1) 模型和具有几何边际的 GNBINAR(1) 模型。研究了一些统计特性,并通过 Yule-Walker、条件最小二乘法和最大似然法得出了模型参数的估计值。还研究了估计值的渐近特性和数值结果。此外,还讨论了一些测试和预测问题。还给出了三个真实数据示例,以展示新模型的灵活性和实用性。
{"title":"A zero-modified geometric INAR(1) model for analyzing count time series with multiple features","authors":"Yao Kang,&nbsp;Fukang Zhu,&nbsp;Dehui Wang,&nbsp;Shuhui Wang","doi":"10.1002/cjs.11774","DOIUrl":"10.1002/cjs.11774","url":null,"abstract":"<p>Zero inflation, zero deflation, overdispersion, and underdispersion are commonly encountered in count time series. To better describe these characteristics of counts, this article introduces a zero-modified geometric first-order integer-valued autoregressive (INAR(1)) model based on the generalized negative binomial thinning operator, which contains dependent zero-inflated geometric counting series. The new model contains the NGINAR(1) model, ZMGINAR(1) model, and GNBINAR(1) model with geometric marginals as special cases. Some statistical properties are studied, and estimates of the model parameters are derived by the Yule–Walker, conditional least squares, and maximum likelihood methods. Asymptotic properties and numerical results of the estimators are also studied. In addition, some test and forecasting problems are addressed. Three real-data examples are given to show the flexibility and practicability of the new model.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 3","pages":"873-899"},"PeriodicalIF":0.8,"publicationDate":"2023-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42545884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal multiwave validation of secondary use data with outcome and exposure misclassification 结果和暴露错误分类的二次使用数据的最佳多波验证
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-03-31 DOI: 10.1002/cjs.11772
Sarah C. Lotspeich, Gustavo G. C. Amorim, Pamela A. Shaw, Ran Tao, Bryan E. Shepherd

Observational databases provide unprecedented opportunities for secondary use in biomedical research. However, these data can be error-prone and must be validated before use. It is usually unrealistic to validate the whole database because of resource constraints. A cost-effective alternative is a two-phase design that validates a subset of records enriched for information about a particular research question. We consider odds ratio estimation under differential outcome and exposure misclassification and propose optimal designs that minimize the variance of the maximum likelihood estimator. Our adaptive grid search algorithm can locate the optimal design in a computationally feasible manner. Because the optimal design relies on unknown parameters, we introduce a multiwave strategy to approximate the optimal design. We demonstrate the proposed design's efficiency gains through simulations and two large observational studies.

电子健康记录(EHR)等观测数据库的日益可用性为在生物医学研究中二次使用此类数据提供了前所未有的机会。然而,这些数据可能容易出错,需要在使用前进行验证。由于资源限制,验证整个数据库通常是不现实的。一种具有成本效益的替代方案是实施两阶段设计,验证患者记录的子集,这些子集被丰富以获取有关感兴趣的研究问题的信息。在此,我们考虑了差异结果和暴露错误分类下的比值比估计。我们提出了最小化最大似然比值比估计器方差的最优设计。我们开发了一种新的自适应网格搜索算法,该算法可以以计算可行和数值精确的方式定位最优设计。由于优化设计一开始就需要指定未知参数,因此在没有先验信息的情况下是无法实现的,因此我们引入了一种多波采样策略来在实践中对其进行近似。通过广泛的模拟和两项大型观测研究,我们展示了拟议设计相对于现有设计的效率增益。我们提供R包和Shiny应用程序,以方便使用最佳设计。
{"title":"Optimal multiwave validation of secondary use data with outcome and exposure misclassification","authors":"Sarah C. Lotspeich,&nbsp;Gustavo G. C. Amorim,&nbsp;Pamela A. Shaw,&nbsp;Ran Tao,&nbsp;Bryan E. Shepherd","doi":"10.1002/cjs.11772","DOIUrl":"10.1002/cjs.11772","url":null,"abstract":"<p>Observational databases provide unprecedented opportunities for secondary use in biomedical research. However, these data can be error-prone and must be validated before use. It is usually unrealistic to validate the whole database because of resource constraints. A cost-effective alternative is a two-phase design that validates a subset of records enriched for information about a particular research question. We consider odds ratio estimation under differential outcome and exposure misclassification and propose optimal designs that minimize the variance of the maximum likelihood estimator. Our adaptive grid search algorithm can locate the optimal design in a computationally feasible manner. Because the optimal design relies on unknown parameters, we introduce a multiwave strategy to approximate the optimal design. We demonstrate the proposed design's efficiency gains through simulations and two large observational studies.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 2","pages":"532-554"},"PeriodicalIF":0.6,"publicationDate":"2023-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11772","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46055319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A stable and adaptive polygenic signal detection method based on repeated sample splitting 一种基于重复样本分割的稳定自适应多基因信号检测方法
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-03-31 DOI: 10.1002/cjs.11768
Yanyan Zhao, Lei Sun

Focusing on polygenic signal detection in high-dimensional genetic association studies of complex traits, we develop a stable and adaptive test for generalized linear models to accommodate different alternatives. To facilitate valid post-selection inference for high-dimensional data, our study here adheres to the original sample-splitting principle but does so repeatedly to increase stability of the inference. We show the asymptotic null distribution of the proposed test for both fixed and diverging numbers of variants. We also show the asymptotic properties of the proposed test under local alternatives, providing insights on why power gain attributed to variable selection and weighting can compensate for efficiency loss due to sample splitting. We support our analytical findings through extensive simulation studies and two applications. The proposed procedure is computationally efficient and has been implemented as the R package DoubleCauchy.

使用多基因风险评分进行性状关联分析和疾病预测对于复杂性状的遗传研究至关重要。有效的推断依赖于样本分割或最近的外部数据,以获得一组潜在的相关遗传变异及其权重,用于构建多基因风险评分。外部数据的使用一直很受欢迎,但由于不同样本之间潜在数据异质性的不利影响,最近的工作越来越多地对其使用提出质疑。我们在这里的研究坚持最初的采样分裂原则,但重复这样做是为了增加我们推断的稳定性。为了适应不同的多基因结构,我们为广义线性模型开发了一种自适应测试。我们提供了所提出的检验的渐近零分布,无论是固定的还是发散的变量数。我们还展示了所提出的测试在局部备选方案下的渐近性质,深入了解了为什么归因于变量选择和加权的功率增益可以补偿由于样本分裂而造成的效率损失。我们通过广泛的模拟研究和应用支持我们的分析结果。
{"title":"A stable and adaptive polygenic signal detection method based on repeated sample splitting","authors":"Yanyan Zhao,&nbsp;Lei Sun","doi":"10.1002/cjs.11768","DOIUrl":"10.1002/cjs.11768","url":null,"abstract":"<p>Focusing on polygenic signal detection in high-dimensional genetic association studies of complex traits, we develop a stable and adaptive test for generalized linear models to accommodate different alternatives. To facilitate valid post-selection inference for high-dimensional data, our study here adheres to the original sample-splitting principle but does so repeatedly to increase stability of the inference. We show the asymptotic null distribution of the proposed test for both fixed and diverging numbers of variants. We also show the asymptotic properties of the proposed test under local alternatives, providing insights on why power gain attributed to variable selection and weighting can compensate for efficiency loss due to sample splitting. We support our analytical findings through extensive simulation studies and two applications. The proposed procedure is computationally efficient and has been implemented as the <span>R</span> package <span>DoubleCauchy</span>.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 1","pages":"79-97"},"PeriodicalIF":0.6,"publicationDate":"2023-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44015590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A class of space-filling designs with low-dimensional stratification and column orthogonality 一类具有低维分层和柱正交性的空间填充设计
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-03-28 DOI: 10.1002/cjs.11761
Pengnan Li, Fasheng Sun

Strong orthogonal arrays are suitable designs for computer experiments because of stratification in low-dimensional projections. However, strong orthogonal arrays may be very expensive for a moderate number of factors. In this article, we develop a method for constructing space-filling designs with more economical run sizes. These designs are not only column-orthogonal but also enjoy a large proportion of low-dimensional stratification properties that strong orthogonal arrays ought to have. Moreover, a class of proposed designs can be 3-orthogonal. In addition, some theoretical results on regular fractional factorial designs are obtained as a by-product.

强正交阵列适合计算机实验设计,因为它在低维投影中具有分层作用。然而,对于中等数量的因子来说,强正交阵列可能非常昂贵。在本文中,我们开发了一种方法,用于构建运行规模更经济的空间填充设计。这些设计不仅具有列正交性,而且还具有强正交阵列所应具有的大量低维分层特性。此外,所提出的一类设计可以是 3 正交的。此外,作为副产品,还获得了一些关于正则分数阶乘设计的理论结果。
{"title":"A class of space-filling designs with low-dimensional stratification and column orthogonality","authors":"Pengnan Li,&nbsp;Fasheng Sun","doi":"10.1002/cjs.11761","DOIUrl":"10.1002/cjs.11761","url":null,"abstract":"<p>Strong orthogonal arrays are suitable designs for computer experiments because of stratification in low-dimensional projections. However, strong orthogonal arrays may be very expensive for a moderate number of factors. In this article, we develop a method for constructing space-filling designs with more economical run sizes. These designs are not only column-orthogonal but also enjoy a large proportion of low-dimensional stratification properties that strong orthogonal arrays ought to have. Moreover, a class of proposed designs can be 3-orthogonal. In addition, some theoretical results on regular fractional factorial designs are obtained as a by-product.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 1","pages":"310-326"},"PeriodicalIF":0.6,"publicationDate":"2023-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44562952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Penalized complexity priors for the skewness parameter of power links 电力链路偏度参数的惩罚复杂度先验
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-03-27 DOI: 10.1002/cjs.11769
José A. Ordoñez, Marcos O. Prates, Jorge L. Bazán, Victor H. Lachos

The choice of a prior distribution is a key aspect of the Bayesian method. However, in many cases, such as the family of power links, this is not trivial. In this article, we introduce a penalized complexity prior (PC prior) of the skewness parameter for this family, which is useful for dealing with imbalanced data. We derive a general expression for this density and show its usefulness for some particular cases such as the power logit and the power probit links. A simulation study and a real data application are used to assess the efficiency of the introduced densities in comparison with the Gaussian and uniform priors. Results show improvement in point and credible interval estimation for the considered models when using the PC prior in comparison to other well-known standard priors.

先验分布的选择是贝叶斯方法的一个关键方面。然而,在很多情况下,例如幂级数联立方程族,这并非易事。在本文中,我们为该系列引入了偏度参数的受惩罚复杂度先验(PC 先验),这对于处理不平衡数据非常有用。我们推导出了该密度的一般表达式,并展示了它在一些特殊情况下的实用性,如 power logit 和 power probit 链接。我们使用模拟研究和真实数据应用来评估引入的密度与高斯先验和均匀先验相比的效率。结果表明,与其他著名的标准先验相比,使用 PC 先验时,所考虑模型的点估计和可信区间估计都有所改进。
{"title":"Penalized complexity priors for the skewness parameter of power links","authors":"José A. Ordoñez,&nbsp;Marcos O. Prates,&nbsp;Jorge L. Bazán,&nbsp;Victor H. Lachos","doi":"10.1002/cjs.11769","DOIUrl":"10.1002/cjs.11769","url":null,"abstract":"<p>The choice of a prior distribution is a key aspect of the Bayesian method. However, in many cases, such as the family of power links, this is not trivial. In this article, we introduce a penalized complexity prior (PC prior) of the skewness parameter for this family, which is useful for dealing with imbalanced data. We derive a general expression for this density and show its usefulness for some particular cases such as the power logit and the power probit links. A simulation study and a real data application are used to assess the efficiency of the introduced densities in comparison with the Gaussian and uniform priors. Results show improvement in point and credible interval estimation for the considered models when using the PC prior in comparison to other well-known standard priors.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 1","pages":"98-117"},"PeriodicalIF":0.6,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48663145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust nonparametric hypothesis tests for differences in the covariance structure of functional data 函数数据协方差结构差异的稳健非参数假设检验
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-03-25 DOI: 10.1002/cjs.11767
Kelly Ramsay, Shoja'eddin Chenouri

We develop a group of robust, nonparametric hypothesis tests that detect differences between the covariance operators of several populations of functional data. These tests, called functional Kruskal–Wallis tests for covariance, or FKWC tests, are based on functional data depth ranks. FKWC tests work well even when the data are heavy-tailed, which is shown both in simulation and theory. FKWC tests offer several other benefits: they have a simple asymptotic distribution under the null hypothesis, they are computationally cheap, and they possess transformation-invariance properties. We show that under general alternative hypotheses, these tests are consistent under mild, nonparametric assumptions. As a result, we introduce a new functional depth function called L2-root depth that works well for the purposes of detecting differences in magnitude between covariance kernels. We present an analysis of the FKWC test based on L2-root depth under local alternatives. Through simulations, when the true covariance kernels have an infinite number of positive eigenvalues, we show that these tests have higher power than their competitors while maintaining their nominal size. We also provide a method for computing sample size and performing multiple comparisons.

我们开发了一组鲁棒的非参数假设检验,用于检测几个功能数据总体的协方差算子之间的差异。这些测试称为FKWC测试,基于功能数据深度排名。这些测试即使在数据是重尾的情况下也能很好地工作,这在模拟和理论上都得到了证明。这些测试还提供了其他一些好处,它们在零假设下有一个简单的分布,它们的计算成本很低,并且它们具有变换不变性。我们表明,在一般替代假设下,这些检验在温和的非参数假设下是一致的。作为这项工作的结果,我们引入了一个新的功能深度函数,称为l2 -根深度,它可以很好地用于检测协方差核之间的幅度差异。我们提出了在局部替代方案下使用l2根深度的FKWC测试的分析。在模拟中,当真正的协方差核具有严格的正特征值时,我们表明这些测试比它们的竞争对手具有更高的功率,同时仍然保持其标称大小。我们还提供了一种计算样本大小和执行多重比较的方法。
{"title":"Robust nonparametric hypothesis tests for differences in the covariance structure of functional data","authors":"Kelly Ramsay,&nbsp;Shoja'eddin Chenouri","doi":"10.1002/cjs.11767","DOIUrl":"10.1002/cjs.11767","url":null,"abstract":"<p>We develop a group of robust, nonparametric hypothesis tests that detect differences between the covariance operators of several populations of functional data. These tests, called functional Kruskal–Wallis tests for covariance, or FKWC tests, are based on functional data depth ranks. FKWC tests work well even when the data are heavy-tailed, which is shown both in simulation and theory. FKWC tests offer several other benefits: they have a simple asymptotic distribution under the null hypothesis, they are computationally cheap, and they possess transformation-invariance properties. We show that under general alternative hypotheses, these tests are consistent under mild, nonparametric assumptions. As a result, we introduce a new functional depth function called <math>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msup></math>-root depth that works well for the purposes of detecting differences in magnitude between covariance kernels. We present an analysis of the FKWC test based on <math>\u0000 <msup>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msup></math>-root depth under local alternatives. Through simulations, when the true covariance kernels have an infinite number of positive eigenvalues, we show that these tests have higher power than their competitors while maintaining their nominal size. We also provide a method for computing sample size and performing multiple comparisons.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 1","pages":"43-78"},"PeriodicalIF":0.6,"publicationDate":"2023-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47021433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acknowledgement of Referees' Services Remerciements aux membres des jurys 对推荐人服务的认可感谢陪审团成员
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-02-18 DOI: 10.1002/cjs.11766
Aeberhard, William H. ETH Zürich Asgharian, Masoud McGill University Bahraoui, Tarik* Université du Québec à Montréal Battey, Heather Imperial College London Bédard, Mylène Université de Montréal Bellhouse, David* University of Western Ontario Berger, Yves* University of Southampton Braekers, Roel Hasselt University Brazzale, Alessandra University of Padova Cai, Song Carleton University Cao, Guanqun Auburn University Casa, Alessandro Free University of Bozen-Bolzano Chatterjee, Kashinath* Visva-Bharati University Chen, Baojiang University of Texas Health Science Center Chen, Guanhua University of Wisconsin-Madison Chen, Sixia University of Oklahoma Health Sciences Center Chen, Yaqing* University of California Davis Cheng, Yu University of Pittsburgh Cheung, Rex San Francisco State University Coia, Vincenzo University of British Columbia Cook, Richard University of Waterloo Csató, László ELKH SZTAKI Dagne, Getachew University of South Florida Dai, Ben Chinese University of Hong Kong
Aeberhard, William H. ETH zrich Asgharian, Masoud McGill University Bahraoui, Tarik* University of quemacbec montracimal Battey, Heather Imperial College London b, myl University de montracimal Bellhouse, David* University of Western Ontario Berger, Yves* University of Southampton Braekers, Roel Hasselt University Brazzale, Alessandra University of Padova Cai, Song Carleton University Cao, Guanqun Auburn University Casa, Alessandro Free University of Bozen-Bolzano Chatterjee,Kashinath* Visva-Bharati University Chen, Baojiang University of Texas Health Science Center Chen, wisconsin Guanhua University - madison Chen, Sixia University of Oklahoma Health Science Center Chen, Yaqing* California University Davis Cheng, Yu University of Pittsburgh b张,Rex San Francisco State University Coia, Vincenzo University of British Columbia Cook, Richard University of Waterloo Csató, László ELKH SZTAKI Dagne, Getachew University of South Florida Dai,香港中文大学
{"title":"Acknowledgement of Referees' Services Remerciements aux membres des jurys","authors":"","doi":"10.1002/cjs.11766","DOIUrl":"10.1002/cjs.11766","url":null,"abstract":"Aeberhard, William H. ETH Zürich Asgharian, Masoud McGill University Bahraoui, Tarik* Université du Québec à Montréal Battey, Heather Imperial College London Bédard, Mylène Université de Montréal Bellhouse, David* University of Western Ontario Berger, Yves* University of Southampton Braekers, Roel Hasselt University Brazzale, Alessandra University of Padova Cai, Song Carleton University Cao, Guanqun Auburn University Casa, Alessandro Free University of Bozen-Bolzano Chatterjee, Kashinath* Visva-Bharati University Chen, Baojiang University of Texas Health Science Center Chen, Guanhua University of Wisconsin-Madison Chen, Sixia University of Oklahoma Health Sciences Center Chen, Yaqing* University of California Davis Cheng, Yu University of Pittsburgh Cheung, Rex San Francisco State University Coia, Vincenzo University of British Columbia Cook, Richard University of Waterloo Csató, László ELKH SZTAKI Dagne, Getachew University of South Florida Dai, Ben Chinese University of Hong Kong","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"51 1","pages":"344-349"},"PeriodicalIF":0.6,"publicationDate":"2023-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42178036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PCA Rerandomization
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-02-16 DOI: 10.1002/cjs.11765
Hengtao Zhang, Guosheng Yin, Donald B. Rubin

Mahalanobis distance of covariate means between treatment and control groups is often adopted as a balance criterion when implementing a rerandomization strategy. However, this criterion may not work well for high-dimensional cases because it balances all orthogonalized covariates equally. We propose using principal component analysis (PCA) to identify proper subspaces in which Mahalanobis distance should be calculated. Not only can PCA effectively reduce the dimensionality for high-dimensional covariates, but it also provides computational simplicity by focusing on the top orthogonal components. The PCA rerandomization scheme has desirable theoretical properties for balancing covariates and thereby improving the estimation of average treatment effects. This conclusion is supported by numerical studies using both simulated and real examples.

在实施再随机化策略时,治疗组和对照组之间的马氏距离协变均值通常被用作平衡标准。然而,这个标准可能不适用于高维情况,因为它平等地平衡了所有正交协变量。在这里,我们建议利用主成分分析(PCA)来确定应该在其中计算Mahalanobis距离的适当子空间。PCA不仅可以有效地降低高维情况的维数,同时捕获协变量中的大部分信息,而且它还通过关注顶部正交分量来提供计算简单性。我们证明了我们的PCA重随机化方案在平衡协变量方面具有理想的理论性质,从而改进了平均治疗效果的估计。我们还表明,这一结论得到了数值研究的支持,包括模拟和实际例子。
{"title":"PCA Rerandomization","authors":"Hengtao Zhang,&nbsp;Guosheng Yin,&nbsp;Donald B. Rubin","doi":"10.1002/cjs.11765","DOIUrl":"10.1002/cjs.11765","url":null,"abstract":"<p>Mahalanobis distance of covariate means between treatment and control groups is often adopted as a balance criterion when implementing a rerandomization strategy. However, this criterion may not work well for high-dimensional cases because it balances all orthogonalized covariates equally. We propose using principal component analysis (PCA) to identify proper subspaces in which Mahalanobis distance should be calculated. Not only can PCA effectively reduce the dimensionality for high-dimensional covariates, but it also provides computational simplicity by focusing on the top orthogonal components. The PCA rerandomization scheme has desirable theoretical properties for balancing covariates and thereby improving the estimation of average treatment effects. This conclusion is supported by numerical studies using both simulated and real examples.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 1","pages":"5-25"},"PeriodicalIF":0.6,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cjs.11765","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44206441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Method of model checking for case II interval-censored data under the additive hazards model 加性危害模型下案例II区间截尾数据的模型检验方法
IF 0.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-02-16 DOI: 10.1002/cjs.11759
Yanqin Feng, Ming Tang, Jieli Ding

General or case II interval-censored data are commonly encountered in practice. We develop methods for model-checking and goodness-of-fit testing for the additive hazards model with case II interval-censored data. We propose test statistics based on the supremum of the stochastic processes derived from the cumulative sum of martingale-based residuals over time and covariates. We approximate the distribution of the stochastic process via a simulation technique to conduct a class of graphical and numerical techniques for various purposes of model-fitting evaluations. Simulation studies are conducted to assess the finite-sample performance of the proposed method. A real dataset from an AIDS observational study is analyzed for illustration.

在实践中经常会遇到一般或情况 II 区间删失数据。我们开发了使用情况 II 间隔删失数据的加性危险模型的模型检查和拟合优度检验方法。我们提出的检验统计量是基于马氏残差随时间和协变量的累积和得出的随机过程的上峰。我们通过模拟技术对随机过程的分布进行了近似,从而为模型拟合评估的各种目的提供了一类图形和数值技术。我们进行了模拟研究,以评估所提出方法的有限样本性能。为说明起见,还分析了一项艾滋病观察研究的真实数据集。
{"title":"Method of model checking for case II interval-censored data under the additive hazards model","authors":"Yanqin Feng,&nbsp;Ming Tang,&nbsp;Jieli Ding","doi":"10.1002/cjs.11759","DOIUrl":"10.1002/cjs.11759","url":null,"abstract":"<p>General or case II interval-censored data are commonly encountered in practice. We develop methods for model-checking and goodness-of-fit testing for the additive hazards model with case II interval-censored data. We propose test statistics based on the supremum of the stochastic processes derived from the cumulative sum of martingale-based residuals over time and covariates. We approximate the distribution of the stochastic process via a simulation technique to conduct a class of graphical and numerical techniques for various purposes of model-fitting evaluations. Simulation studies are conducted to assess the finite-sample performance of the proposed method. A real dataset from an AIDS observational study is analyzed for illustration.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 1","pages":"212-236"},"PeriodicalIF":0.6,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48612832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Canadian Journal of Statistics-Revue Canadienne De Statistique
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1