首页 > 最新文献

Scandinavian Journal of Statistics最新文献

英文 中文
A special section honoring Nils Lid Hjort 纪念尼尔斯-利德-希约尔特的专栏
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-23 DOI: 10.1111/sjos.12745
Ørnulf Borgan, Ingrid K. Glad
{"title":"A special section honoring Nils Lid Hjort","authors":"Ørnulf Borgan, Ingrid K. Glad","doi":"10.1111/sjos.12745","DOIUrl":"https://doi.org/10.1111/sjos.12745","url":null,"abstract":"","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141780376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian mixture models (in)consistency for the number of clusters 贝叶斯混合物模型在聚类数量方面的(不)一致性
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-23 DOI: 10.1111/sjos.12739
Louise Alamichel, Daria Bystrova, Julyan Arbel, Guillaume Kon Kam King
Bayesian nonparametric mixture models are common for modeling complex data. While these models are well‐suited for density estimation, recent results proved posterior inconsistency of the number of clusters when the true number of components is finite, for the Dirichlet process and Pitman–Yor process mixture models. We extend these results to additional Bayesian nonparametric priors such as Gibbs‐type processes and finite‐dimensional representations thereof. The latter include the Dirichlet multinomial process, the recently proposed Pitman–Yor, and normalized generalized gamma multinomial processes. We show that mixture models based on these processes are also inconsistent in the number of clusters and discuss possible solutions. Notably, we show that a postprocessing algorithm introduced for the Dirichlet process can be extended to more general models and provides a consistent method to estimate the number of components.
贝叶斯非参数混合模型是复杂数据建模的常用方法。虽然这些模型非常适合密度估计,但最近的研究结果证明,当成分的真实数量有限时,对于 Dirichlet 过程和 Pitman-Yor 过程混合物模型,聚类数量的后验不一致。我们将这些结果扩展到其他贝叶斯非参数先验,如吉布斯型过程及其有限维表示。后者包括 Dirichlet 多叉过程、最近提出的 Pitman-Yor 过程和归一化广义伽马多叉过程。我们证明了基于这些过程的混合物模型在聚类数量上也是不一致的,并讨论了可能的解决方案。值得注意的是,我们证明了针对 Dirichlet 过程引入的后处理算法可以扩展到更一般的模型,并提供了一种估计成分数的一致方法。
{"title":"Bayesian mixture models (in)consistency for the number of clusters","authors":"Louise Alamichel, Daria Bystrova, Julyan Arbel, Guillaume Kon Kam King","doi":"10.1111/sjos.12739","DOIUrl":"https://doi.org/10.1111/sjos.12739","url":null,"abstract":"Bayesian nonparametric mixture models are common for modeling complex data. While these models are well‐suited for density estimation, recent results proved posterior inconsistency of the number of clusters when the true number of components is finite, for the Dirichlet process and Pitman–Yor process mixture models. We extend these results to additional Bayesian nonparametric priors such as Gibbs‐type processes and finite‐dimensional representations thereof. The latter include the Dirichlet multinomial process, the recently proposed Pitman–Yor, and normalized generalized gamma multinomial processes. We show that mixture models based on these processes are also inconsistent in the number of clusters and discuss possible solutions. Notably, we show that a postprocessing algorithm introduced for the Dirichlet process can be extended to more general models and provides a consistent method to estimate the number of components.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141780375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A classical hypothesis test for assessing the homogeneity of disease transmission in stochastic epidemic models 用于评估随机流行病模型中疾病传播同质性的经典假设检验
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-19 DOI: 10.1111/sjos.12743
Georgios Aristotelous, Theodore Kypraios, Philip D. O'Neill
This paper addresses the problem of assessing the homogeneity of the disease transmission process in stochastic epidemic models in populations that are partitioned into social groups. We develop a classical hypothesis test for completed epidemics which assesses whether or not there is significant within‐group transmission during an outbreak. The test is based on time‐ordered group labels of individuals. The null hypothesis is that of homogeneity of disease transmission among individuals, a hypothesis under which the discrete random vector of groups labels has a known sampling distribution that is independent of any model parameters. The test exhibits excellent performance when applied to various scenarios of simulated data and is also illustrated using two real‐life epidemic data sets. We develop some asymptotic theory including a central limit theorem. The test is practically very appealing, being computationally cheap and straightforward to implement, as well as being applicable to a wide range of real‐life outbreak settings and to related problems in other fields.
本文探讨了在随机流行病模型中评估疾病传播过程的同质性问题。我们为已完成的流行病开发了一种经典的假设检验,用于评估疫情爆发期间是否存在显著的群体内传播。该检验基于时间排序的个体群体标签。零假设是疾病在个体间传播的同质性,即群体标签的离散随机向量具有与任何模型参数无关的已知抽样分布。该检验在应用于各种模拟数据场景时表现出卓越的性能,我们还使用两个真实的流行病数据集对其进行了说明。我们提出了一些渐近理论,包括中心极限定理。该检验实际上非常吸引人,计算成本低,实施简单,适用于广泛的现实生活中的疫情爆发环境和其他领域的相关问题。
{"title":"A classical hypothesis test for assessing the homogeneity of disease transmission in stochastic epidemic models","authors":"Georgios Aristotelous, Theodore Kypraios, Philip D. O'Neill","doi":"10.1111/sjos.12743","DOIUrl":"https://doi.org/10.1111/sjos.12743","url":null,"abstract":"This paper addresses the problem of assessing the homogeneity of the disease transmission process in stochastic epidemic models in populations that are partitioned into social groups. We develop a classical hypothesis test for completed epidemics which assesses whether or not there is significant within‐group transmission during an outbreak. The test is based on time‐ordered group labels of individuals. The null hypothesis is that of homogeneity of disease transmission among individuals, a hypothesis under which the discrete random vector of groups labels has a known sampling distribution that is independent of any model parameters. The test exhibits excellent performance when applied to various scenarios of simulated data and is also illustrated using two real‐life epidemic data sets. We develop some asymptotic theory including a central limit theorem. The test is practically very appealing, being computationally cheap and straightforward to implement, as well as being applicable to a wide range of real‐life outbreak settings and to related problems in other fields.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141738111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adjusted location‐invariant U‐tests for the covariance matrix with elliptically high‐dimensional data 椭圆高维数据协方差矩阵的调整位置不变 U 检验
IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-14 DOI: 10.1111/sjos.12738
Kai Xu, Yeqing Zhou, Liping Zhu
This paper analyzes several covariance matrix U‐tests, which are constructed by modifying the classical John‐Nagao and Ledoit‐Wolf tests, under the elliptically distributed data structure. We study the limiting distributions of these location‐invariant test statistics as the data dimension may go to infinity in an arbitrary way as the sample size does. We find that they tend to have unsatisfactory size performances for general elliptical population. This is mainly because such population often possesses high‐order correlations among their coordinates. Taking such kind of dependency into consideration, we propose necessary corrections for these tests to cope with elliptically high‐dimensional data. For computational efficiency, alternative forms of the new test statistics are also provided. We derive the universal asymptotic null distributions of the proposed test statistics under elliptical distributions and beyond. The powers of the proposed tests are further investigated. The accuracy of the tests is demonstrated by simulations and an empirical study.
本文分析了椭圆分布数据结构下的几种协方差矩阵 U 检验,这些检验是通过修改经典的 John-Nagao 检验和 Ledoit-Wolf 检验而构建的。我们研究了这些位置不变检验统计量的极限分布,因为随着样本量的增加,数据维度可能会以任意方式达到无穷大。我们发现,对于一般的椭圆群体,这些统计量的大小表现往往不能令人满意。这主要是因为这类群体的坐标之间往往具有高阶相关性。考虑到这种依赖性,我们对这些检验提出了必要的修正,以应对椭圆高维数据。为了提高计算效率,我们还提供了新检验统计量的替代形式。我们推导出了所提出的检验统计量在椭圆分布及其他分布下的普遍渐近零分布。我们还进一步研究了拟议检验的幂。我们通过模拟和实证研究证明了检验的准确性。
{"title":"Adjusted location‐invariant U‐tests for the covariance matrix with elliptically high‐dimensional data","authors":"Kai Xu, Yeqing Zhou, Liping Zhu","doi":"10.1111/sjos.12738","DOIUrl":"https://doi.org/10.1111/sjos.12738","url":null,"abstract":"This paper analyzes several covariance matrix U‐tests, which are constructed by modifying the classical John‐Nagao and Ledoit‐Wolf tests, under the elliptically distributed data structure. We study the limiting distributions of these location‐invariant test statistics as the data dimension may go to infinity in an arbitrary way as the sample size does. We find that they tend to have unsatisfactory size performances for general elliptical population. This is mainly because such population often possesses high‐order correlations among their coordinates. Taking such kind of dependency into consideration, we propose necessary corrections for these tests to cope with elliptically high‐dimensional data. For computational efficiency, alternative forms of the new test statistics are also provided. We derive the universal asymptotic null distributions of the proposed test statistics under elliptical distributions and beyond. The powers of the proposed tests are further investigated. The accuracy of the tests is demonstrated by simulations and an empirical study.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":0.8,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141649823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Minimax rate of estimation for invariant densities associated to continuous stochastic differential equations over anisotropic Hölder classes 各向异性荷尔德类上与连续随机微分方程相关的不变量密度的最小估计率
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-11 DOI: 10.1111/sjos.12735
Chiara Amorino, Arnaud Gloter
We study the problem of the nonparametric estimation for the density of the stationary distribution of a ‐dimensional stochastic differential equation . From the continuous observation of the sampling path on , we study the estimation of as goes to infinity. For , we characterize the minimax rate for the ‐risk in pointwise estimation over a class of anisotropic Hölder functions with regularity . For , our finding is that, having ordered the smoothness such that , the minimax rate depends on whether or . In the first case, this rate is , and in the second case, it is , where is an explicit exponent dependent on the dimension and , the harmonic mean of smoothness over the directions after excluding and , the smallest ones. We also demonstrate that kernel‐based estimators achieve the optimal minimax rate. Furthermore, we propose an adaptive procedure for both integrated and pointwise risk. In the two‐dimensional case, we show that kernel density estimators achieve the rate , which is optimal in the minimax sense. Finally we illustrate the validity of our theoretical findings by proposing numerical results.
我们研究一维随机微分方程静态分布密度的非参数估计问题。从连续观察取样路径的角度,我们研究了当取样路径变为无穷大时的估计问题。对于 ,我们描述了在一类具有正则性的各向异性赫尔德函数上进行定点估计时-风险的最小率。对于 ,我们的发现是,当平滑度排序为 ,最小率取决于 ,还是 。 在第一种情况下,该率为 ,而在第二种情况下,该率为 ,其中 ,是一个取决于维度的显式指数,而 ,是在剔除 ,和 ,最小方向后,各方向平滑度的谐波平均值。我们还证明,基于核的估计器能达到最佳最小率。此外,我们还针对综合风险和点风险提出了一种自适应程序。在二维情况下,我们证明核密度估计器达到了最小值意义上的最优率。最后,我们通过提出数值结果来说明我们的理论发现是正确的。
{"title":"Minimax rate of estimation for invariant densities associated to continuous stochastic differential equations over anisotropic Hölder classes","authors":"Chiara Amorino, Arnaud Gloter","doi":"10.1111/sjos.12735","DOIUrl":"https://doi.org/10.1111/sjos.12735","url":null,"abstract":"We study the problem of the nonparametric estimation for the density of the stationary distribution of a ‐dimensional stochastic differential equation . From the continuous observation of the sampling path on , we study the estimation of as goes to infinity. For , we characterize the minimax rate for the ‐risk in pointwise estimation over a class of anisotropic Hölder functions with regularity . For , our finding is that, having ordered the smoothness such that , the minimax rate depends on whether or . In the first case, this rate is , and in the second case, it is , where is an explicit exponent dependent on the dimension and , the harmonic mean of smoothness over the directions after excluding and , the smallest ones. We also demonstrate that kernel‐based estimators achieve the optimal minimax rate. Furthermore, we propose an adaptive procedure for both integrated and pointwise risk. In the two‐dimensional case, we show that kernel density estimators achieve the rate , which is optimal in the minimax sense. Finally we illustrate the validity of our theoretical findings by proposing numerical results.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141614551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of win, loss probabilities, and win ratio based on right‐censored event data 基于右删失事件数据的胜负概率和胜率估计
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-04 DOI: 10.1111/sjos.12734
Erik T. Parner, Morten Overgaard
The win ratio has in the recent decade gained popularity for analyzing prioritized multiple event data in clinical cohort studies, in particular within cardiovascular research. The literature on estimation of the win ratio using censored event data is however sparse. The methods that have been suggested have either an insufficient adjustment of the censoring or by assuming the the win and loss probabilities are proportional over time. The assumption of proportional win and loss probabilities will often in practice not be satisfied. In this paper, we present estimates for the win ratio, and win and loss probabilities, under independent right‐censoring and derive the asymptotic distribution of the estimates. The proposed win ratio estimate does not require the assumption of proportional win and loss probabilities. The small sample properties of the proposed method are studied in a simulation study showing that the variance formula is accurate even for small samples. The method is applied on two data sets.
近十年来,在临床队列研究中,特别是在心血管研究中,胜率在分析优先多事件数据方面越来越受欢迎。然而,使用删减事件数据估算胜率的文献却很少。已提出的方法要么没有对删减进行充分调整,要么假定随着时间的推移,赢率和输率成正比。实际上,胜负概率成比例的假设往往无法满足。在本文中,我们提出了在独立右删减条件下的胜率以及胜负概率的估计值,并推导出了估计值的渐近分布。所提出的胜率估计值不需要假设赢率和输率成正比。模拟研究对所提方法的小样本特性进行了研究,结果表明,即使是小样本,方差公式也是准确的。该方法应用于两个数据集。
{"title":"Estimation of win, loss probabilities, and win ratio based on right‐censored event data","authors":"Erik T. Parner, Morten Overgaard","doi":"10.1111/sjos.12734","DOIUrl":"https://doi.org/10.1111/sjos.12734","url":null,"abstract":"The win ratio has in the recent decade gained popularity for analyzing prioritized multiple event data in clinical cohort studies, in particular within cardiovascular research. The literature on estimation of the win ratio using censored event data is however sparse. The methods that have been suggested have either an insufficient adjustment of the censoring or by assuming the the win and loss probabilities are proportional over time. The assumption of proportional win and loss probabilities will often in practice not be satisfied. In this paper, we present estimates for the win ratio, and win and loss probabilities, under independent right‐censoring and derive the asymptotic distribution of the estimates. The proposed win ratio estimate does not require the assumption of proportional win and loss probabilities. The small sample properties of the proposed method are studied in a simulation study showing that the variance formula is accurate even for small samples. The method is applied on two data sets.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141548567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Commentary on “Pitfalls of amateur regression: The Dutch New Herring controversies” 关于 "业余回归的陷阱:荷兰新鲱鱼之争"
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-04 DOI: 10.1111/sjos.12741
Jan C. Van Ours, Ben Vollaard
{"title":"Commentary on “Pitfalls of amateur regression: The Dutch New Herring controversies”","authors":"Jan C. Van Ours, Ben Vollaard","doi":"10.1111/sjos.12741","DOIUrl":"https://doi.org/10.1111/sjos.12741","url":null,"abstract":"","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141548566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric estimation of densities on the hypersphere using a parametric guide 使用参数指南对超球面上的密度进行非参数估计
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-03 DOI: 10.1111/sjos.12737
María Alonso‐Pena, Gerda Claeskens, Irène Gijbels
Hyperspherical kernel density estimators (KDE), which use a parametric distribution as a guide, are studied in this paper. The main benefit is that these estimators improve the bias of nonguided kernel density estimators when the parametric guiding distribution is not too far from the true density, while preserving the variance. When using a von Mises‐Fisher density as guide, the proposal performs as well as the classical KDE, even when the guiding model is incorrect, and far from the true distribution. This benefit is particular for the hyperspherical setting given its compact support, and is in contrast to similar methods for real valued data. Moreover, we deal with the important issue of data‐driven selection of the smoothing parameter. Simulations and real data examples illustrate the finite‐sample performance of the proposed method, also in comparison with other recently proposed estimation methods.
本文研究了使用参数分布作为导向的超球核密度估计器(KDE)。其主要优点是,当参数指导分布与真实密度相差不大时,这些估计器可以改善非指导核密度估计器的偏差,同时保留方差。当使用 von Mises-Fisher 密度作为指导时,即使指导模型不正确且与真实分布相差甚远,该提案的性能也不亚于经典的 KDE。考虑到超球面的紧凑支持,这种优势在超球面设置中尤为明显,这与用于实值数据的类似方法形成了鲜明对比。此外,我们还处理了数据驱动的平滑参数选择这一重要问题。模拟和真实数据实例说明了所提方法的有限样本性能,同时也与最近提出的其他估计方法进行了比较。
{"title":"Nonparametric estimation of densities on the hypersphere using a parametric guide","authors":"María Alonso‐Pena, Gerda Claeskens, Irène Gijbels","doi":"10.1111/sjos.12737","DOIUrl":"https://doi.org/10.1111/sjos.12737","url":null,"abstract":"Hyperspherical kernel density estimators (KDE), which use a parametric distribution as a guide, are studied in this paper. The main benefit is that these estimators improve the bias of nonguided kernel density estimators when the parametric guiding distribution is not too far from the true density, while preserving the variance. When using a von Mises‐Fisher density as guide, the proposal performs as well as the classical KDE, even when the guiding model is incorrect, and far from the true distribution. This benefit is particular for the hyperspherical setting given its compact support, and is in contrast to similar methods for real valued data. Moreover, we deal with the important issue of data‐driven selection of the smoothing parameter. Simulations and real data examples illustrate the finite‐sample performance of the proposed method, also in comparison with other recently proposed estimation methods.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141548569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lancaster correlation: A new dependence measure linked to maximum correlation 兰开斯特相关性:与最大相关性相关联的一种新的依赖性测量方法
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-03 DOI: 10.1111/sjos.12733
Hajo Holzmann, Bernhard Klar
We suggest novel correlation coefficients which equal the maximum correlation for a class of bivariate Lancaster distributions while being only slightly smaller than maximum correlation for a variety of further bivariate distributions. In contrast to maximum correlation, however, our correlation coefficients allow for rank and moment‐based estimators which are simple to compute and have tractable asymptotic distributions. Confidence intervals resulting from these asymptotic approximations and the covariance bootstrap show good finite‐sample coverage. In a simulation, the power of asymptotic as well as permutation tests for independence based on our correlation measures compares favorably with competing methods based on distance correlation or rank coefficients for functional dependence, among others. Moreover, for the bivariate normal distribution, our correlation coefficients equal the absolute value of the Pearson correlation, an attractive feature for practitioners which is not shared by various competitors. We illustrate the practical usefulness of our methods in applications to two real data sets.
我们提出了一些新的相关系数,这些相关系数等于兰开斯特二元分布的最大相关系数,而对于其他各种二元分布,相关系数仅略低于最大相关系数。然而,与最大相关性不同的是,我们的相关系数允许使用基于秩和矩的估计值,这些估计值易于计算,并且具有可控的渐近分布。由这些渐近近似值和协方差自举法得出的置信区间显示出良好的有限样本覆盖率。在一项模拟实验中,基于我们的相关性度量的渐近检验和置换检验的独立性,与基于距离相关性或秩系数的函数依赖性等竞争方法相比,都具有更强的说服力。此外,对于二元正态分布,我们的相关系数等于皮尔逊相关性的绝对值,这对从业人员来说是一个很有吸引力的特点,而这是其他竞争者所不具备的。我们在两个真实数据集的应用中说明了我们的方法的实用性。
{"title":"Lancaster correlation: A new dependence measure linked to maximum correlation","authors":"Hajo Holzmann, Bernhard Klar","doi":"10.1111/sjos.12733","DOIUrl":"https://doi.org/10.1111/sjos.12733","url":null,"abstract":"We suggest novel correlation coefficients which equal the maximum correlation for a class of bivariate Lancaster distributions while being only slightly smaller than maximum correlation for a variety of further bivariate distributions. In contrast to maximum correlation, however, our correlation coefficients allow for rank and moment‐based estimators which are simple to compute and have tractable asymptotic distributions. Confidence intervals resulting from these asymptotic approximations and the covariance bootstrap show good finite‐sample coverage. In a simulation, the power of asymptotic as well as permutation tests for independence based on our correlation measures compares favorably with competing methods based on distance correlation or rank coefficients for functional dependence, among others. Moreover, for the bivariate normal distribution, our correlation coefficients equal the absolute value of the Pearson correlation, an attractive feature for practitioners which is not shared by various competitors. We illustrate the practical usefulness of our methods in applications to two real data sets.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141548568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validation of point process predictions with proper scoring rules 用适当的评分规则验证点流程预测
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-02 DOI: 10.1111/sjos.12736
Claudio Heinrich‐Mertsching, Thordis L. Thorarinsdottir, Peter Guttorp, Max Schneider
We introduce a class of proper scoring rules for evaluating spatial point process forecasts based on summary statistics. These scoring rules rely on Monte Carlo approximations of expectations and can therefore easily be evaluated for any point process model that can be simulated. In this regard, they are more flexible than the commonly used logarithmic score and other existing proper scores for point process predictions. The scoring rules allow for evaluating the calibration of a model to specific aspects of a point process, such as its spatial distribution or tendency toward clustering. Using simulations, we analyze the sensitivity of our scoring rules to different aspects of the forecasts and compare it to the logarithmic score. Applications to earthquake occurrences in northern California, United States and the spatial distribution of Pacific silver firs in Findley Lake Reserve in Washington highlight the usefulness of our scores for scientific model selection.
我们引入了一类适当的评分规则,用于评估基于汇总统计的空间点过程预测。这些评分规则依赖于蒙特卡罗期望近似值,因此可以很容易地对任何可以模拟的点过程模型进行评估。在这方面,它们比常用的对数评分和其他现有的点过程预测适当评分更加灵活。评分规则允许根据点过程的特定方面(如空间分布或聚类趋势)来评估模型的校准。通过模拟,我们分析了评分规则对预测不同方面的敏感性,并与对数评分进行了比较。在美国加利福尼亚州北部发生的地震和华盛顿州芬德利湖保护区太平洋银杉的空间分布中的应用,凸显了我们的评分对科学模型选择的实用性。
{"title":"Validation of point process predictions with proper scoring rules","authors":"Claudio Heinrich‐Mertsching, Thordis L. Thorarinsdottir, Peter Guttorp, Max Schneider","doi":"10.1111/sjos.12736","DOIUrl":"https://doi.org/10.1111/sjos.12736","url":null,"abstract":"We introduce a class of proper scoring rules for evaluating spatial point process forecasts based on summary statistics. These scoring rules rely on Monte Carlo approximations of expectations and can therefore easily be evaluated for any point process model that can be simulated. In this regard, they are more flexible than the commonly used logarithmic score and other existing proper scores for point process predictions. The scoring rules allow for evaluating the calibration of a model to specific aspects of a point process, such as its spatial distribution or tendency toward clustering. Using simulations, we analyze the sensitivity of our scoring rules to different aspects of the forecasts and compare it to the logarithmic score. Applications to earthquake occurrences in northern California, United States and the spatial distribution of Pacific silver firs in Findley Lake Reserve in Washington highlight the usefulness of our scores for scientific model selection.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141504887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Scandinavian Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1