首页 > 最新文献

Annals of the Institute of Statistical Mathematics最新文献

英文 中文
Asymptotic theory in network models with covariates and a growing number of node parameters 具有协变和增长节点参数的网络模型的渐近理论
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-09-02 DOI: 10.1007/s10463-022-00848-0
Qiuping Wang, Yuan Zhang, Ting Yan

We propose a general model that jointly characterizes degree heterogeneity and homophily in weighted, undirected networks. We present a moment estimation method using node degrees and homophily statistics. We establish consistency and asymptotic normality of our estimator using novel analysis. We apply our general framework to three applications, including both exponential family and non-exponential family models. Comprehensive numerical studies and a data example also demonstrate the usefulness of our method.

我们提出了一个通用模型,共同表征程度异质性和同质在加权,无向网络。提出了一种基于节点度和同态统计量的矩估计方法。我们用新的分析方法建立了估计量的相合性和渐近正态性。我们将我们的一般框架应用于三种应用,包括指数族和非指数族模型。综合数值研究和一个数据算例也证明了本文方法的有效性。
{"title":"Asymptotic theory in network models with covariates and a growing number of node parameters","authors":"Qiuping Wang,&nbsp;Yuan Zhang,&nbsp;Ting Yan","doi":"10.1007/s10463-022-00848-0","DOIUrl":"10.1007/s10463-022-00848-0","url":null,"abstract":"<div><p>We propose a general model that jointly characterizes degree heterogeneity and homophily in weighted, undirected networks. We present a moment estimation method using node degrees and homophily statistics. We establish consistency and asymptotic normality of our estimator using novel analysis. We apply our general framework to three applications, including both exponential family and non-exponential family models. Comprehensive numerical studies and a data example also demonstrate the usefulness of our method.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48524750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Quantitative robustness of instance ranking problems 实例排序问题的定量鲁棒性
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-08-30 DOI: 10.1007/s10463-022-00847-1
Tino Werner

Instance ranking problems intend to recover the ordering of the instances in a data set with applications in scientific, social and financial contexts. In this work, we concentrate on the global robustness of parametric instance ranking problems in terms of the breakdown point which measures the fraction of samples that need to be perturbed in order to let the estimator take unreasonable values. Existing breakdown point notions do not cover ranking problems so far. We propose to define a breakdown of the estimator as a sign-reversal of all components which causes the predicted ranking to be potentially completely inverted; therefore, we call it the order-inversal breakdown point (OIBDP). We will study the OIBDP, based on a linear model, for several different carefully distinguished ranking problems and provide least favorable outlier configurations, characterizations of the order-inversal breakdown point and sharp asymptotic upper bounds. We also compute empirical OIBDPs.

实例排序问题旨在恢复在科学、社会和金融环境中应用的数据集中实例的顺序。在这项工作中,我们专注于参数实例排序问题在击穿点方面的全局鲁棒性,击穿点测量需要被扰动的样本的比例,以便让估计器取不合理的值。到目前为止,现有的分解点概念还没有涵盖排名问题。我们建议将估计量的分解定义为所有成分的符号反转,这导致预测的排名可能完全反转;因此,我们称之为序逆击穿点(OIBDP)。我们将研究基于线性模型的OIBDP,用于几个不同的仔细区分排序问题,并提供最不利的离群值配置,序逆击穿点的特征和锐渐近上界。我们还计算了经验oibdp。
{"title":"Quantitative robustness of instance ranking problems","authors":"Tino Werner","doi":"10.1007/s10463-022-00847-1","DOIUrl":"10.1007/s10463-022-00847-1","url":null,"abstract":"<div><p>Instance ranking problems intend to recover the ordering of the instances in a data set with applications in scientific, social and financial contexts. In this work, we concentrate on the global robustness of parametric instance ranking problems in terms of the breakdown point which measures the fraction of samples that need to be perturbed in order to let the estimator take unreasonable values. Existing breakdown point notions do not cover ranking problems so far. We propose to define a breakdown of the estimator as a sign-reversal of all components which causes the predicted ranking to be potentially completely inverted; therefore, we call it the order-inversal breakdown point (OIBDP). We will study the OIBDP, based on a linear model, for several different carefully distinguished ranking problems and provide least favorable outlier configurations, characterizations of the order-inversal breakdown point and sharp asymptotic upper bounds. We also compute empirical OIBDPs.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10463-022-00847-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42157643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Forward variable selection for ultra-high dimensional quantile regression models 超高维分位数回归模型的正向变量选择
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-08-29 DOI: 10.1007/s10463-022-00849-z
Toshio Honda, Chien-Tong Lin

We propose forward variable selection procedures with a stopping rule for feature screening in ultra-high-dimensional quantile regression models. For such very large models, penalized methods do not work and some preliminary feature screening is necessary. We demonstrate the desirable theoretical properties of our forward procedures by taking care of uniformity w.r.t. subsets of covariates properly. The necessity of such uniformity is often overlooked in the literature. Our stopping rule suitably incorporates the model size at each stage. We also present the results of simulation studies and a real data application to show their good finite sample performances.

我们提出了具有停止规则的变量选择程序,用于超高维分位数回归模型的特征筛选。对于这种非常大的模型,惩罚方法不起作用,有必要进行一些初步的特征筛选。通过适当地处理协变量子集的均匀性,我们证明了我们的正演过程的理想的理论性质。这种一致性的必要性在文献中常常被忽视。我们的停止规则适当地结合了每个阶段的模型大小。我们还给出了仿真研究和实际数据应用的结果,以证明它们具有良好的有限样本性能。
{"title":"Forward variable selection for ultra-high dimensional quantile regression models","authors":"Toshio Honda,&nbsp;Chien-Tong Lin","doi":"10.1007/s10463-022-00849-z","DOIUrl":"10.1007/s10463-022-00849-z","url":null,"abstract":"<div><p>We propose forward variable selection procedures with a stopping rule for feature screening in ultra-high-dimensional quantile regression models. For such very large models, penalized methods do not work and some preliminary feature screening is necessary. We demonstrate the desirable theoretical properties of our forward procedures by taking care of uniformity w.r.t. subsets of covariates properly. The necessity of such uniformity is often overlooked in the literature. Our stopping rule suitably incorporates the model size at each stage. We also present the results of simulation studies and a real data application to show their good finite sample performances.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10463-022-00849-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41794639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Conditional selective inference for robust regression and outlier detection using piecewise-linear homotopy continuation 稳健回归的条件选择推理和分段线性同伦延拓的离群值检测
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-08-27 DOI: 10.1007/s10463-022-00846-2
Toshiaki Tsukurimichi, Yu Inatsu, Vo Nguyen Le Duy, Ichiro Takeuchi

In this paper, we consider conditional selective inference (SI) for a linear model estimated after outliers are removed from the data. To apply the conditional SI framework, it is necessary to characterize the events of how the robust method identifies outliers. Unfortunately, the existing conditional SIs cannot be directly applied to our problem because they are applicable to the case where the selection events can be represented by linear or quadratic constraints. We propose a conditional SI method for popular robust regressions such as least-absolute-deviation regression and Huber regression by introducing a new computational method using a convex optimization technique called homotopy method. We show that the proposed conditional SI method is applicable to a wide class of robust regression and outlier detection methods and has good empirical performance on both synthetic data and real data experiments.

在本文中,我们考虑条件选择推理(SI)的线性模型估计后,从数据中去除异常值。为了应用条件SI框架,有必要描述鲁棒方法如何识别异常值的事件。不幸的是,现有的条件si不能直接应用于我们的问题,因为它们适用于选择事件可以用线性或二次约束表示的情况。我们通过引入一种新的计算方法,使用一种称为同伦方法的凸优化技术,提出了一种适用于最小绝对偏差回归和Huber回归等常用鲁棒回归的条件SI方法。我们的研究表明,所提出的条件SI方法适用于广泛的鲁棒回归和离群值检测方法,并且在合成数据和实际数据实验中都具有良好的经验性能。
{"title":"Conditional selective inference for robust regression and outlier detection using piecewise-linear homotopy continuation","authors":"Toshiaki Tsukurimichi,&nbsp;Yu Inatsu,&nbsp;Vo Nguyen Le Duy,&nbsp;Ichiro Takeuchi","doi":"10.1007/s10463-022-00846-2","DOIUrl":"10.1007/s10463-022-00846-2","url":null,"abstract":"<div><p>In this paper, we consider conditional selective inference (SI) for a linear model estimated after outliers are removed from the data. To apply the conditional SI framework, it is necessary to characterize the events of how the robust method identifies outliers. Unfortunately, the existing conditional SIs cannot be directly applied to our problem because they are applicable to the case where the selection events can be represented by linear or quadratic constraints. We propose a conditional SI method for popular robust regressions such as least-absolute-deviation regression and Huber regression by introducing a new computational method using a convex optimization technique called homotopy method. We show that the proposed conditional SI method is applicable to a wide class of robust regression and outlier detection methods and has good empirical performance on both synthetic data and real data experiments.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46257738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Approximating symmetrized estimators of scatter via balanced incomplete U-statistics 用平衡不完全U-统计量逼近散射的对称估计
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-08-19 DOI: 10.1007/s10463-023-00879-1
L. Duembgen, K. Nordhausen
{"title":"Approximating symmetrized estimators of scatter via balanced incomplete U-statistics","authors":"L. Duembgen, K. Nordhausen","doi":"10.1007/s10463-023-00879-1","DOIUrl":"https://doi.org/10.1007/s10463-023-00879-1","url":null,"abstract":"","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45737931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible asymmetric multivariate distributions based on two-piece univariate distributions 基于两件式单变量分布的柔性非对称多变量分布
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-08-02 DOI: 10.1007/s10463-022-00842-6
Jonas Baillien, Irène Gijbels, Anneleen Verhasselt

Classical symmetric distributions like the Gaussian are widely used. However, in reality data often display a lack of symmetry. Multiple distributions, grouped under the name “skewed distributions”, have been developed to specifically cope with asymmetric data. In this paper, we present a broad family of flexible multivariate skewed distributions for which statistical inference is a feasible task. The studied family of multivariate skewed distributions is derived by taking affine combinations of independent univariate distributions. These are members of a flexible family of univariate asymmetric distributions and are an important basis for achieving statistical inference. Besides basic properties of the proposed distributions, also statistical inference based on a maximum likelihood approach is presented. We show that under mild conditions, weak consistency and asymptotic normality of the maximum likelihood estimators hold. These results are supported by a simulation study confirming the developed theoretical results, and some data examples to illustrate practical applicability.

像高斯分布这样的经典对称分布被广泛使用。然而,在现实中,数据往往显示出缺乏对称性。以“偏态分布”命名的多重分布已经被开发出来,专门用于处理非对称数据。在本文中,我们提出了一大类灵活的多元偏态分布,其中统计推断是一项可行的任务。所研究的多元偏态分布族是由独立的单变量分布的仿射组合导出的。这些是灵活的单变量不对称分布家族的成员,是实现统计推断的重要基础。除了提出的分布的基本性质外,还提出了基于极大似然方法的统计推断。我们证明了在温和条件下,极大似然估计的弱相合性和渐近正态性成立。这些结果得到了仿真研究的支持,证实了所建立的理论结果,并通过一些数据实例说明了实际的适用性。
{"title":"Flexible asymmetric multivariate distributions based on two-piece univariate distributions","authors":"Jonas Baillien,&nbsp;Irène Gijbels,&nbsp;Anneleen Verhasselt","doi":"10.1007/s10463-022-00842-6","DOIUrl":"10.1007/s10463-022-00842-6","url":null,"abstract":"<div><p>Classical symmetric distributions like the Gaussian are widely used. However, in reality data often display a lack of symmetry. Multiple distributions, grouped under the name “skewed distributions”, have been developed to specifically cope with asymmetric data. In this paper, we present a broad family of flexible multivariate skewed distributions for which statistical inference is a feasible task. The studied family of multivariate skewed distributions is derived by taking affine combinations of independent univariate distributions. These are members of a flexible family of univariate asymmetric distributions and are an important basis for achieving statistical inference. Besides basic properties of the proposed distributions, also statistical inference based on a maximum likelihood approach is presented. We show that under mild conditions, weak consistency and asymptotic normality of the maximum likelihood estimators hold. These results are supported by a simulation study confirming the developed theoretical results, and some data examples to illustrate practical applicability.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48406413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the choice of the optimal single order statistic in quantile estimation 分位数估计中最优单阶统计量的选择
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-08-02 DOI: 10.1007/s10463-022-00845-3
Mariusz Bieniek, Luiza Pańczyk

We study the classical statistical problem of the estimation of quantiles by order statistics of the random sample. For fixed sample size, we determine the single order statistic which is the optimal estimator of a quantile of given order. We propose a totally new approach to the problem, since our optimality criterion is based on the use of nonparametric sharp upper and lower bounds on the bias of the estimation. First, we determine the explicit analytic expressions for the bounds, and then, we choose the order statistic for which the upper and lower bound are simultaneously as close to 0 as possible. The paper contains rigorously proved theoretical results which can be easily implemented in practise. This is also illustrated with numerical examples.

研究了随机样本的有序统计量估计分位数的经典统计问题。对于固定样本量,我们确定了单阶统计量,它是给定阶数的分位数的最优估计量。我们提出了一种全新的方法来解决这个问题,因为我们的最优性准则是基于使用估计偏差的非参数尖锐上界和下界。首先,我们确定了边界的显式解析表达式,然后,我们选择了上界和下界同时尽可能接近0的阶统计量。本文包含了经过严格验证的理论结果,易于在实践中实现。并以数值算例加以说明。
{"title":"On the choice of the optimal single order statistic in quantile estimation","authors":"Mariusz Bieniek,&nbsp;Luiza Pańczyk","doi":"10.1007/s10463-022-00845-3","DOIUrl":"10.1007/s10463-022-00845-3","url":null,"abstract":"<div><p>We study the classical statistical problem of the estimation of quantiles by order statistics of the random sample. For fixed sample size, we determine the single order statistic which is the optimal estimator of a quantile of given order. We propose a totally new approach to the problem, since our optimality criterion is based on the use of nonparametric sharp upper and lower bounds on the bias of the estimation. First, we determine the explicit analytic expressions for the bounds, and then, we choose the order statistic for which the upper and lower bound are simultaneously as close to 0 as possible. The paper contains rigorously proved theoretical results which can be easily implemented in practise. This is also illustrated with numerical examples.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42659546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selective inference after feature selection via multiscale bootstrap 多尺度自举特征选择后的选择性推理
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-07-30 DOI: 10.1007/s10463-022-00838-2
Yoshikazu Terada, Hidetoshi Shimodaira

It is common to show the confidence intervals or p-values of selected features, or predictor variables in regression, but they often involve selection bias. The selective inference approach solves this bias by conditioning on the selection event. Most existing studies of selective inference consider a specific algorithm, such as Lasso, for feature selection, and thus they have difficulties in handling more complicated algorithms. Moreover, existing studies often consider unnecessarily restrictive events, leading to over-conditioning and lower statistical power. Our novel and widely applicable resampling method via multiscale bootstrap addresses these issues to compute an approximately unbiased selective p-value for the selected features. As a simplification of the proposed method, we also develop a simpler method via the classical bootstrap. We prove that the p-value computed by our multiscale bootstrap method is more accurate than the classical bootstrap method. Furthermore, numerical experiments demonstrate that our algorithm works well even for more complicated feature selection methods such as non-convex regularization.

在回归中显示所选特征或预测变量的置信区间或p值是很常见的,但它们通常涉及选择偏差。选择性推理方法通过对选择事件进行条件反射来解决这种偏差。大多数现有的选择性推理研究都是采用特定的算法(如Lasso)来进行特征选择,因此在处理更复杂的算法时存在困难。此外,现有的研究经常考虑不必要的限制性事件,导致过度调节和较低的统计能力。我们新颖且广泛适用的多尺度自举重采样方法解决了这些问题,为所选特征计算近似无偏选择性p值。作为该方法的简化,我们还开发了一种更简单的方法,即经典自举法。证明了用多尺度自举法计算的p值比经典自举法计算的p值更精确。此外,数值实验表明,即使对于非凸正则化等更复杂的特征选择方法,我们的算法也能很好地工作。
{"title":"Selective inference after feature selection via multiscale bootstrap","authors":"Yoshikazu Terada,&nbsp;Hidetoshi Shimodaira","doi":"10.1007/s10463-022-00838-2","DOIUrl":"10.1007/s10463-022-00838-2","url":null,"abstract":"<div><p>It is common to show the confidence intervals or <i>p</i>-values of selected features, or predictor variables in regression, but they often involve selection bias. The selective inference approach solves this bias by conditioning on the selection event. Most existing studies of selective inference consider a specific algorithm, such as Lasso, for feature selection, and thus they have difficulties in handling more complicated algorithms. Moreover, existing studies often consider unnecessarily restrictive events, leading to over-conditioning and lower statistical power. Our novel and widely applicable resampling method via multiscale bootstrap addresses these issues to compute an approximately unbiased selective <i>p</i>-value for the selected features. As a simplification of the proposed method, we also develop a simpler method via the classical bootstrap. We prove that the <i>p</i>-value computed by our multiscale bootstrap method is more accurate than the classical bootstrap method. Furthermore, numerical experiments demonstrate that our algorithm works well even for more complicated feature selection methods such as non-convex regularization.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43509814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Inference using an exact distribution of test statistic for random-effects meta-analysis 随机效应荟萃分析使用检验统计量的精确分布进行推断
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-07-26 DOI: 10.1007/s10463-022-00844-4
Keisuke Hanada, Tomoyuki Sugimoto

Random-effects meta-analysis serves to integrate the results of multiple studies with methods such as moment estimation and likelihood estimation duly proposed. These existing methods are based on asymptotic normality with respect to the number of studies. However, the test and interval estimation deviate from the nominal significance level when integrating a small number of studies. Although a method for constructing more conservative intervals has been recently proposed, the exact distribution of test statistic for the overall treatment effect is not well known. In this paper, we provide an almost-exact distribution of the test statistic in random-effects meta-analysis and propose the test and interval estimation using the almost-exact distribution. Simulations demonstrate the accuracy of estimation and application to existing meta-analysis using the method proposed here. With known variance parameters, the estimation performance using the almost-exact distribution always achieves the nominal significance level regardless of the number of studies and heterogeneity. We also propose some methods to construct a conservative interval estimation, even when the variance parameters are unknown, and present their performances via simulation and an application to Alzheimer’s disease meta-analysis.

随机效应荟萃分析将多个研究的结果与适当提出的矩估计和似然估计等方法相结合。这些现有的方法是基于研究数量的渐近正态性。然而,当整合少量研究时,检验和区间估计偏离名义显著性水平。虽然最近提出了一种构造更保守区间的方法,但总体处理效果的检验统计量的确切分布尚不清楚。在本文中,我们提供了随机效应荟萃分析中检验统计量的几乎精确分布,并提出了使用几乎精确分布的检验和区间估计。仿真结果证明了估计的准确性以及本文提出的方法在现有元分析中的应用。在方差参数已知的情况下,无论研究数量和异质性如何,使用几乎精确分布的估计性能总是达到名义显著性水平。我们还提出了一些构建保守区间估计的方法,即使方差参数是未知的,并通过模拟和应用于阿尔茨海默病的meta分析来展示它们的性能。
{"title":"Inference using an exact distribution of test statistic for random-effects meta-analysis","authors":"Keisuke Hanada,&nbsp;Tomoyuki Sugimoto","doi":"10.1007/s10463-022-00844-4","DOIUrl":"10.1007/s10463-022-00844-4","url":null,"abstract":"<div><p>Random-effects meta-analysis serves to integrate the results of multiple studies with methods such as moment estimation and likelihood estimation duly proposed. These existing methods are based on asymptotic normality with respect to the number of studies. However, the test and interval estimation deviate from the nominal significance level when integrating a small number of studies. Although a method for constructing more conservative intervals has been recently proposed, the exact distribution of test statistic for the overall treatment effect is not well known. In this paper, we provide an almost-exact distribution of the test statistic in random-effects meta-analysis and propose the test and interval estimation using the almost-exact distribution. Simulations demonstrate the accuracy of estimation and application to existing meta-analysis using the method proposed here. With known variance parameters, the estimation performance using the almost-exact distribution always achieves the nominal significance level regardless of the number of studies and heterogeneity. We also propose some methods to construct a conservative interval estimation, even when the variance parameters are unknown, and present their performances via simulation and an application to Alzheimer’s disease meta-analysis.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41358458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Group least squares regression for linear models with strongly correlated predictor variables 预测变量强相关线性模型的群最小二乘回归
IF 1 4区 数学 Q2 Mathematics Pub Date : 2022-07-26 DOI: 10.1007/s10463-022-00841-7
Min Tsao

Traditionally, the main focus of the least squares regression is to study the effects of individual predictor variables, but strongly correlated variables generate multicollinearity which makes it difficult to study their effects. To resolve the multicollinearity issue without abandoning the least squares regression, for situations where predictor variables are in groups with strong within-group correlations but weak between-group correlations, we propose to study the effects of the groups with a group approach to the least squares regression. Using an all positive correlations arrangement of the strongly correlated variables, we first characterize group effects that are meaningful and can be accurately estimated. We then discuss the group approach to the least squares regression through a simulation study and demonstrate that it is an effective method for handling multicollinearity. We also address a common misconception about prediction accuracy of the least squares estimated model.

传统上,最小二乘回归的主要重点是研究单个预测变量的影响,但强相关变量产生多重共线性,使得研究它们的影响变得困难。为了在不放弃最小二乘回归的情况下解决多重共线性问题,对于预测变量在组内相关性强而组间相关性弱的情况,我们建议使用最小二乘回归的组方法来研究组的影响。利用强相关变量的全正相关排列,我们首先描述了有意义且可以准确估计的群体效应。然后,我们通过模拟研究讨论了最小二乘回归的群方法,并证明了它是处理多重共线性的有效方法。我们还解决了关于最小二乘估计模型预测精度的常见误解。
{"title":"Group least squares regression for linear models with strongly correlated predictor variables","authors":"Min Tsao","doi":"10.1007/s10463-022-00841-7","DOIUrl":"10.1007/s10463-022-00841-7","url":null,"abstract":"<div><p>Traditionally, the main focus of the least squares regression is to study the effects of individual predictor variables, but strongly correlated variables generate multicollinearity which makes it difficult to study their effects. To resolve the multicollinearity issue without abandoning the least squares regression, for situations where predictor variables are in groups with strong within-group correlations but weak between-group correlations, we propose to study the effects of the groups with a group approach to the least squares regression. Using an all positive correlations arrangement of the strongly correlated variables, we first characterize group effects that are meaningful and can be accurately estimated. We then discuss the group approach to the least squares regression through a simulation study and demonstrate that it is an effective method for handling multicollinearity. We also address a common misconception about prediction accuracy of the least squares estimated model.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46133684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annals of the Institute of Statistical Mathematics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1