首页 > 最新文献

Scandinavian Journal of Statistics最新文献

英文 中文
Inference for all variants of the multivariate coefficient of variation in factorial designs 因子设计中多元变异系数所有变量的推理
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-06-26 DOI: 10.1111/sjos.12740
Marc Ditzhaus, Łukasz Smaga
The multivariate coefficient of variation (MCV) is an attractive and easy‐to‐interpret effect size for the dispersion in multivariate data. Recently, the first inference methods for the MCV were proposed for general factorial designs. However, the inference methods are primarily derived for one special MCV variant while there are several reasonable proposals. Moreover, when rejecting a global null hypothesis, a more in‐depth analysis is of interest to find the significant contrasts of MCV. This paper concerns extending the nonparametric permutation procedure to the other MCV variants and a max‐type test for post hoc analysis. To improve the small sample performance of the latter, we suggest a novel bootstrap strategy and prove its asymptotic validity. The actual performance of all proposed tests is compared in an extensive simulation study and illustrated by real data analysis. All methods are implemented in the R package GFDmcv, available on CRAN.
多变量变异系数(MCV)是多变量数据离散度的一种有吸引力且易于解释的效应大小。最近,首次提出了针对一般因子设计的 MCV 推断方法。然而,这些推断方法主要是针对一种特殊的 MCV 变体得出的,而目前有几种合理的建议。此外,在拒绝全局零假设时,更深入的分析对找到 MCV 的显著对比很有意义。本文涉及将非参数置换程序扩展到其他 MCV 变体,以及用于事后分析的最大类型检验。为了提高后者的小样本性能,我们提出了一种新的引导策略,并证明了其渐近有效性。我们通过广泛的模拟研究比较了所有建议检验的实际性能,并通过实际数据分析进行了说明。所有方法都在 R 软件包 GFDmcv 中实现,可在 CRAN 上下载。
{"title":"Inference for all variants of the multivariate coefficient of variation in factorial designs","authors":"Marc Ditzhaus, Łukasz Smaga","doi":"10.1111/sjos.12740","DOIUrl":"https://doi.org/10.1111/sjos.12740","url":null,"abstract":"The multivariate coefficient of variation (MCV) is an attractive and easy‐to‐interpret effect size for the dispersion in multivariate data. Recently, the first inference methods for the MCV were proposed for general factorial designs. However, the inference methods are primarily derived for one special MCV variant while there are several reasonable proposals. Moreover, when rejecting a global null hypothesis, a more in‐depth analysis is of interest to find the significant contrasts of MCV. This paper concerns extending the nonparametric permutation procedure to the other MCV variants and a max‐type test for post hoc analysis. To improve the small sample performance of the latter, we suggest a novel bootstrap strategy and prove its asymptotic validity. The actual performance of all proposed tests is compared in an extensive simulation study and illustrated by real data analysis. All methods are implemented in the R package GFDmcv, available on CRAN.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141504888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A conversation with Nils Lid Hjort 与尼尔斯-利德-希约尔特的对话
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-06-20 DOI: 10.1111/sjos.12732
Ørnulf Borgan, Ingrid K. Glad
Professor (now emeritus) Nils Lid Hjort has through more than four decades been one of the most original and productive statisticians in Norway, contributing to a wide range of topics such as survival analysis, Bayesian nonparametrics, empirical likelihood, density estimation, focused inference, model selection, and confidence distributions. This conversation, which took place at the University of Oslo in December 2023, sheds light on how Nils Hjort's curious and open mind, coupled with a deep understanding, has enabled him to seamlessly navigate between different fields of statistics and its applications. Our aim is to encourage the statistics community to always be on the lookout for unexpected connections in statistical science and to embrace unexpected encounters with fellow statisticians from around the world.
尼尔斯-利德-希约尔特教授(现为名誉教授)四十多年来一直是挪威最具原创性和最有成就的统计学家之一,在生存分析、贝叶斯非参数、经验似然法、密度估计、重点推断、模型选择和置信度分布等广泛领域做出了贡献。这次对话于 2023 年 12 月在奥斯陆大学举行,它揭示了 Nils Hjort 如何以其好奇、开放的心态和深刻的理解力,在统计学及其应用的不同领域之间游刃有余。我们的目标是鼓励统计界始终关注统计科学中的意外联系,并与来自世界各地的统计学家同仁不期而遇。
{"title":"A conversation with Nils Lid Hjort","authors":"Ørnulf Borgan, Ingrid K. Glad","doi":"10.1111/sjos.12732","DOIUrl":"https://doi.org/10.1111/sjos.12732","url":null,"abstract":"Professor (now emeritus) Nils Lid Hjort has through more than four decades been one of the most original and productive statisticians in Norway, contributing to a wide range of topics such as survival analysis, Bayesian nonparametrics, empirical likelihood, density estimation, focused inference, model selection, and confidence distributions. This conversation, which took place at the University of Oslo in December 2023, sheds light on how Nils Hjort's curious and open mind, coupled with a deep understanding, has enabled him to seamlessly navigate between different fields of statistics and its applications. Our aim is to encourage the statistics community to always be on the lookout for unexpected connections in statistical science and to embrace unexpected encounters with fellow statisticians from around the world.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141504889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Double debiased transfer learning for adaptive Huber regression 自适应胡贝尔回归的双偏移转移学习
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-05-21 DOI: 10.1111/sjos.12723
Ziyuan Wang, Lei Wang, Heng Lian
Through exploiting information from the source data to improve the fit performance on the target data, transfer learning estimations for high‐dimensional linear regression models have drawn much attention recently, but few studies focus on statistical inference and robust learning in the presence of heavy‐tailed/asymmetric errors. Using adaptive Huber regression (AHR) to achieve the bias and robustness tradeoff, in this paper we propose a robust transfer learning algorithm with high‐dimensional covariates, then construct valid confidence intervals and hypothesis tests based on the debiased lasso approach. When the transferable sources are known, a two‐step ‐penalized transfer AHR estimator is firstly proposed and the error bounds are established. To correct the biases caused by the lasso penalty, a unified debiasing framework based on the decorrelated score equations is considered to establish asymptotic normality of the debiased lasso transfer AHR estimator. Confidence intervals and hypothesis tests for each component can be constructed. When the transferable sources are unknown, a data‐driven source detection algorithm is proposed with theoretical guarantee. Numerical studies verify the performance of our proposed estimator and confidence intervals, and an application to Genotype‐Tissue Expression data is also presented.
通过利用源数据的信息来提高目标数据的拟合性能,高维线性回归模型的迁移学习估计近来备受关注,但很少有研究关注重尾/非对称误差情况下的统计推断和稳健学习。本文利用自适应胡贝尔回归(AHR)来实现偏差和稳健性的权衡,提出了一种具有高维协变量的稳健迁移学习算法,然后基于去偏套索方法构建了有效的置信区间和假设检验。在已知可转移源的情况下,首先提出了一个两步瓣化转移 AHR 估计器,并建立了误差边界。为了纠正 lasso 惩罚造成的偏差,考虑了基于装饰相关得分方程的统一除杂框架,以建立除杂 lasso 转移 AHR 估计器的渐近正态性。可以为每个组成部分构建置信区间和假设检验。当可转移源未知时,提出了一种具有理论保证的数据驱动源检测算法。数值研究验证了我们提出的估计器和置信区间的性能,并介绍了基因型-组织表达数据的应用。
{"title":"Double debiased transfer learning for adaptive Huber regression","authors":"Ziyuan Wang, Lei Wang, Heng Lian","doi":"10.1111/sjos.12723","DOIUrl":"https://doi.org/10.1111/sjos.12723","url":null,"abstract":"Through exploiting information from the source data to improve the fit performance on the target data, transfer learning estimations for high‐dimensional linear regression models have drawn much attention recently, but few studies focus on statistical inference and robust learning in the presence of heavy‐tailed/asymmetric errors. Using adaptive Huber regression (AHR) to achieve the bias and robustness tradeoff, in this paper we propose a robust transfer learning algorithm with high‐dimensional covariates, then construct valid confidence intervals and hypothesis tests based on the debiased lasso approach. When the transferable sources are known, a two‐step ‐penalized transfer AHR estimator is firstly proposed and the error bounds are established. To correct the biases caused by the lasso penalty, a unified debiasing framework based on the decorrelated score equations is considered to establish asymptotic normality of the debiased lasso transfer AHR estimator. Confidence intervals and hypothesis tests for each component can be constructed. When the transferable sources are unknown, a data‐driven source detection algorithm is proposed with theoretical guarantee. Numerical studies verify the performance of our proposed estimator and confidence intervals, and an application to Genotype‐Tissue Expression data is also presented.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141115316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gradient‐based approach to sufficient dimension reduction with functional or longitudinal covariates 基于梯度的方法,利用功能或纵向协变量充分降维
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-05-19 DOI: 10.1111/sjos.12724
Ming-Yueh Huang, Kwun Chuen Gary Chan
In this paper, we focus on the sufficient dimension reduction problem in regression analysis with real‐valued response and functional or longitudinal covariates. We propose a new method based on gradients of the conditional distribution function to estimate the sufficient dimension reduction subspace. While existing inverse‐regression‐type methods relies on a linearity condition, our method is based on the gradient of conditional distribution function and its validity only requires smoothness conditions on the population parameters. Practically, the proposed estimator can be obtained by standard algorithm of functional principal component analysis. The proposed method is demonstrated through extensive simulations and two empirical examples.
在本文中,我们重点研究了实值响应与函数或纵向协变量回归分析中的充分降维问题。我们提出了一种基于条件分布函数梯度来估计充分降维子空间的新方法。现有的反回归类型方法依赖于线性条件,而我们的方法基于条件分布函数的梯度,其有效性只需要群体参数的平滑性条件。实际上,所提出的估计器可以通过函数主成分分析的标准算法获得。我们通过大量模拟和两个经验实例来证明所提出的方法。
{"title":"Gradient‐based approach to sufficient dimension reduction with functional or longitudinal covariates","authors":"Ming-Yueh Huang, Kwun Chuen Gary Chan","doi":"10.1111/sjos.12724","DOIUrl":"https://doi.org/10.1111/sjos.12724","url":null,"abstract":"In this paper, we focus on the sufficient dimension reduction problem in regression analysis with real‐valued response and functional or longitudinal covariates. We propose a new method based on gradients of the conditional distribution function to estimate the sufficient dimension reduction subspace. While existing inverse‐regression‐type methods relies on a linearity condition, our method is based on the gradient of conditional distribution function and its validity only requires smoothness conditions on the population parameters. Practically, the proposed estimator can be obtained by standard algorithm of functional principal component analysis. The proposed method is demonstrated through extensive simulations and two empirical examples.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141124124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric efficient estimation in high‐dimensional partial linear regression models 高维偏线性回归模型中的半参数高效估计
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-05-15 DOI: 10.1111/sjos.12716
Xinyu Fu, Mian Huang, Weixin Yao
We introduce a novel semiparametric efficient estimation procedure for high‐dimensional partial linear regression models to overcome the challenge of efficiency loss of the traditional least‐squares based estimation procedure under unknown error distributions, while enjoying several appealing theoretical properties. The new estimation procedure provides a sparse estimator for the parametric component and achieves the semiparametric efficiency as the oracle maximum likelihood estimator as if the error distribution was known. By employing the penalized estimation and the semiparametric efficiency theory for ultra‐high‐dimensional partial linear model, the procedure enjoys the oracle variable selection property and offers efficiency gain for non‐Gaussian random errors, while maintaining the same efficiency as the least squares‐based estimator for Gaussian random errors. Extensive simulation studies and an empirical application are conducted to demonstrate the effectiveness of the proposed procedure.
我们为高维偏线性回归模型引入了一种新的半参数高效估计程序,以克服传统的基于最小二乘法的估计程序在未知误差分布下的效率损失难题,同时还具有一些吸引人的理论特性。新的估计程序为参数部分提供了一个稀疏估计器,并在误差分布已知的情况下实现了与甲骨文最大似然估计器一样的半参数效率。通过采用超高维偏线性模型的惩罚估计和半参数效率理论,该程序享有oracle变量选择特性,并为非高斯随机误差提供了效率增益,同时保持了与基于最小二乘法的高斯随机误差估计器相同的效率。通过广泛的模拟研究和实证应用,证明了所提程序的有效性。
{"title":"Semiparametric efficient estimation in high‐dimensional partial linear regression models","authors":"Xinyu Fu, Mian Huang, Weixin Yao","doi":"10.1111/sjos.12716","DOIUrl":"https://doi.org/10.1111/sjos.12716","url":null,"abstract":"We introduce a novel semiparametric efficient estimation procedure for high‐dimensional partial linear regression models to overcome the challenge of efficiency loss of the traditional least‐squares based estimation procedure under unknown error distributions, while enjoying several appealing theoretical properties. The new estimation procedure provides a sparse estimator for the parametric component and achieves the semiparametric efficiency as the oracle maximum likelihood estimator as if the error distribution was known. By employing the penalized estimation and the semiparametric efficiency theory for ultra‐high‐dimensional partial linear model, the procedure enjoys the oracle variable selection property and offers efficiency gain for non‐Gaussian random errors, while maintaining the same efficiency as the least squares‐based estimator for Gaussian random errors. Extensive simulation studies and an empirical application are conducted to demonstrate the effectiveness of the proposed procedure.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140975672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cox processes driven by transformed Gaussian processes on linear networks—A review and new contributions 线性网络上由变换高斯过程驱动的考克斯过程--回顾与新贡献
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-05-14 DOI: 10.1111/sjos.12720
Jesper Møller, Jakob G. Rasmussen
There is a lack of point process models on linear networks. For an arbitrary linear network, we consider new models for a Cox process with an isotropic pair correlation function obtained in various ways by transforming an isotropic Gaussian process which is used for driving the random intensity function of the Cox process. In particular, we introduce three model classes given by log Gaussian, interrupted, and permanental Cox processes on linear networks, and consider for the first time statistical procedures and applications for parametric families of such models. Moreover, we construct new simulation algorithms for Gaussian processes on linear networks and discuss whether the geodesic metric or the resistance metric should be used for the kind of Cox processes studied in this paper.
线性网络缺乏点过程模型。对于任意线性网络,我们考虑了具有各向同性对相关函数的 Cox 过程的新模型,该模型是通过转换各向同性高斯过程(用于驱动 Cox 过程的随机强度函数)以各种方式获得的。特别是,我们引入了线性网络上对数高斯过程、间断过程和永久考克斯过程给出的三类模型,并首次考虑了此类模型参数族的统计程序和应用。此外,我们还为线性网络上的高斯过程构建了新的模拟算法,并讨论了本文研究的这类 Cox 过程应该使用大地度量还是阻力度量。
{"title":"Cox processes driven by transformed Gaussian processes on linear networks—A review and new contributions","authors":"Jesper Møller, Jakob G. Rasmussen","doi":"10.1111/sjos.12720","DOIUrl":"https://doi.org/10.1111/sjos.12720","url":null,"abstract":"There is a lack of point process models on linear networks. For an arbitrary linear network, we consider new models for a Cox process with an isotropic pair correlation function obtained in various ways by transforming an isotropic Gaussian process which is used for driving the random intensity function of the Cox process. In particular, we introduce three model classes given by log Gaussian, interrupted, and permanental Cox processes on linear networks, and consider for the first time statistical procedures and applications for parametric families of such models. Moreover, we construct new simulation algorithms for Gaussian processes on linear networks and discuss whether the geodesic metric or the resistance metric should be used for the kind of Cox processes studied in this paper.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On maximizing the likelihood function of general geostatistical models 论一般地质统计模型似然函数的最大化
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-05-07 DOI: 10.1111/sjos.12722
Tingjin Chu
General geostatistical models are powerful tools for analyzing spatial datasets. A two‐step estimation based on the likelihood function is widely used by researchers, but several theoretical and computational challenges remain to be addressed. First, it is unclear whether there is a unique global maximizer of the log‐likelihood function, a seemingly simple but theoretically challenging question. The second challenge is the convexity of the log‐likelihood function. Besides these two challenges in maximizing the likelihood function, we also study the theoretical property of the two‐step estimation. Unlike many previous works, our results can apply to the non‐twice differentiable covariance functions. In the simulation studies, three optimization algorithms are evaluated in terms of maximizing the log‐likelihood functions.
一般地质统计模型是分析空间数据集的强大工具。研究人员广泛使用基于似然函数的两步估计法,但仍有一些理论和计算难题有待解决。首先,对数似然函数是否存在唯一的全局最大化尚不清楚,这是一个看似简单但在理论上极具挑战性的问题。第二个挑战是对数似然函数的凸性。除了最大化似然函数的这两个挑战,我们还研究了两步估计的理论属性。与之前的许多研究不同,我们的结果可以适用于非两次可微分协方差函数。在模拟研究中,我们从最大化对数似然函数的角度评估了三种优化算法。
{"title":"On maximizing the likelihood function of general geostatistical models","authors":"Tingjin Chu","doi":"10.1111/sjos.12722","DOIUrl":"https://doi.org/10.1111/sjos.12722","url":null,"abstract":"General geostatistical models are powerful tools for analyzing spatial datasets. A two‐step estimation based on the likelihood function is widely used by researchers, but several theoretical and computational challenges remain to be addressed. First, it is unclear whether there is a unique global maximizer of the log‐likelihood function, a seemingly simple but theoretically challenging question. The second challenge is the convexity of the log‐likelihood function. Besides these two challenges in maximizing the likelihood function, we also study the theoretical property of the two‐step estimation. Unlike many previous works, our results can apply to the non‐twice differentiable covariance functions. In the simulation studies, three optimization algorithms are evaluated in terms of maximizing the log‐likelihood functions.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140942490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mahalanobis balancing: A multivariate perspective on approximate covariate balancing 马哈拉诺比斯平衡:近似协变量平衡的多变量视角
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-04-26 DOI: 10.1111/sjos.12721
Yimin Dai, Ying Yan
In the past decade, various exact balancing‐based weighting methods were introduced to the causal inference literature. It eliminates covariate imbalance by imposing balancing constraints in a certain optimization problem, which can nevertheless be infeasible when there is bad overlap between the covariate distributions in the treated and control groups or when the covariates are high dimensional. Recently, approximate balancing was proposed as an alternative balancing framework. It resolves the feasibility issue by using inequality moment constraints instead. However, it can be difficult to select the threshold parameters. Moreover, moment constraints may not fully capture the discrepancy of covariate distributions. In this paper, we propose Mahalanobis balancing to approximately balance covariate distributions from a multivariate perspective. We use a quadratic constraint to control overall imbalance with a single threshold parameter, which can be tuned by a simple selection procedure. We show that the dual problem of Mahalanobis balancing is an norm‐based regularized regression problem, and establish interesting connection to propensity score models. We derive asymptotic properties, discuss the high‐dimensional scenario, and make extensive numerical comparisons with existing balancing methods.
在过去十年中,各种基于精确平衡的加权方法被引入因果推理文献。这种方法通过在某个优化问题中施加平衡约束来消除协变量的不平衡,但当治疗组和对照组的协变量分布存在严重重叠或协变量维度较高时,这种方法可能并不可行。最近,有人提出了近似平衡作为另一种平衡框架。它通过使用不等矩约束来解决可行性问题。然而,选择阈值参数可能比较困难。此外,矩约束可能无法完全捕捉协变量分布的差异。在本文中,我们提出了马哈拉诺比斯平衡法,从多变量的角度近似平衡协变量分布。我们使用二次约束来控制整体不平衡,只需一个阈值参数,该参数可通过简单的选择程序进行调整。我们证明了 Mahalanobis 平衡的对偶问题是一个基于规范的正则化回归问题,并与倾向评分模型建立了有趣的联系。我们推导了渐近特性,讨论了高维情况,并与现有的平衡方法进行了广泛的数值比较。
{"title":"Mahalanobis balancing: A multivariate perspective on approximate covariate balancing","authors":"Yimin Dai, Ying Yan","doi":"10.1111/sjos.12721","DOIUrl":"https://doi.org/10.1111/sjos.12721","url":null,"abstract":"In the past decade, various exact balancing‐based weighting methods were introduced to the causal inference literature. It eliminates covariate imbalance by imposing balancing constraints in a certain optimization problem, which can nevertheless be infeasible when there is bad overlap between the covariate distributions in the treated and control groups or when the covariates are high dimensional. Recently, approximate balancing was proposed as an alternative balancing framework. It resolves the feasibility issue by using inequality moment constraints instead. However, it can be difficult to select the threshold parameters. Moreover, moment constraints may not fully capture the discrepancy of covariate distributions. In this paper, we propose Mahalanobis balancing to approximately balance covariate distributions from a multivariate perspective. We use a quadratic constraint to control overall imbalance with a single threshold parameter, which can be tuned by a simple selection procedure. We show that the dual problem of Mahalanobis balancing is an norm‐based regularized regression problem, and establish interesting connection to propensity score models. We derive asymptotic properties, discuss the high‐dimensional scenario, and make extensive numerical comparisons with existing balancing methods.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140801664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotic properties of resampling‐based processes for the average treatment effect in observational studies with competing risks 具有竞争风险的观察性研究中基于重采样过程的平均治疗效果的渐近特性
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-04-25 DOI: 10.1111/sjos.12714
Jasmin Rühl, Sarah Friedrich
In observational studies with time‐to‐event outcomes, the g‐formula can be used to estimate a treatment effect in the presence of confounding factors. However, the asymptotic distribution of the corresponding stochastic process is complicated and thus not suitable for deriving confidence intervals or time‐simultaneous confidence bands for the average treatment effect. A common remedy are resampling‐based approximations, with Efron's nonparametric bootstrap being the standard tool in practice. We investigate the large sample properties of three different resampling approaches and prove their asymptotic validity in a setting with time‐to‐event data subject to competing risks. The usage of these approaches is demonstrated by an analysis of the effect of physical activity on the risk of knee replacement among patients with advanced knee osteoarthritis.
在具有时间到事件结果的观察性研究中,g 公式可用于估计存在混杂因素时的治疗效果。不过,相应随机过程的渐近分布比较复杂,因此不适合用于推导平均治疗效果的置信区间或时间同步置信带。常用的补救方法是基于重采样的近似方法,其中埃夫隆的非参数自举法是实践中的标准工具。我们研究了三种不同的重采样方法的大样本特性,并证明了它们在具有竞争风险的时间到事件数据中的渐近有效性。通过分析体育锻炼对晚期膝关节骨性关节炎患者膝关节置换风险的影响,证明了这些方法的用途。
{"title":"Asymptotic properties of resampling‐based processes for the average treatment effect in observational studies with competing risks","authors":"Jasmin Rühl, Sarah Friedrich","doi":"10.1111/sjos.12714","DOIUrl":"https://doi.org/10.1111/sjos.12714","url":null,"abstract":"In observational studies with time‐to‐event outcomes, the g‐formula can be used to estimate a treatment effect in the presence of confounding factors. However, the asymptotic distribution of the corresponding stochastic process is complicated and thus not suitable for deriving confidence intervals or time‐simultaneous confidence bands for the average treatment effect. A common remedy are resampling‐based approximations, with Efron's nonparametric bootstrap being the standard tool in practice. We investigate the large sample properties of three different resampling approaches and prove their asymptotic validity in a setting with time‐to‐event data subject to competing risks. The usage of these approaches is demonstrated by an analysis of the effect of physical activity on the risk of knee replacement among patients with advanced knee osteoarthritis.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140801709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Minimax estimation of functional principal components from noisy discretized functional data 从噪声离散函数数据中最小估计函数主成分
IF 1 4区 数学 Q3 Mathematics Pub Date : 2024-04-24 DOI: 10.1111/sjos.12719
Ryad Belhakem, Franck Picard, Vincent Rivoirard, Angelina Roche
Functional Principal Component Analysis is a reference method for dimension reduction of curve data. Its theoretical properties are now well understood in the simplified case where the sample curves are fully observed without noise. However, functional data are noisy and necessarily observed on a finite discretization grid. Common practice consists in smoothing the data and then to compute the functional estimates, but the impact of this denoising step on the procedure's statistical performance are rarely considered. Here we prove new convergence rates for functional principal component estimators. We introduce a double asymptotic framework: one corresponding to the sampling size and a second to the size of the grid. We prove that estimates based on projection onto histograms show optimal rates in a minimax sense. Theoretical results are illustrated on simulated data and the method is applied to the visualization of genomic data.
函数主成分分析法是一种用于降低曲线数据维度的参考方法。在简化的情况下,即样本曲线完全被观测到而没有噪声,人们现在已经很好地理解了它的理论特性。然而,函数数据是有噪声的,而且必须在有限离散网格上进行观测。通常的做法是对数据进行平滑处理,然后计算函数估计值,但很少考虑这一去噪步骤对程序统计性能的影响。在此,我们证明了函数式主成分估计器的新收敛率。我们引入了双重渐近框架:一个与采样大小相对应,另一个与网格大小相对应。我们证明,基于投影到直方图的估计值显示出最小值意义上的最优率。我们在模拟数据上对理论结果进行了说明,并将该方法应用于基因组数据的可视化。
{"title":"Minimax estimation of functional principal components from noisy discretized functional data","authors":"Ryad Belhakem, Franck Picard, Vincent Rivoirard, Angelina Roche","doi":"10.1111/sjos.12719","DOIUrl":"https://doi.org/10.1111/sjos.12719","url":null,"abstract":"Functional Principal Component Analysis is a reference method for dimension reduction of curve data. Its theoretical properties are now well understood in the simplified case where the sample curves are fully observed without noise. However, functional data are noisy and necessarily observed on a finite discretization grid. Common practice consists in smoothing the data and then to compute the functional estimates, but the impact of this denoising step on the procedure's statistical performance are rarely considered. Here we prove new convergence rates for functional principal component estimators. We introduce a double asymptotic framework: one corresponding to the sampling size and a second to the size of the grid. We prove that estimates based on projection onto histograms show optimal rates in a minimax sense. Theoretical results are illustrated on simulated data and the method is applied to the visualization of genomic data.","PeriodicalId":49567,"journal":{"name":"Scandinavian Journal of Statistics","volume":null,"pages":null},"PeriodicalIF":1.0,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140801711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Scandinavian Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1