首页 > 最新文献

Electronic Journal of Statistics最新文献

英文 中文
Dimension independent excess risk by stochastic gradient descent 随机梯度下降的维数无关超额风险
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2055
X. Chen, Qiang Liu, Xin T. Tong
{"title":"Dimension independent excess risk by stochastic gradient descent","authors":"X. Chen, Qiang Liu, Xin T. Tong","doi":"10.1214/22-ejs2055","DOIUrl":"https://doi.org/10.1214/22-ejs2055","url":null,"abstract":"","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47118771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Penalized estimation of threshold auto-regressive models with many components and thresholds. 多成分多阈值阈值自回归模型的惩罚性估计
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 Epub Date: 2022-03-22 DOI: 10.1214/22-EJS1982
Kunhui Zhang, Abolfazl Safikhani, Alex Tank, Ali Shojaie

Thanks to their simplicity and interpretable structure, autoregressive processes are widely used to model time series data. However, many real time series data sets exhibit non-linear patterns, requiring nonlinear modeling. The threshold Auto-Regressive (TAR) process provides a family of non-linear auto-regressive time series models in which the process dynamics are specific step functions of a thresholding variable. While estimation and inference for low-dimensional TAR models have been investigated, high-dimensional TAR models have received less attention. In this article, we develop a new framework for estimating high-dimensional TAR models, and propose two different sparsity-inducing penalties. The first penalty corresponds to a natural extension of classical TAR model to high-dimensional settings, where the same threshold is enforced for all model parameters. Our second penalty develops a more flexible TAR model, where different thresholds are allowed for different auto-regressive coefficients. We show that both penalized estimation strategies can be utilized in a three-step procedure that consistently learns both the thresholds and the corresponding auto-regressive coefficients. However, our theoretical and empirical investigations show that the direct extension of the TAR model is not appropriate for high-dimensional settings and is better suited for moderate dimensions. In contrast, the more flexible extension of the TAR model leads to consistent estimation and superior empirical performance in high dimensions.

自回归过程结构简单、易于解释,因此被广泛用于建立时间序列数据模型。然而,许多真实的时间序列数据集都表现出非线性模式,需要非线性建模。阈值自回归(TAR)过程提供了一系列非线性自回归时间序列模型,其中的过程动态是阈值变量的特定阶跃函数。虽然低维 TAR 模型的估计和推理已经得到研究,但高维 TAR 模型受到的关注较少。在本文中,我们为估计高维 TAR 模型开发了一个新框架,并提出了两种不同的稀疏性诱导惩罚。第一种惩罚相当于将经典 TAR 模型自然扩展到高维环境,在这种情况下,所有模型参数都有相同的阈值。我们的第二种惩罚方法开发了一种更灵活的 TAR 模型,允许对不同的自回归系数采用不同的阈值。我们的研究表明,这两种惩罚估计策略都可以在一个三步程序中使用,该程序可以持续学习阈值和相应的自回归系数。然而,我们的理论和实证研究表明,TAR 模型的直接扩展并不适合高维设置,而更适合中等维度。相比之下,TAR 模型更灵活的扩展则能在高维度下实现一致的估计和卓越的实证性能。
{"title":"Penalized estimation of threshold auto-regressive models with many components and thresholds.","authors":"Kunhui Zhang, Abolfazl Safikhani, Alex Tank, Ali Shojaie","doi":"10.1214/22-EJS1982","DOIUrl":"10.1214/22-EJS1982","url":null,"abstract":"<p><p>Thanks to their simplicity and interpretable structure, autoregressive processes are widely used to model time series data. However, many real time series data sets exhibit non-linear patterns, requiring nonlinear modeling. The threshold Auto-Regressive (TAR) process provides a family of non-linear auto-regressive time series models in which the process dynamics are specific step functions of a thresholding variable. While estimation and inference for low-dimensional TAR models have been investigated, high-dimensional TAR models have received less attention. In this article, we develop a new framework for estimating high-dimensional TAR models, and propose two different sparsity-inducing penalties. The first penalty corresponds to a natural extension of classical TAR model to high-dimensional settings, where the same threshold is enforced for all model parameters. Our second penalty develops a more flexible TAR model, where different thresholds are allowed for different auto-regressive coefficients. We show that both penalized estimation strategies can be utilized in a three-step procedure that consistently learns both the thresholds and the corresponding auto-regressive coefficients. However, our theoretical and empirical investigations show that the direct extension of the TAR model is not appropriate for high-dimensional settings and is better suited for moderate dimensions. In contrast, the more flexible extension of the TAR model leads to consistent estimation and superior empirical performance in high dimensions.</p>","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":"16 1","pages":"1891-1951"},"PeriodicalIF":1.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10088520/pdf/nihms-1885625.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9851486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On sufficient variable screening using log odds ratio filter 利用对数比值比滤波器进行充分变量筛选
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/21-ejs1951
Baoying Yang, Wenbo Wu, Xiangrong Yin
: For ultrahigh-dimensional data, variable screening is an impor- tant step to reduce the scale of the problem, hence, to improve the estimation accuracy and efficiency. In this paper, we propose a new dependence measure which is called the log odds ratio statistic to be used under the sufficient variable screening framework. The sufficient variable screening approach ensures the sufficiency of the selected input features in model-ing the regression function and is an enhancement of existing marginal screening methods. In addition, we propose an ensemble variable screening approach to combine the proposed fused log odds ratio filter with the fused Kolmogorov filter to achieve supreme performance by taking advantages of both filters. We establish the sure screening properties of the fused log odds ratio filter for both marginal variable screening and sufficient variable screening. Extensive simulations and a real data analysis are provided to demonstrate the usefulness of the proposed log odds ratio filter and the sufficient variable screening procedure.
:对于超高维数据,变量筛选是减少问题规模的重要步骤,因此可以提高估计精度和效率。在本文中,我们提出了一种新的相关性测度,称为对数比值比统计量,用于有效变量筛选框架下。有效的变量筛选方法确保了所选输入特征在回归函数建模中的有效性,是对现有边际筛选方法的改进。此外,我们提出了一种集成变量筛选方法,将所提出的融合对数比值比滤波器与融合Kolmogorov滤波器相结合,通过利用这两种滤波器的优势实现最高性能。我们为边际变量筛选和有效变量筛选建立了融合对数比值比滤波器的可靠筛选特性。提供了广泛的模拟和实际数据分析,以证明所提出的对数比值比滤波器和有效的变量筛选程序的有用性。
{"title":"On sufficient variable screening using log odds ratio filter","authors":"Baoying Yang, Wenbo Wu, Xiangrong Yin","doi":"10.1214/21-ejs1951","DOIUrl":"https://doi.org/10.1214/21-ejs1951","url":null,"abstract":": For ultrahigh-dimensional data, variable screening is an impor- tant step to reduce the scale of the problem, hence, to improve the estimation accuracy and efficiency. In this paper, we propose a new dependence measure which is called the log odds ratio statistic to be used under the sufficient variable screening framework. The sufficient variable screening approach ensures the sufficiency of the selected input features in model-ing the regression function and is an enhancement of existing marginal screening methods. In addition, we propose an ensemble variable screening approach to combine the proposed fused log odds ratio filter with the fused Kolmogorov filter to achieve supreme performance by taking advantages of both filters. We establish the sure screening properties of the fused log odds ratio filter for both marginal variable screening and sufficient variable screening. Extensive simulations and a real data analysis are provided to demonstrate the usefulness of the proposed log odds ratio filter and the sufficient variable screening procedure.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47203671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monte Carlo Markov chains constrained on graphs for a target with disconnected support 具有断开支持的目标在图上约束的蒙特卡罗马尔可夫链
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2043
R. Cerqueti, Emilio De Santis
{"title":"Monte Carlo Markov chains constrained on graphs for a target with disconnected support","authors":"R. Cerqueti, Emilio De Santis","doi":"10.1214/22-ejs2043","DOIUrl":"https://doi.org/10.1214/22-ejs2043","url":null,"abstract":"","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44387163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The robust nearest shrunken centroids classifier for high-dimensional heavy-tailed data 高维重尾数据的鲁棒最近收缩质心分类器
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2022
Shaokang Ren, Qing Mai
: The nearest shrunken centroids classifier (NSC) is a popular high-dimensional classifier. However, it is prone to inaccurate classification when the data is heavy-tailed. In this paper, we develop a robust general- ization of NSC (RNSC) which remains effective under such circumstances. By incorporating the Huber loss both in the estimation and the calcula- tion of the score function, we reduce the impacts of heavy tails. We rigorously show the variable selection, estimation, and prediction consistency in high dimensions under weak moment conditions. Empirically, our proposal greatly outperforms NSC and many other successful classifiers when data is heavy-tailed while remaining comparable to NSC in the absence of heavy tails. The favorable performance of RNSC is also demonstrated in a real data example.
最近萎缩质心分类器(NSC)是一种流行的高维分类器。然而,当数据是重尾数据时,容易导致分类不准确。在本文中,我们发展了一个在这种情况下仍然有效的稳健的NSC (RNSC)一般化。通过在分数函数的估计和计算中同时考虑Huber损失,我们减小了重尾的影响。我们严格地证明了在弱矩条件下高维变量选择、估计和预测的一致性。根据经验,当数据是重尾时,我们的建议大大优于NSC和许多其他成功的分类器,而在没有重尾的情况下,我们的建议与NSC相当。通过一个实际的数据实例验证了RNSC的良好性能。
{"title":"The robust nearest shrunken centroids classifier for high-dimensional heavy-tailed data","authors":"Shaokang Ren, Qing Mai","doi":"10.1214/22-ejs2022","DOIUrl":"https://doi.org/10.1214/22-ejs2022","url":null,"abstract":": The nearest shrunken centroids classifier (NSC) is a popular high-dimensional classifier. However, it is prone to inaccurate classification when the data is heavy-tailed. In this paper, we develop a robust general- ization of NSC (RNSC) which remains effective under such circumstances. By incorporating the Huber loss both in the estimation and the calcula- tion of the score function, we reduce the impacts of heavy tails. We rigorously show the variable selection, estimation, and prediction consistency in high dimensions under weak moment conditions. Empirically, our proposal greatly outperforms NSC and many other successful classifiers when data is heavy-tailed while remaining comparable to NSC in the absence of heavy tails. The favorable performance of RNSC is also demonstrated in a real data example.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45637959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-sample test for equal distributions in separate metric space: New maximum mean discrepancy based approaches 独立度量空间中相等分布的两样本检验:基于最大均值差异的新方法
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2033
Jin-Ting Zhang, Łukasz Smaga
{"title":"Two-sample test for equal distributions in separate metric space: New maximum mean discrepancy based approaches","authors":"Jin-Ting Zhang, Łukasz Smaga","doi":"10.1214/22-ejs2033","DOIUrl":"https://doi.org/10.1214/22-ejs2033","url":null,"abstract":"","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46448002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Functional estimation of anisotropic covariance and autocovariance operators on the sphere 球面上各向异性协方差和自协方差算子的函数估计
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2021-12-23 DOI: 10.1214/22-ejs2064
Alessia Caponera, J. Fageot, Matthieu Simeoni, V. Panaretos
We propose nonparametric estimators for the second-order central moments of possibly anisotropic spherical random fields, within a functional data analysis context. We consider a measurement framework where each random field among an identically distributed collection of spherical random fields is sampled at a few random directions, possibly subject to measurement error. The collection of random fields could be i.i.d. or serially dependent. Though similar setups have already been explored for random functions defined on the unit interval, the nonparametric estimators proposed in the literature often rely on local polynomials, which do not readily extend to the (product) spherical setting. We therefore formulate our estimation procedure as a variational problem involving a generalized Tikhonov regularization term. The latter favours smooth covariance/autocovariance functions, where the smoothness is specified by means of suitable Sobolev-like pseudo-differential operators. Using the machinery of reproducing kernel Hilbert spaces, we establish representer theorems that fully characterize the form of our estimators. We determine their uniform rates of convergence as the number of random fields diverges, both for the dense (increasing number of spatial samples) and sparse (bounded number of spatial samples) regimes. We moreover demonstrate the computational feasibility and practical merits of our estimation procedure in a simulation setting, assuming a fixed number of samples per random field. Our numerical estimation procedure leverages the sparsity and second-order Kronecker structure of our setup to reduce the computational and memory requirements by approximately three orders of magnitude compared to a naive implementation would require. AMS 2000 subject classifications: Primary 62G08; secondary 62M.
在函数数据分析的背景下,我们提出了可能各向异性球面随机场的二阶中心矩的非参数估计。我们考虑一个测量框架,其中在同分布的球形随机场集合中的每个随机场在几个随机方向上采样,可能会受到测量误差的影响。随机字段的集合可以是i.i.d.或序列相关的。尽管已经为单位区间上定义的随机函数探索了类似的设置,但文献中提出的非参数估计通常依赖于局部多项式,而局部多项式不容易扩展到(乘积)球面设置。因此,我们将我们的估计过程公式化为涉及广义Tikhonov正则化项的变分问题。后者倾向于平滑协方差/自协方差函数,其中平滑度是通过合适的类Sobolev伪微分算子来指定的。利用重生成核希尔伯特空间的机制,我们建立了完全表征我们的估计量形式的表示定理。对于密集(空间样本数量的增加)和稀疏(空间样本的有界数量)状态,我们确定它们随着随机场数的发散而一致的收敛速度。此外,我们还证明了我们的估计程序在模拟环境中的计算可行性和实际优点,假设每个随机场有固定数量的样本。我们的数值估计程序利用了我们设置的稀疏性和二阶Kronecker结构,与简单的实现相比,将计算和内存需求减少了大约三个数量级。AMS 2000学科分类:初级62G08;次级62M。
{"title":"Functional estimation of anisotropic covariance and autocovariance operators on the sphere","authors":"Alessia Caponera, J. Fageot, Matthieu Simeoni, V. Panaretos","doi":"10.1214/22-ejs2064","DOIUrl":"https://doi.org/10.1214/22-ejs2064","url":null,"abstract":"We propose nonparametric estimators for the second-order central moments of possibly anisotropic spherical random fields, within a functional data analysis context. We consider a measurement framework where each random field among an identically distributed collection of spherical random fields is sampled at a few random directions, possibly subject to measurement error. The collection of random fields could be i.i.d. or serially dependent. Though similar setups have already been explored for random functions defined on the unit interval, the nonparametric estimators proposed in the literature often rely on local polynomials, which do not readily extend to the (product) spherical setting. We therefore formulate our estimation procedure as a variational problem involving a generalized Tikhonov regularization term. The latter favours smooth covariance/autocovariance functions, where the smoothness is specified by means of suitable Sobolev-like pseudo-differential operators. Using the machinery of reproducing kernel Hilbert spaces, we establish representer theorems that fully characterize the form of our estimators. We determine their uniform rates of convergence as the number of random fields diverges, both for the dense (increasing number of spatial samples) and sparse (bounded number of spatial samples) regimes. We moreover demonstrate the computational feasibility and practical merits of our estimation procedure in a simulation setting, assuming a fixed number of samples per random field. Our numerical estimation procedure leverages the sparsity and second-order Kronecker structure of our setup to reduce the computational and memory requirements by approximately three orders of magnitude compared to a naive implementation would require. AMS 2000 subject classifications: Primary 62G08; secondary 62M.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49616151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Convergence properties of data augmentation algorithms for high-dimensional robit regression 高维robit回归数据增广算法的收敛性
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2021-12-20 DOI: 10.1214/22-ejs2098
Sourav Mukherjee, K. Khare, Saptarshi Chakraborty Department of Statistics, U. Florida, D. Biostatistics, State University of New York at Buffalo
Abstract: The logistic and probit link functions are the most common choices for regression models with a binary response. However, these choices are not robust to the presence of outliers/unexpected observations. The robit link function, which is equal to the inverse CDF of the Student’s t-distribution, provides a robust alternative to the probit and logistic link functions. A multivariate normal prior for the regression coefficients is the standard choice for Bayesian inference in robit regression models. The resulting posterior density is intractable and a Data Augmentation (DA) Markov chain is used to generate approximate samples from the desired posterior distribution. Establishing geometric ergodicity for this DA Markov chain is important as it provides theoretical guarantees for asymptotic validity of MCMC standard errors for desired posterior expectations/quantiles. Previous work [1] established geometric ergodicity of this robit DA Markov chain assuming (i) the sample size n dominates the number of predictors p, and (ii) an additional constraint which requires the sample size to be bounded above by a fixed constant which depends on the design matrix X. In particular, modern highdimensional settings where n < p are not considered. In this work, we show that the robit DA Markov chain is trace-class (i.e., the eigenvalues of the corresponding Markov operator are summable) for arbitrary choices of the sample size n, the number of predictors p, the design matrix X, and the prior mean and variance parameters. The trace-class property implies geometric ergodicity. Moreover, this property allows us to conclude that the sandwich robit chain (obtained by inserting an inexpensive extra step in between the two steps of the DA chain) is strictly better than the robit DA chain in an appropriate sense, and enables the use of recent methods to estimate the spectral gap of trace class DA Markov chains.
摘要:对于具有二元响应的回归模型,逻辑和概率连接函数是最常见的选择。然而,这些选择对于异常值/意外观测的存在并不稳健。robit链接函数等于Student t分布的逆CDF,为probit和逻辑链接函数提供了一种稳健的替代方案。回归系数的多元正态先验是robit回归模型中贝叶斯推理的标准选择。所得到的后验密度是难以处理的,并且使用数据增强(DA)马尔可夫链来从期望的后验分布生成近似样本。为该DA马尔可夫链建立几何遍历性是重要的,因为它为所需后验期望/分位数的MCMC标准误差的渐近有效性提供了理论保证。先前的工作[1]建立了该robit DA马尔可夫链的几何遍历性,假设(i)样本大小n支配预测因子p的数量,以及(ii)额外的约束,该约束要求样本大小由取决于设计矩阵X的固定常数在上面定界。特别是,不考虑n<p的现代高维设置。在这项工作中,我们证明了对于样本大小n、预测器数量p、设计矩阵X以及先验均值和方差参数的任意选择,robit DA马尔可夫链是迹类(即,对应的马尔可夫算子的特征值是可和的)。迹类性质暗示了几何遍历性。此外,这一性质使我们能够得出结论,夹层robit链(通过在DA链的两个步骤之间插入一个廉价的额外步骤获得)在适当的意义上严格优于robit DA链,并使我们能够使用最新的方法来估计迹类DA马尔可夫链的谱间隙。
{"title":"Convergence properties of data augmentation algorithms for high-dimensional robit regression","authors":"Sourav Mukherjee, K. Khare, Saptarshi Chakraborty Department of Statistics, U. Florida, D. Biostatistics, State University of New York at Buffalo","doi":"10.1214/22-ejs2098","DOIUrl":"https://doi.org/10.1214/22-ejs2098","url":null,"abstract":"Abstract: The logistic and probit link functions are the most common choices for regression models with a binary response. However, these choices are not robust to the presence of outliers/unexpected observations. The robit link function, which is equal to the inverse CDF of the Student’s t-distribution, provides a robust alternative to the probit and logistic link functions. A multivariate normal prior for the regression coefficients is the standard choice for Bayesian inference in robit regression models. The resulting posterior density is intractable and a Data Augmentation (DA) Markov chain is used to generate approximate samples from the desired posterior distribution. Establishing geometric ergodicity for this DA Markov chain is important as it provides theoretical guarantees for asymptotic validity of MCMC standard errors for desired posterior expectations/quantiles. Previous work [1] established geometric ergodicity of this robit DA Markov chain assuming (i) the sample size n dominates the number of predictors p, and (ii) an additional constraint which requires the sample size to be bounded above by a fixed constant which depends on the design matrix X. In particular, modern highdimensional settings where n < p are not considered. In this work, we show that the robit DA Markov chain is trace-class (i.e., the eigenvalues of the corresponding Markov operator are summable) for arbitrary choices of the sample size n, the number of predictors p, the design matrix X, and the prior mean and variance parameters. The trace-class property implies geometric ergodicity. Moreover, this property allows us to conclude that the sandwich robit chain (obtained by inserting an inexpensive extra step in between the two steps of the DA chain) is strictly better than the robit DA chain in an appropriate sense, and enables the use of recent methods to estimate the spectral gap of trace class DA Markov chains.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48632525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible inference of optimal individualized treatment strategy in covariate adjusted randomization with multiple covariates 多协变量调整随机化中最优个体化治疗策略的灵活推断
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2021-11-19 DOI: 10.1214/23-ejs2127
Trinetri Ghosh, Yanyuan Ma, Rui Song, Pingshou Zhong
To maximize clinical benefit, clinicians routinely tailor treatment to the individual characteristics of each patient, where individualized treatment rules are needed and are of significant research interest to statisticians. In the covariate-adjusted randomization clinical trial with many covariates, we model the treatment effect with an unspecified function of a single index of the covariates and leave the baseline response completely arbitrary. We devise a class of estimators to consistently estimate the treatment effect function and its associated index while bypassing the estimation of the baseline response, which is subject to the curse of dimensionality. We further develop inference tools to identify predictive covariates and isolate effective treatment region. The usefulness of the methods is demonstrated in both simulations and a clinical data example.
为了最大限度地提高临床效益,临床医生通常根据每位患者的个人特征定制治疗,需要个性化的治疗规则,这对统计学家来说具有重要的研究兴趣。在具有许多协变量的经协变量调整的随机化临床试验中,我们用协变量的单个指数的未指定函数对治疗效果进行建模,并使基线反应完全任意。我们设计了一类估计量,以一致地估计治疗效果函数及其相关指数,同时绕过基线反应的估计,这受到维度诅咒的影响。我们进一步开发了推断工具来识别预测协变量并隔离有效治疗区域。模拟和临床数据示例都证明了这些方法的有用性。
{"title":"Flexible inference of optimal individualized treatment strategy in covariate adjusted randomization with multiple covariates","authors":"Trinetri Ghosh, Yanyuan Ma, Rui Song, Pingshou Zhong","doi":"10.1214/23-ejs2127","DOIUrl":"https://doi.org/10.1214/23-ejs2127","url":null,"abstract":"To maximize clinical benefit, clinicians routinely tailor treatment to the individual characteristics of each patient, where individualized treatment rules are needed and are of significant research interest to statisticians. In the covariate-adjusted randomization clinical trial with many covariates, we model the treatment effect with an unspecified function of a single index of the covariates and leave the baseline response completely arbitrary. We devise a class of estimators to consistently estimate the treatment effect function and its associated index while bypassing the estimation of the baseline response, which is subject to the curse of dimensionality. We further develop inference tools to identify predictive covariates and isolate effective treatment region. The usefulness of the methods is demonstrated in both simulations and a clinical data example.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48902829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bounds in L1 Wasserstein distance on the normal approximation of general M-estimators 一般m估计量的正态逼近在L1 Wasserstein距离上的界
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2021-11-18 DOI: 10.1214/23-ejs2132
F. Bachoc, M. Fathi
We derive quantitative bounds on the rate of convergence in $L^1$ Wasserstein distance of general M-estimators, with an almost sharp (up to a logarithmic term) behavior in the number of observations. We focus on situations where the estimator does not have an explicit expression as a function of the data. The general method may be applied even in situations where the observations are not independent. Our main application is a rate of convergence for cross validation estimation of covariance parameters of Gaussian processes.
我们导出了一般M-估计量在$L^1$Wasserstein距离内收敛速度的定量界,在观测次数上具有几乎尖锐的(高达对数项)行为。我们关注的是估计器没有作为数据函数的显式表达式的情况。即使在观测不独立的情况下,也可以应用通用方法。我们的主要应用是高斯过程协方差参数的交叉验证估计的收敛速度。
{"title":"Bounds in L1 Wasserstein distance on the normal approximation of general M-estimators","authors":"F. Bachoc, M. Fathi","doi":"10.1214/23-ejs2132","DOIUrl":"https://doi.org/10.1214/23-ejs2132","url":null,"abstract":"We derive quantitative bounds on the rate of convergence in $L^1$ Wasserstein distance of general M-estimators, with an almost sharp (up to a logarithmic term) behavior in the number of observations. We focus on situations where the estimator does not have an explicit expression as a function of the data. The general method may be applied even in situations where the observations are not independent. Our main application is a rate of convergence for cross validation estimation of covariance parameters of Gaussian processes.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42149805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Electronic Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1