首页 > 最新文献

Annals of the Institute of Statistical Mathematics最新文献

英文 中文
Robust variable selection with exponential squared loss for partially linear spatial autoregressive models 部分线性空间自回归模型的指数平方损失鲁棒变量选择
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-05-03 DOI: 10.1007/s10463-023-00870-w
Xiuli Wang, Jingchang Shao, Jingjing Wu, Qiang Zhao

In this paper, we consider variable selection for a class of semiparametric spatial autoregressive models based on exponential squared loss (ESL). Using the orthogonal projection technique, we propose a novel orthogonality-based variable selection procedure that enables simultaneous model selection and parameter estimation, and identifies the significance of spatial effects. Under appropriate conditions, we show that the proposed procedure is consistent and the resulting estimator has oracle properties. Furthermore, some simulation studies and an analysis of the Boston housing price data are also carried out to examine the finite-sample performance of the proposed method.

本文研究了一类基于指数平方损失的半参数空间自回归模型的变量选择问题。利用正交投影技术,我们提出了一种新的基于正交性的变量选择过程,可以同时进行模型选择和参数估计,并识别空间效应的重要性。在适当的条件下,我们证明了所提出的过程是一致的,并且所得到的估计量具有oracle性质。此外,还对波士顿房价数据进行了一些模拟研究和分析,以检验所提出方法的有限样本性能。
{"title":"Robust variable selection with exponential squared loss for partially linear spatial autoregressive models","authors":"Xiuli Wang,&nbsp;Jingchang Shao,&nbsp;Jingjing Wu,&nbsp;Qiang Zhao","doi":"10.1007/s10463-023-00870-w","DOIUrl":"10.1007/s10463-023-00870-w","url":null,"abstract":"<div><p>In this paper, we consider variable selection for a class of semiparametric spatial autoregressive models based on exponential squared loss (ESL). Using the orthogonal projection technique, we propose a novel orthogonality-based variable selection procedure that enables simultaneous model selection and parameter estimation, and identifies the significance of spatial effects. Under appropriate conditions, we show that the proposed procedure is consistent and the resulting estimator has oracle properties. Furthermore, some simulation studies and an analysis of the Boston housing price data are also carried out to examine the finite-sample performance of the proposed method.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 6","pages":"949 - 977"},"PeriodicalIF":1.0,"publicationDate":"2023-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10463-023-00870-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48247773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical inference using regularized M-estimation in the reproducing kernel Hilbert space for handling missing data 在再现核希尔伯特空间中使用正则化m估计处理缺失数据的统计推断
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-04-27 DOI: 10.1007/s10463-023-00872-8
Hengfang Wang, Jae Kwang Kim

Imputation is a popular technique for handling missing data. We address a nonparametric imputation using the regularized M-estimation techniques in the reproducing kernel Hilbert space. Specifically, we first use kernel ridge regression to develop imputation for handling item nonresponse. Although this nonparametric approach is potentially promising for imputation, its statistical properties are not investigated in the literature. Under some conditions on the order of the tuning parameter, we first establish the root-n consistency of the kernel ridge regression imputation estimator and show that it achieves the lower bound of the semiparametric asymptotic variance. A nonparametric propensity score estimator using the reproducing kernel Hilbert space is also developed by the linear expression of the projection estimator. We show that the resulting propensity score estimator is asymptotically equivalent to the kernel ridge regression imputation estimator. Results from a limited simulation study are also presented to confirm our theory. The proposed method is applied to analyze air pollution data measured in Beijing, China.

代入是处理缺失数据的常用技术。我们在再现核希尔伯特空间中使用正则化m估计技术来解决非参数输入问题。具体而言,我们首先使用核脊回归来开发处理项目无响应的输入。虽然这种非参数方法有可能用于估算,但其统计性质尚未在文献中进行研究。在一定的调优参数阶数条件下,我们首先建立了核脊回归估计量的根n相合性,并证明了它达到了半参数渐近方差的下界。利用投影估计量的线性表达式,提出了利用再现核希尔伯特空间的非参数倾向评分估计量。我们证明了所得的倾向分数估计量是渐近等价于核脊回归估计量。有限模拟研究的结果也证实了我们的理论。将该方法应用于北京地区的空气污染数据分析。
{"title":"Statistical inference using regularized M-estimation in the reproducing kernel Hilbert space for handling missing data","authors":"Hengfang Wang,&nbsp;Jae Kwang Kim","doi":"10.1007/s10463-023-00872-8","DOIUrl":"10.1007/s10463-023-00872-8","url":null,"abstract":"<div><p>Imputation is a popular technique for handling missing data. We address a nonparametric imputation using the regularized M-estimation techniques in the reproducing kernel Hilbert space. Specifically, we first use kernel ridge regression to develop imputation for handling item nonresponse. Although this nonparametric approach is potentially promising for imputation, its statistical properties are not investigated in the literature. Under some conditions on the order of the tuning parameter, we first establish the root-<i>n</i> consistency of the kernel ridge regression imputation estimator and show that it achieves the lower bound of the semiparametric asymptotic variance. A nonparametric propensity score estimator using the reproducing kernel Hilbert space is also developed by the linear expression of the projection estimator. We show that the resulting propensity score estimator is asymptotically equivalent to the kernel ridge regression imputation estimator. Results from a limited simulation study are also presented to confirm our theory. The proposed method is applied to analyze air pollution data measured in Beijing, China.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 6","pages":"911 - 929"},"PeriodicalIF":1.0,"publicationDate":"2023-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10463-023-00872-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48637382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A goodness-of-fit test on the number of biclusters in a relational data matrix 关系数据矩阵中双聚类数的拟合优度检验
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-04-17 DOI: 10.1007/s10463-023-00869-3
Chihiro Watanabe, Taiji Suzuki

Biclustering is a method for detecting homogeneous submatrices in a given matrix. Although there are many studies that estimate the underlying bicluster structure of a matrix, few have enabled us to determine the appropriate number of biclusters. Recently, a statistical test on the number of biclusters has been proposed for a regular-grid bicluster structure. However, when the latent bicluster structure does not satisfy such regular-grid assumption, the previous test requires a larger number of biclusters than necessary for the null hypothesis to be accepted, which is not desirable in terms of interpreting the accepted structure. In this study, we propose a new statistical test on the number of biclusters that does not require the regular-grid assumption and derive the asymptotic behavior of the proposed test statistic in both null and alternative cases. We illustrate the effectiveness of the proposed method by applying it to both synthetic and practical data matrices.

双聚类是一种在给定矩阵中检测齐次矩阵的方法。虽然有许多研究估计了矩阵的潜在双簇结构,但很少有研究使我们能够确定适当的双簇数量。最近,对规则网格双聚类结构提出了一种双聚类数目的统计检验方法。然而,当潜在的双聚类结构不满足这种规则网格假设时,前面的检验需要比接受零假设所需的更多的双聚类,这在解释接受的结构方面是不可取的。在这项研究中,我们提出了一个新的双聚类数量的统计检验,它不需要正则网格假设,并推导了所提出的检验统计量在null和alternative情况下的渐近行为。我们通过将其应用于合成和实际数据矩阵来说明所提出方法的有效性。
{"title":"A goodness-of-fit test on the number of biclusters in a relational data matrix","authors":"Chihiro Watanabe,&nbsp;Taiji Suzuki","doi":"10.1007/s10463-023-00869-3","DOIUrl":"10.1007/s10463-023-00869-3","url":null,"abstract":"<div><p>Biclustering is a method for detecting homogeneous submatrices in a given matrix. Although there are many studies that estimate the underlying bicluster structure of a matrix, few have enabled us to determine the appropriate number of biclusters. Recently, a statistical test on the number of biclusters has been proposed for a regular-grid bicluster structure. However, when the latent bicluster structure does not satisfy such regular-grid assumption, the previous test requires a larger number of biclusters than necessary for the null hypothesis to be accepted, which is not desirable in terms of interpreting the accepted structure. In this study, we propose a new statistical test on the number of biclusters that does not require the regular-grid assumption and derive the asymptotic behavior of the proposed test statistic in both null and alternative cases. We illustrate the effectiveness of the proposed method by applying it to both synthetic and practical data matrices.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 6","pages":"979 - 1009"},"PeriodicalIF":1.0,"publicationDate":"2023-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42497102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene–environment interaction analysis under the Cox model Cox模型下的基因-环境互作分析
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-04-10 DOI: 10.1007/s10463-023-00871-9
Kuangnan Fang, Jingmao Li, Yaqing Xu, Shuangge Ma, Qingzhao Zhang

For the survival of cancer and many other complex diseases, gene–environment (G-E) interactions have been established as having essential importance. G-E interaction analysis can be roughly classified as marginal and joint, depending on the number of G variables analyzed at a time. In this study, we focus on joint analysis, which can better reflect disease biology and is statistically more challenging. Many approaches have been developed for joint G-E interaction analysis for survival outcomes and led to important findings. However, without rigorous statistical development, quite a few methods have a weak theoretical ground. To fill this knowledge gap, in this article, we consider joint G-E interaction analysis under the Cox model. Sparse group penalization is adopted for regularizing estimation and selecting important main effects and interactions. The “main effects, interactions” variable selection hierarchy, which has been strongly advocated in recent literature, is satisfied. Significantly advancing from some published studies, we rigorously establish the consistency properties under high dimensionality. An effective computational algorithm is developed, simulation demonstrates competitive performance of the proposed approach, and analysis of The Cancer Genome Atlas (TCGA) data on stomach adenocarcinoma (STAD) further demonstrates its practical utility.

对于癌症和许多其他复杂疾病的生存,基因-环境(G-E)相互作用已被确定为具有至关重要的意义。根据一次分析G变量的数量,G- e相互作用分析大致可分为边际和联合两种。在本研究中,我们侧重于联合分析,这可以更好地反映疾病生物学,在统计学上更具挑战性。已经开发了许多方法来联合G-E相互作用分析生存结果,并导致了重要的发现。然而,由于没有严格的统计发展,相当多的方法理论基础薄弱。为了填补这一知识空白,在本文中,我们考虑在Cox模型下的联合G-E相互作用分析。采用稀疏组惩罚对估计进行正则化,选择重要的主效应和交互作用。满足近年来文献中大力提倡的“主效应、交互作用”变量选择层次。在一些已发表的研究成果的基础上,我们严格地建立了高维下的一致性。开发了一种有效的计算算法,仿真验证了该方法的竞争性能,并通过对胃癌基因组图谱(TCGA)数据的分析进一步验证了该方法的实用性。
{"title":"Gene–environment interaction analysis under the Cox model","authors":"Kuangnan Fang,&nbsp;Jingmao Li,&nbsp;Yaqing Xu,&nbsp;Shuangge Ma,&nbsp;Qingzhao Zhang","doi":"10.1007/s10463-023-00871-9","DOIUrl":"10.1007/s10463-023-00871-9","url":null,"abstract":"<div><p>For the survival of cancer and many other complex diseases, gene–environment (G-E) interactions have been established as having essential importance. G-E interaction analysis can be roughly classified as marginal and joint, depending on the number of G variables analyzed at a time. In this study, we focus on joint analysis, which can better reflect disease biology and is statistically more challenging. Many approaches have been developed for joint G-E interaction analysis for survival outcomes and led to important findings. However, without rigorous statistical development, quite a few methods have a weak theoretical ground. To fill this knowledge gap, in this article, we consider joint G-E interaction analysis under the Cox model. Sparse group penalization is adopted for regularizing estimation and selecting important main effects and interactions. The “main effects, interactions” variable selection hierarchy, which has been strongly advocated in recent literature, is satisfied. Significantly advancing from some published studies, we rigorously establish the consistency properties under high dimensionality. An effective computational algorithm is developed, simulation demonstrates competitive performance of the proposed approach, and analysis of The Cancer Genome Atlas (TCGA) data on stomach adenocarcinoma (STAD) further demonstrates its practical utility.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 6","pages":"931 - 948"},"PeriodicalIF":1.0,"publicationDate":"2023-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10463-023-00871-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42111901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parametric estimation of spatial–temporal point processes using the Stoyan–Grabarnik statistic 基于Stoyan-Grabarnik统计量的时空点过程参数估计
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-03-10 DOI: 10.1007/s10463-023-00866-6
Conor Kresin, Frederic Schoenberg

A novel estimator for the parameters governing spatial–temporal point processes is proposed. Unlike the maximum likelihood estimator, the proposed estimator is fast and easy to compute, and does not require the computation or approximation of a computationally expensive integral. This parametric estimator is based on the Stoyan–Grabarnik (sum of inverse intensity) statistic and is shown to be consistent, under quite general conditions. Simulations are presented demonstrating the performance of the estimator.

提出了一种新的时空点过程控制参数估计方法。与极大似然估计量不同,所提出的估计量快速且易于计算,并且不需要计算或逼近计算昂贵的积分。该参数估计是基于Stoyan-Grabarnik(逆强度和)统计量,并被证明是一致的,在相当一般的条件下。仿真验证了该估计器的性能。
{"title":"Parametric estimation of spatial–temporal point processes using the Stoyan–Grabarnik statistic","authors":"Conor Kresin,&nbsp;Frederic Schoenberg","doi":"10.1007/s10463-023-00866-6","DOIUrl":"10.1007/s10463-023-00866-6","url":null,"abstract":"<div><p>A novel estimator for the parameters governing spatial–temporal point processes is proposed. Unlike the maximum likelihood estimator, the proposed estimator is fast and easy to compute, and does not require the computation or approximation of a computationally expensive integral. This parametric estimator is based on the Stoyan–Grabarnik (sum of inverse intensity) statistic and is shown to be consistent, under quite general conditions. Simulations are presented demonstrating the performance of the estimator.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 6","pages":"887 - 909"},"PeriodicalIF":1.0,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41313716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic data-based bin width selection for rose diagram 玫瑰图基于数据的仓宽自动选择
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-03-09 DOI: 10.1007/s10463-023-00868-4
Yasuhito Tsuruta, Masahiko Sagae

A rose diagram is a representation that circularly organizes data with the bin width as the central angle. This diagram is widely used to display and summarize circular data. Some studies have proposed the selector of bin width based on data. However, only a few papers have discussed the property of these selectors from a statistical perspective. Thus, this study aims to provide a data-based bin width selector for rose diagrams using a statistical approach. We consider that the radius of the rose diagram is a nonparametric estimator of the square root of two times the circular density. We derive the mean integrated square error of the rose diagram and its optimal bin width and propose two new selectors: normal reference rule and biased cross-validation. We show that biased cross-validation converges to its optimizer. Additionally, we propose a polygon rose diagram to enhance the rose diagram.

玫瑰图是一种以箱宽作为圆心角对数据进行圆形组织的表示。这个图表被广泛用于显示和总结循环数据。一些研究提出了基于数据的料仓宽度选择方法。然而,只有少数论文从统计学的角度讨论了这些选择器的性质。因此,本研究旨在使用统计方法为玫瑰图提供基于数据的bin宽度选择器。我们认为玫瑰图的半径是圆密度的平方根两倍的非参数估计量。我们推导了玫瑰图的平均积分平方误差及其最优库宽度,并提出了两个新的选择器:正态参考规则和有偏交叉验证。我们证明了有偏交叉验证收敛到它的优化器。此外,我们提出了一个多边形玫瑰图来增强玫瑰图。
{"title":"Automatic data-based bin width selection for rose diagram","authors":"Yasuhito Tsuruta,&nbsp;Masahiko Sagae","doi":"10.1007/s10463-023-00868-4","DOIUrl":"10.1007/s10463-023-00868-4","url":null,"abstract":"<div><p>A rose diagram is a representation that circularly organizes data with the bin width as the central angle. This diagram is widely used to display and summarize circular data. Some studies have proposed the selector of bin width based on data. However, only a few papers have discussed the property of these selectors from a statistical perspective. Thus, this study aims to provide a data-based bin width selector for rose diagrams using a statistical approach. We consider that the radius of the rose diagram is a nonparametric estimator of the square root of two times the circular density. We derive the mean integrated square error of the rose diagram and its optimal bin width and propose two new selectors: normal reference rule and biased cross-validation. We show that biased cross-validation converges to its optimizer. Additionally, we propose a polygon rose diagram to enhance the rose diagram.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 5","pages":"855 - 886"},"PeriodicalIF":1.0,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10463-023-00868-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47513957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixture of shifted binomial distributions for rating data 混合移位二项分布的评级数据
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-02-10 DOI: 10.1007/s10463-023-00865-7
Shaoting Li, Jiahua Chen

Rating data are a kind of ordinal categorical data routinely collected in survey sampling. The response value in such applications is confined to a finite number of ordered categories. Due to population heterogeneity, the respondents may have several different rating styles. A finite mixture model is thus most suitable to fit datasets of this nature. In this paper, we propose a two-component mixture of shifted binomial distributions for rating data. We show that this model is identifiable and propose a numerically stable penalized likelihood approach for parameter estimation. We adapt an expectation-maximization algorithm for the penalized maximum likelihood estimation. Our simulation results show that the penalized maximum likelihood estimator is consistent and effective. We fit the proposed model and other models in the literature to some real-world datasets and find the proposed model can have much better fits.

评级数据是在调查抽样中常规收集的一种有序分类数据。在这种应用中,响应值被限制在有限数量的有序类别中。由于人口异质性,受访者可能有几种不同的评级风格。因此,有限混合模型最适合拟合这种性质的数据集。在本文中,我们提出了一个双分量混合移位二项分布的评级数据。我们证明了该模型是可识别的,并提出了一种数值稳定的惩罚似然方法用于参数估计。我们采用了一种期望最大化算法来进行惩罚极大似然估计。仿真结果表明,惩罚极大似然估计是一致的、有效的。我们将提出的模型和文献中的其他模型拟合到一些现实世界的数据集,发现提出的模型可以有更好的拟合。
{"title":"Mixture of shifted binomial distributions for rating data","authors":"Shaoting Li,&nbsp;Jiahua Chen","doi":"10.1007/s10463-023-00865-7","DOIUrl":"10.1007/s10463-023-00865-7","url":null,"abstract":"<div><p>Rating data are a kind of ordinal categorical data routinely collected in survey sampling. The response value in such applications is confined to a finite number of ordered categories. Due to population heterogeneity, the respondents may have several different rating styles. A finite mixture model is thus most suitable to fit datasets of this nature. In this paper, we propose a two-component mixture of shifted binomial distributions for rating data. We show that this model is identifiable and propose a numerically stable penalized likelihood approach for parameter estimation. We adapt an expectation-maximization algorithm for the penalized maximum likelihood estimation. Our simulation results show that the penalized maximum likelihood estimator is consistent and effective. We fit the proposed model and other models in the literature to some real-world datasets and find the proposed model can have much better fits.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 5","pages":"833 - 853"},"PeriodicalIF":1.0,"publicationDate":"2023-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43469802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Least absolute deviation estimation for AR(1) processes with roots close to unity 根接近1的AR(1)过程的最小绝对偏差估计
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-01-23 DOI: 10.1007/s10463-022-00864-0
Nannan Ma, Hailin Sang, Guangyu Yang

We establish the asymptotic theory of least absolute deviation estimators for AR(1) processes with autoregressive parameter satisfying (n(rho _n-1)rightarrow gamma) for some fixed (gamma) as (nrightarrow infty), which is parallel to the results of ordinary least squares estimators developed by Andrews and Guggenberger (Journal of Time Series Analysis, 29, 203–212, 2008) in the case (gamma = 0) or Chan and Wei (Annals of Statistics, 15, 1050–1063, 1987) and Phillips (Biometrika, 74, 535–574, 1987) in the case (gamma ne 0). Simulation experiments are conducted to confirm the theoretical results and to demonstrate the robustness of the least absolute deviation estimation.

对于自回归参数满足(n(rho _n-1)rightarrow gamma)的AR(1)过程,对于某些固定的(gamma) = (nrightarrow infty),我们建立了最小绝对偏差估计量的渐近理论,这与andrew和Guggenberger (Journal of Time Series Analysis, 29,203 - 212,2008)在(gamma = 0)或Chan和Wei (Annals of Statistics, 15,1050 - 1063, 1987)和Phillips (Biometrika, 74,535 - 574)的情况下的普通最小二乘估计量的结果相似。1987)在(gamma ne 0)的情况下。仿真实验验证了理论结果,并验证了最小绝对偏差估计的鲁棒性。
{"title":"Least absolute deviation estimation for AR(1) processes with roots close to unity","authors":"Nannan Ma,&nbsp;Hailin Sang,&nbsp;Guangyu Yang","doi":"10.1007/s10463-022-00864-0","DOIUrl":"10.1007/s10463-022-00864-0","url":null,"abstract":"<div><p>We establish the asymptotic theory of least absolute deviation estimators for AR(1) processes with autoregressive parameter satisfying <span>(n(rho _n-1)rightarrow gamma)</span> for some fixed <span>(gamma)</span> as <span>(nrightarrow infty)</span>, which is parallel to the results of ordinary least squares estimators developed by Andrews and Guggenberger (Journal of Time Series Analysis, 29, 203–212, 2008) in the case <span>(gamma = 0)</span> or Chan and Wei (Annals of Statistics, 15, 1050–1063, 1987) and Phillips (Biometrika, 74, 535–574, 1987) in the case <span>(gamma ne 0)</span>. Simulation experiments are conducted to confirm the theoretical results and to demonstrate the robustness of the least absolute deviation estimation.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 5","pages":"799 - 832"},"PeriodicalIF":1.0,"publicationDate":"2023-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46897034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric multiple regression by projection on non-compactly supported bases 非紧支撑基上投影的非参数多元回归
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-01-22 DOI: 10.1007/s10463-022-00863-1
Florian Dussap

We study the nonparametric regression estimation problem with a random design in ({mathbb{R}}^{p}) with (pge 2). We do so by using a projection estimator obtained by least squares minimization. Our contribution is to consider non-compact estimation domains in ({mathbb {R}}^{p}), on which we recover the function, and to provide a theoretical study of the risk of the estimator relative to a norm weighted by the distribution of the design. We propose a model selection procedure in which the model collection is random and takes into account the discrepancy between the empirical norm and the norm associated with the distribution of design. We prove that the resulting estimator automatically optimizes the bias-variance trade-off in both norms, and we illustrate the numerical performance of our procedure on simulated data.

我们用(pge 2)研究了({mathbb{R}}^{p})中随机设计的非参数回归估计问题。我们通过使用由最小二乘最小化得到的投影估计量来做到这一点。我们的贡献是考虑({mathbb {R}}^{p})中的非紧凑估计域,我们在其上恢复函数,并提供相对于由设计分布加权的范数的估计器风险的理论研究。我们提出了一个模型选择程序,其中模型集合是随机的,并考虑到经验规范和与设计分布相关的规范之间的差异。我们证明了所得到的估计器在两个规范中自动优化偏差-方差权衡,并说明了我们的过程在模拟数据上的数值性能。
{"title":"Nonparametric multiple regression by projection on non-compactly supported bases","authors":"Florian Dussap","doi":"10.1007/s10463-022-00863-1","DOIUrl":"10.1007/s10463-022-00863-1","url":null,"abstract":"<div><p>We study the nonparametric regression estimation problem with a random design in <span>({mathbb{R}}^{p})</span> with <span>(pge 2)</span>. We do so by using a projection estimator obtained by least squares minimization. Our contribution is to consider non-compact estimation domains in <span>({mathbb {R}}^{p})</span>, on which we recover the function, and to provide a theoretical study of the risk of the estimator relative to a norm weighted by the distribution of the design. We propose a model selection procedure in which the model collection is random and takes into account the discrepancy between the empirical norm and the norm associated with the distribution of design. We prove that the resulting estimator automatically optimizes the bias-variance trade-off in both norms, and we illustrate the numerical performance of our procedure on simulated data.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 5","pages":"731 - 771"},"PeriodicalIF":1.0,"publicationDate":"2023-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47801701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Robust density power divergence estimates for panel data models 面板数据模型的稳健密度功率散度估计
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2023-01-20 DOI: 10.1007/s10463-022-00862-2
Abhijit Mandal, Beste Hamiye Beyaztas, Soutir Bandyopadhyay

The panel data regression models have become one of the most widely applied statistical approaches in different fields of research, including social, behavioral, environmental sciences, and econometrics. However, traditional least-squares-based techniques frequently used for panel data models are vulnerable to the adverse effects of data contamination or outlying observations that may result in biased and inefficient estimates and misleading statistical inference. In this study, we propose a minimum density power divergence estimation procedure for panel data regression models with random effects to achieve robustness against outliers. The robustness, as well as the asymptotic properties of the proposed estimator, are rigorously established. The finite-sample properties of the proposed method are investigated through an extensive simulation study and an application to climate data in Oman. Our results demonstrate that the proposed estimator exhibits improved performance over some traditional and robust methods in the presence of data contamination.

面板数据回归模型已成为社会科学、行为科学、环境科学和计量经济学等不同研究领域中应用最广泛的统计方法之一。然而,经常用于面板数据模型的传统基于最小二乘的技术容易受到数据污染或外围观测值的不利影响,这可能导致有偏见和低效的估计以及误导性的统计推断。在这项研究中,我们提出了一个具有随机效应的面板数据回归模型的最小密度功率散度估计程序,以实现对异常值的鲁棒性。严格地证明了该估计量的鲁棒性和渐近性。通过广泛的模拟研究和阿曼气候数据的应用,研究了所提出方法的有限样本特性。我们的结果表明,在存在数据污染的情况下,所提出的估计器比一些传统的鲁棒方法表现出更好的性能。
{"title":"Robust density power divergence estimates for panel data models","authors":"Abhijit Mandal,&nbsp;Beste Hamiye Beyaztas,&nbsp;Soutir Bandyopadhyay","doi":"10.1007/s10463-022-00862-2","DOIUrl":"10.1007/s10463-022-00862-2","url":null,"abstract":"<div><p>The panel data regression models have become one of the most widely applied statistical approaches in different fields of research, including social, behavioral, environmental sciences, and econometrics. However, traditional least-squares-based techniques frequently used for panel data models are vulnerable to the adverse effects of data contamination or outlying observations that may result in biased and inefficient estimates and misleading statistical inference. In this study, we propose a <i>minimum density power divergence</i> estimation procedure for panel data regression models with random effects to achieve robustness against outliers. The robustness, as well as the asymptotic properties of the proposed estimator, are rigorously established. The finite-sample properties of the proposed method are investigated through an extensive simulation study and an application to climate data in Oman. Our results demonstrate that the proposed estimator exhibits improved performance over some traditional and robust methods in the presence of data contamination.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 5","pages":"773 - 798"},"PeriodicalIF":1.0,"publicationDate":"2023-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43954223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annals of the Institute of Statistical Mathematics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1