Annals of the Institute of Statistical Mathematics最新文献

英文中文

Mixture of shifted binomial distributions for rating data 混合移位二项分布的评级数据

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2023-02-10 DOI: 10.1007/s10463-023-00865-7

Shaoting Li, Jiahua Chen

Rating data are a kind of ordinal categorical data routinely collected in survey sampling. The response value in such applications is confined to a finite number of ordered categories. Due to population heterogeneity, the respondents may have several different rating styles. A finite mixture model is thus most suitable to fit datasets of this nature. In this paper, we propose a two-component mixture of shifted binomial distributions for rating data. We show that this model is identifiable and propose a numerically stable penalized likelihood approach for parameter estimation. We adapt an expectation-maximization algorithm for the penalized maximum likelihood estimation. Our simulation results show that the penalized maximum likelihood estimator is consistent and effective. We fit the proposed model and other models in the literature to some real-world datasets and find the proposed model can have much better fits.

评级数据是在调查抽样中常规收集的一种有序分类数据。在这种应用中，响应值被限制在有限数量的有序类别中。由于人口异质性，受访者可能有几种不同的评级风格。因此，有限混合模型最适合拟合这种性质的数据集。在本文中，我们提出了一个双分量混合移位二项分布的评级数据。我们证明了该模型是可识别的，并提出了一种数值稳定的惩罚似然方法用于参数估计。我们采用了一种期望最大化算法来进行惩罚极大似然估计。仿真结果表明，惩罚极大似然估计是一致的、有效的。我们将提出的模型和文献中的其他模型拟合到一些现实世界的数据集，发现提出的模型可以有更好的拟合。

引用次数: 0

Least absolute deviation estimation for AR(1) processes with roots close to unity 根接近1的AR（1）过程的最小绝对偏差估计

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2023-01-23 DOI: 10.1007/s10463-022-00864-0

Nannan Ma, Hailin Sang, Guangyu Yang

We establish the asymptotic theory of least absolute deviation estimators for AR(1) processes with autoregressive parameter satisfying (n(rho _n-1)rightarrow gamma) for some fixed (gamma) as (nrightarrow infty), which is parallel to the results of ordinary least squares estimators developed by Andrews and Guggenberger (Journal of Time Series Analysis, 29, 203–212, 2008) in the case (gamma = 0) or Chan and Wei (Annals of Statistics, 15, 1050–1063, 1987) and Phillips (Biometrika, 74, 535–574, 1987) in the case (gamma ne 0). Simulation experiments are conducted to confirm the theoretical results and to demonstrate the robustness of the least absolute deviation estimation.

对于自回归参数满足(n(rho _n-1)rightarrow gamma)的AR(1)过程，对于某些固定的(gamma) = (nrightarrow infty)，我们建立了最小绝对偏差估计量的渐近理论，这与andrew和Guggenberger (Journal of Time Series Analysis, 29,203 - 212,2008)在(gamma = 0)或Chan和Wei (Annals of Statistics, 15,1050 - 1063, 1987)和Phillips (Biometrika, 74,535 - 574)的情况下的普通最小二乘估计量的结果相似。1987)在(gamma ne 0)的情况下。仿真实验验证了理论结果，并验证了最小绝对偏差估计的鲁棒性。

引用次数: 0

Nonparametric multiple regression by projection on non-compactly supported bases 非紧支撑基上投影的非参数多元回归

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2023-01-22 DOI: 10.1007/s10463-022-00863-1

Florian Dussap

We study the nonparametric regression estimation problem with a random design in ({mathbb{R}}^{p}) with (pge 2). We do so by using a projection estimator obtained by least squares minimization. Our contribution is to consider non-compact estimation domains in ({mathbb {R}}^{p}), on which we recover the function, and to provide a theoretical study of the risk of the estimator relative to a norm weighted by the distribution of the design. We propose a model selection procedure in which the model collection is random and takes into account the discrepancy between the empirical norm and the norm associated with the distribution of design. We prove that the resulting estimator automatically optimizes the bias-variance trade-off in both norms, and we illustrate the numerical performance of our procedure on simulated data.

我们用(pge 2)研究了({mathbb{R}}^{p})中随机设计的非参数回归估计问题。我们通过使用由最小二乘最小化得到的投影估计量来做到这一点。我们的贡献是考虑({mathbb {R}}^{p})中的非紧凑估计域，我们在其上恢复函数，并提供相对于由设计分布加权的范数的估计器风险的理论研究。我们提出了一个模型选择程序，其中模型集合是随机的，并考虑到经验规范和与设计分布相关的规范之间的差异。我们证明了所得到的估计器在两个规范中自动优化偏差-方差权衡，并说明了我们的过程在模拟数据上的数值性能。

引用次数: 4

Robust density power divergence estimates for panel data models 面板数据模型的稳健密度功率散度估计

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2023-01-20 DOI: 10.1007/s10463-022-00862-2

Abhijit Mandal, Beste Hamiye Beyaztas, Soutir Bandyopadhyay

The panel data regression models have become one of the most widely applied statistical approaches in different fields of research, including social, behavioral, environmental sciences, and econometrics. However, traditional least-squares-based techniques frequently used for panel data models are vulnerable to the adverse effects of data contamination or outlying observations that may result in biased and inefficient estimates and misleading statistical inference. In this study, we propose a minimum density power divergence estimation procedure for panel data regression models with random effects to achieve robustness against outliers. The robustness, as well as the asymptotic properties of the proposed estimator, are rigorously established. The finite-sample properties of the proposed method are investigated through an extensive simulation study and an application to climate data in Oman. Our results demonstrate that the proposed estimator exhibits improved performance over some traditional and robust methods in the presence of data contamination.

面板数据回归模型已成为社会科学、行为科学、环境科学和计量经济学等不同研究领域中应用最广泛的统计方法之一。然而，经常用于面板数据模型的传统基于最小二乘的技术容易受到数据污染或外围观测值的不利影响，这可能导致有偏见和低效的估计以及误导性的统计推断。在这项研究中，我们提出了一个具有随机效应的面板数据回归模型的最小密度功率散度估计程序，以实现对异常值的鲁棒性。严格地证明了该估计量的鲁棒性和渐近性。通过广泛的模拟研究和阿曼气候数据的应用，研究了所提出方法的有限样本特性。我们的结果表明，在存在数据污染的情况下，所提出的估计器比一些传统的鲁棒方法表现出更好的性能。

引用次数: 0

Correction to: Group least squares regression for linear models with strongly correlated predictor variables 校正:具有强相关预测变量的线性模型的组最小二乘回归

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2023-01-11 DOI: 10.1007/s10463-022-00861-3

Min Tsao

引用次数: 0

A copula spectral test for pairwise time reversibility 两两时间可逆性的联结谱检验

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2022-12-26 DOI: 10.1007/s10463-022-00859-x

Shibin Zhang

In this paper, we propose a new frequency domain test for pairwise time reversibility at any specific couple of quantiles of two-dimensional marginal distribution. The proposed test is applicable to a very broad class of time series, regardless of the existence of moments and Markovian properties. By varying the couple of quantiles, the test can detect any violation of pairwise time reversibility. Our approach is based on an estimator of the (L^2)-distance between the imaginary part of copula spectral density kernel and its value under the null hypothesis. We show that the limiting distribution of the proposed test statistic is normal and investigate the finite sample performance by means of a simulation study. We illustrate the use of the proposed test by applying it to stock price data.

在本文中，我们提出了一种新的二维边缘分布的双时间可逆性的频域检验方法。所提出的检验适用于非常广泛的时间序列，而不考虑矩和马尔可夫性质的存在。通过改变一对分位数，测试可以检测任何违反两两时间可逆性。我们的方法是基于零假设下copula谱密度核的虚部与其值之间的(L^2) -距离的估计量。我们证明了所提出的检验统计量的极限分布是正态分布，并通过仿真研究了有限样本的性能。我们通过将所提出的测试应用于股票价格数据来说明它的使用。

引用次数: 1

Generation of all randomizations using circuits 使用电路生成所有随机化。

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2022-12-23 DOI: 10.1007/s10463-022-00860-4

Elena Pesce, Fabio Rapallo, Eva Riccomagno, Henry P. Wynn

After a rich history in medicine, randomized control trials (RCTs), both simple and complex, are in increasing use in other areas, such as web-based A/B testing and planning and design of decisions. A main objective of RCTs is to be able to measure parameters, and contrasts in particular, while guarding against biases from hidden confounders. After careful definitions of classical entities such as contrasts, an algebraic method based on circuits is introduced which gives a wide choice of randomization schemes.

在拥有丰富的医学历史之后，随机对照试验（RCT），无论是简单的还是复杂的，在其他领域的应用越来越多，例如基于网络的a/B测试以及决策的规划和设计。随机对照试验的一个主要目标是能够测量参数，特别是对比度，同时防止隐藏的混杂因素带来的偏差。在仔细定义了对比度等经典实体后，引入了一种基于电路的代数方法，该方法提供了广泛的随机化方案选择。

引用次数: 0

Model averaging for semiparametric varying coefficient quantile regression models 半参数变系数分位数回归模型的模型平均

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2022-12-22 DOI: 10.1007/s10463-022-00857-z

Zishu Zhan, Yang Li, Yuhong Yang, Cunjie Lin

In this study, we propose a model averaging approach to estimating the conditional quantiles based on a set of semiparametric varying coefficient models. Different from existing literature on the subject, we consider a particular form for all candidates, where there is only one varying coefficient in each sub-model, and all the candidates under investigation may be misspecified. We propose a weight choice criterion based on a leave-more-out cross-validation objective function. Moreover, the resulting averaging estimator is more robust against model misspecification due to the weighted coefficients that adjust the relative importance of the varying and constant coefficients for the same predictors. We prove out statistical properties for each sub-model and asymptotic optimality of the weight selection method. Simulation studies show that the proposed procedure has satisfactory prediction accuracy. An analysis of a skin cutaneous melanoma data further supports the merits of the proposed approach.

在这项研究中，我们提出了一种基于半参数变系数模型的模型平均方法来估计条件分位数。与现有文献不同的是，我们考虑了所有候选项的特定形式，其中每个子模型中只有一个变化系数，并且所有被调查的候选项都可能被错误指定。我们提出了一个基于留多交叉验证目标函数的权重选择准则。此外，由于加权系数调整了相同预测因子的变系数和常系数的相对重要性，因此所得的平均估计器对模型错误规范的鲁棒性更强。证明了各子模型的统计性质和权重选择方法的渐近最优性。仿真研究表明，该方法具有较好的预测精度。对皮肤黑色素瘤数据的分析进一步支持了所提出方法的优点。

引用次数: 1

Slash distributions, generalized convolutions, and extremes 斜线分布，广义卷积和极值

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2022-12-20 DOI: 10.1007/s10463-022-00858-y

M. Arendarczyk, T. J. Kozubowski, A. K. Panorska

An (alpha)-slash distribution built upon a random variable X is a heavy tailed distribution corresponding to (Y=X/U^{1/alpha }), where U is standard uniform random variable, independent of X. We point out and explore a connection between (alpha)-slash distributions, which are gaining popularity in statistical practice, and generalized convolutions, which come up in the probability theory as generalizations of the standard concept of the convolution of probability measures and allow for the operation between the measures to be random itself. The stochastic interpretation of Kendall convolution discussed in this work brings this theoretical concept closer to statistical practice, and leads to new results for (alpha)-slash distributions connected with extremes. In particular, we show that the maximum of independent random variables with (alpha)-slash distributions is also a random variable with an (alpha)-slash distribution. Our theoretical results are illustrated by several examples involving standard and novel probability distributions and extremes.

建立在随机变量X上的(alpha) -斜线分布是对应于(Y=X/U^{1/alpha })的重尾分布，其中U是独立于X的标准均匀随机变量。我们指出并探索了在统计实践中越来越流行的(alpha) -斜线分布与广义卷积之间的联系。它出现在概率论中作为概率测度卷积标准概念的概括并且允许测度之间的运算本身是随机的。在这项工作中讨论的肯德尔卷积的随机解释使这一理论概念更接近统计实践，并导致与极端相关的(alpha) -斜线分布的新结果。特别地，我们证明了具有(alpha) -斜线分布的独立随机变量的最大值也是具有(alpha) -斜线分布的随机变量。我们的理论结果通过几个涉及标准和新的概率分布和极值的例子来说明。

{"title":"Slash distributions, generalized convolutions, and extremes","authors":"M. Arendarczyk, T. J. Kozubowski, A. K. Panorska","doi":"10.1007/s10463-022-00858-y","DOIUrl":"10.1007/s10463-022-00858-y","url":null,"abstract":"<div>An (alpha)-slash distribution built upon a random variable X is a heavy tailed distribution corresponding to (Y=X/U^{1/alpha }), where U is standard uniform random variable, independent of X. We point out and explore a connection between (alpha)-slash distributions, which are gaining popularity in statistical practice, and generalized convolutions, which come up in the probability theory as generalizations of the standard concept of the convolution of probability measures and allow for the operation between the measures to be random itself. The stochastic interpretation of Kendall convolution discussed in this work brings this theoretical concept closer to statistical practice, and leads to new results for (alpha)-slash distributions connected with extremes. In particular, we show that the maximum of independent random variables with (alpha)-slash distributions is also a random variable with an (alpha)-slash distribution. Our theoretical results are illustrated by several examples involving standard and novel probability distributions and extremes.</div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"75 4","pages":"593 - 617"},"PeriodicalIF":1.0,"publicationDate":"2022-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10463-022-00858-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42924932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

A unified precision matrix estimation framework via sparse column-wise inverse operator under weak sparsity 弱稀疏性下基于稀疏列逆算子的统一精度矩阵估计框架

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Annals of the Institute of Statistical Mathematics

Pub Date : 2022-12-08 DOI: 10.1007/s10463-022-00856-0

Zeyu Wu, Cheng Wang, Weidong Liu

In this paper, we estimate the high-dimensional precision matrix under the weak sparsity condition where many entries are nearly zero. We revisit the sparse column-wise inverse operator estimator and derive its general error bounds under the weak sparsity condition. A unified framework is established to deal with various cases including the heavy-tailed data, the non-paranormal data, and the matrix variate data. These new methods can achieve the same convergence rates as the existing methods and can be implemented efficiently.

在弱稀疏性条件下，我们估计了高维精度矩阵，其中许多项接近于零。我们重新研究了稀疏列逆算子估计，并推导了它在弱稀疏条件下的一般误差界。建立了一个统一的框架来处理各种情况，包括重尾数据、非异常数据和矩阵变量数据。这些新方法可以达到与现有方法相同的收敛速度，并且可以有效地实现。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Annals of the Institute of Statistical Mathematics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀