首页 > 最新文献

Mathematical Methods of Statistics最新文献

英文 中文
On Aggregation of Uncensored and Censored Observations 关于未删减和删减观测数据的聚合
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-15 DOI: 10.3103/s1066530724700078
Sam Efromovich

Abstract

In survival analysis a random right-censoring partitions data into uncensored and censored observations of the lifetime of interest. The dominance of uncensored observations is a familiar methodology in nonparametric estimation motivated by the classical Kaplan–Meier product-limit and Cox partial likelihood estimators. Nonetheless, for high rate censoring it is of interest to understand what, if anything, can be done by aggregating uncensored and censored observations for the staple nonparametric problems of density and regression estimation. The oracle, who knows distribution of the censoring lifetime, can use each subsample for consistent estimation and hence may shed light on the aggregation. The oracle’s asymptotic theory reveals that density estimation, based on censored observations, is an ill-posed problem with slower rates of risk convergence, the ill-posedness occurs in frequency-domain, its severity increases with frequency, and accordingly a special aggregation on low frequencies may be beneficial. On the other hand, censored observations are not ill-posed for nonparametric regression and the aggregation is feasible. Based on these theoretical results, methodology of aggregation in frequency domain is developed and proposed estimators are tested on simulated and real examples.

摘要 在生存分析中,随机右删减将数据分为未删减和已删减的相关生命期观测值。在非参数估计中,未删减观测值占主导地位是一种熟悉的方法,其动机是经典的 Kaplan-Meier 乘积限值和 Cox 部分似然估计器。然而,对于高删失率,我们有兴趣了解,在密度和回归估计等主要非参数问题上,通过汇总未删失和删失观测值,可以做些什么(如果有的话)。知道剔除寿命分布的神谕者可以使用每个子样本进行一致的估计,因此可能会对聚合有所启发。oracle的渐近理论显示,基于删减观测值的密度估计是一个风险收敛速度较慢的问题,这种问题发生在频率域,其严重程度随频率的增加而增加,因此在低频率上进行特殊的聚合可能是有益的。另一方面,对于非参数回归而言,有删减的观测数据不会出现问题,而且聚合也是可行的。基于这些理论结果,我们开发了频域聚合方法,并在模拟和实际例子中测试了所提出的估计器。
{"title":"On Aggregation of Uncensored and Censored Observations","authors":"Sam Efromovich","doi":"10.3103/s1066530724700078","DOIUrl":"https://doi.org/10.3103/s1066530724700078","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>In survival analysis a random right-censoring partitions data into uncensored and censored observations of the lifetime of interest. The dominance of uncensored observations is a familiar methodology in nonparametric estimation motivated by the classical Kaplan–Meier product-limit and Cox partial likelihood estimators. Nonetheless, for high rate censoring it is of interest to understand what, if anything, can be done by aggregating uncensored and censored observations for the staple nonparametric problems of density and regression estimation. The oracle, who knows distribution of the censoring lifetime, can use each subsample for consistent estimation and hence may shed light on the aggregation. The oracle’s asymptotic theory reveals that density estimation, based on censored observations, is an ill-posed problem with slower rates of risk convergence, the ill-posedness occurs in frequency-domain, its severity increases with frequency, and accordingly a special aggregation on low frequencies may be beneficial. On the other hand, censored observations are not ill-posed for nonparametric regression and the aggregation is feasible. Based on these theoretical results, methodology of aggregation in frequency domain is developed and proposed estimators are tested on simulated and real examples.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of Parameters of Misclassified Size Biased Uniform Poisson Distribution and Its Application 误分类大小偏差均匀泊松分布参数估计及其应用
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-15 DOI: 10.3103/s106653072470008x
B. S. Trivedi, D. R. Barot, M. N. Patel

Abstract

Statistical data analysis is of great interest in every field ofmanagement, business, engineering, medicine, etc. At the time ofclassification and analysis, errors may arise, like aclassification of observation in the other class instead of theactual class. All fields of science and economics have substantialproblems due to misclassification errors in the observed data. Dueto a misclassification error in the data, the sampling process maynot suggest an appropriate probability distribution, and in thatcase, inference is impaired. When these types of errors areidentified in variables, it is expected to consider the problem’ssolution regarding classification errors. This paper presents thesituation where specific counts are reported erroneously asbelonging to other counts in the context of size biased UniformPoisson distribution, the so-called misclassified size biasedUniform Poisson distribution. Further, we have estimated theparameters of misclassified size biased Uniform Poissondistribution by applying the method of moments, maximum likelihoodmethod, and approximate Bayes estimation method. A simulationstudy is carried out to assess the performance of estimationmethods. A real dataset is discussed to demonstrate thesuitability and applicability of the proposed distribution in themodeling count dataset. A Monte Carlo simulation study ispresented to compare the estimators. The simulation results showthat the ML estimates perform better than their correspondingmoment estimates and approximate Bayes estimates.

摘要 统计数据分析在管理、商业、工程、医学等各个领域都具有重要意义。在进行分类和分析时,可能会出现错误,如将观测数据归入其他类别而非实际类别。科学和经济学的所有领域都存在因观测数据分类错误而导致的重大问题。由于数据中的分类错误,抽样过程可能无法显示适当的概率分布,在这种情况下,推断就会受到影响。当发现变量中存在这类误差时,就需要考虑如何解决分类误差问题。本文介绍了在大小偏统一泊松分布的背景下,特定计数被错误地报告为属于其他计数的情况,即所谓的误分类大小偏统一泊松分布。此外,我们还运用矩量法、最大似然法和近似贝叶斯估计法估计了误分类大小偏倚均匀泊松分布的参数。通过模拟研究来评估估计方法的性能。讨论了一个真实数据集,以证明所提出的分布在模拟计数数据集中的适用性和应用性。为了比较估计方法,还进行了蒙特卡罗模拟研究。仿真结果表明,ML 估计结果优于相应的矩估计结果和近似贝叶斯估计结果。
{"title":"Estimation of Parameters of Misclassified Size Biased Uniform Poisson Distribution and Its Application","authors":"B. S. Trivedi, D. R. Barot, M. N. Patel","doi":"10.3103/s106653072470008x","DOIUrl":"https://doi.org/10.3103/s106653072470008x","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Statistical data analysis is of great interest in every field of\u0000management, business, engineering, medicine, etc. At the time of\u0000classification and analysis, errors may arise, like a\u0000classification of observation in the other class instead of the\u0000actual class. All fields of science and economics have substantial\u0000problems due to misclassification errors in the observed data. Due\u0000to a misclassification error in the data, the sampling process may\u0000not suggest an appropriate probability distribution, and in that\u0000case, inference is impaired. When these types of errors are\u0000identified in variables, it is expected to consider the problem’s\u0000solution regarding classification errors. This paper presents the\u0000situation where specific counts are reported erroneously as\u0000belonging to other counts in the context of size biased Uniform\u0000Poisson distribution, the so-called misclassified size biased\u0000Uniform Poisson distribution. Further, we have estimated the\u0000parameters of misclassified size biased Uniform Poisson\u0000distribution by applying the method of moments, maximum likelihood\u0000method, and approximate Bayes estimation method. A simulation\u0000study is carried out to assess the performance of estimation\u0000methods. A real dataset is discussed to demonstrate the\u0000suitability and applicability of the proposed distribution in the\u0000modeling count dataset. A Monte Carlo simulation study is\u0000presented to compare the estimators. The simulation results show\u0000that the ML estimates perform better than their corresponding\u0000moment estimates and approximate Bayes estimates.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rates of the Strong Uniform Consistency with Rates for Conditional U-Statistics Estimators with General Kernels on Manifolds 曲面上具有一般核的条件 U 统计估计器的强均匀一致率与率
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-15 DOI: 10.3103/s1066530724700066
Salim Bouzebda, Nourelhouda Taachouche

Abstract

(U)-statistics represent a fundamental class of statistics from modeling quantities of interest defined by multi-subject responses. (U)-statistics generalize the empirical mean of a random variable (X) to sums over every (m)-tuple of distinct observations of (X). Stute [103] introduced a class of so-called conditional (U)-statistics, which may be viewed as a generalization of the Nadaraya-Watson estimates of a regression function. Stute proved their strong pointwise consistency to:

$$r^{(k)}(varphi,tilde{mathbf{t}}):=mathbb{E}[varphi(Y_{1},ldots,Y_{k})|(X_{1},ldots,X_{k})=tilde{mathbf{t}}]quadtextrm{for}quadtilde{mathbf{t}}=left(mathbf{t}_{1},ldots,mathbf{t}_{k}right)inmathbb{R}^{dk}.$$

In the analysis of modern machine learning algorithms, sometimes we need to manipulate kernel estimation within the nonconventional setting with intricate kernels that might even be irregular and asymmetric. In this general setting, we obtain the strong uniform consistency result for the general kernel on Riemannian manifolds with Riemann integrable kernels for the conditional (U)-processes. We treat both cases when the class of functions is bounded or unbounded, satisfying some moment conditions. These results are proved under some standard structural conditions on the classes of functions and some mild conditions on the model. Our findings are applied to the regression function, the set indexed conditional (U)-statistics, the generalized (U)-statistics, and the discrimination problem. The theoretical results established in this paper are (or will be) key tools for many further developments in manifold data analysis.

Abstract(U)-statistics 代表了由多受试者反应定义的感兴趣数量建模的一类基本统计。(U)-statistics 将随机变量 (X) 的经验平均值概括为 (X) 的每一个 (m)-tuple 的不同观测值的总和。Stute[103]引入了一类所谓的条件(U)统计量,可以将其视为回归函数的 Nadaraya-Watson 估计值的一般化。Stute 证明了它们的强点一致性:$$r^{(k)}(varphi,tilde{mathbf{t}}):=mathbb{E}[varphi(Y_{1},ldots,Y_{k})|(X_{1},ldots,X_{k})=tilde{mathbf{t}}]quadtextrm{for}quadtilde{mathbf{t}}=left(mathbf{t}_{1},ldots,mathbf{t}_{k}right)inmathbb{R}^{dk}.$$ 在分析现代机器学习算法时,有时我们需要在非常规环境下使用错综复杂的内核来处理内核估计,这些内核甚至可能是不规则和不对称的。在这种一般情况下,我们得到了在黎曼流形上具有黎曼可积分核的条件(U)过程的一般核的强均匀一致性结果。我们处理了满足某些矩条件的有界或无界函数类的两种情况。这些结果是在函数类的一些标准结构条件和模型的一些温和条件下证明的。我们的发现被应用于回归函数、集合索引条件 (U)统计量、广义 (U)统计量和判别问题。本文建立的理论结果是(或将是)流形数据分析进一步发展的关键工具。
{"title":"Rates of the Strong Uniform Consistency with Rates for Conditional U-Statistics Estimators with General Kernels on Manifolds","authors":"Salim Bouzebda, Nourelhouda Taachouche","doi":"10.3103/s1066530724700066","DOIUrl":"https://doi.org/10.3103/s1066530724700066","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>\u0000<span>(U)</span>-statistics represent a fundamental class of statistics from modeling quantities of interest defined by multi-subject responses. <span>(U)</span>-statistics generalize the empirical mean of a random variable <span>(X)</span> to sums over every <span>(m)</span>-tuple of distinct observations of <span>(X)</span>. Stute [103] introduced a class of so-called conditional <span>(U)</span>-statistics, which may be viewed as a generalization of the Nadaraya-Watson estimates of a regression function. Stute proved their strong pointwise consistency to:</p><span>$$r^{(k)}(varphi,tilde{mathbf{t}}):=mathbb{E}[varphi(Y_{1},ldots,Y_{k})|(X_{1},ldots,X_{k})=tilde{mathbf{t}}]quadtextrm{for}quadtilde{mathbf{t}}=left(mathbf{t}_{1},ldots,mathbf{t}_{k}right)inmathbb{R}^{dk}.$$</span><p>In the analysis of modern machine learning algorithms, sometimes we need to manipulate kernel estimation within the nonconventional setting with intricate kernels that might even be irregular and asymmetric. In this general setting, we obtain the strong uniform consistency result for the general kernel on Riemannian manifolds with Riemann integrable kernels for the conditional <span>(U)</span>-processes. We treat both cases when the class of functions is bounded or unbounded, satisfying some moment conditions. These results are proved under some standard structural conditions on the classes of functions and some mild conditions on the model. Our findings are applied to the regression function, the set indexed conditional <span>(U)</span>-statistics, the generalized <span>(U)</span>-statistics, and the discrimination problem. The theoretical results established in this paper are (or will be) key tools for many further developments in manifold data analysis.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic Comparisons of the Smallest Claim Amounts from Two Heterogeneous Portfolios Following Exponentiated Weibull Distribution 两种异质投资组合的最小索赔额的随机比较,遵循幂次韦布尔分布
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-15 DOI: 10.3103/s1066530724700108
Suheir Kareem Ramani, Habib Jafari, Ghobad Saadat Kia

Abstract

In actuarial science, it is often of interest to compare stochastically smallest claim amounts from heterogeneous portfolios. In this paper, we obtain the usual stochastic order between the smallest claim amounts when the matrix of parameters ((boldsymbol{alpha}), (boldsymbol{lambda})) changes to another matrix in terms of chain majorization order. By using the Archimedean copula and weak majorization conceptions, we also obtain some conditions for comparison of smallest claim amounts in terms of usual stochastic order.

摘要在精算学中,比较来自异质投资组合的随机最小索赔额通常是很有意义的。在本文中,当参数矩阵((boldsymbol{alpha}), (boldsymbol{lambda}))变化为另一个矩阵时,我们得到了最小索赔额之间通常的随机顺序,即链式大化顺序。通过使用阿基米德共轭和弱主要化概念,我们还得到了一些按通常随机顺序比较最小索赔额的条件。
{"title":"Stochastic Comparisons of the Smallest Claim Amounts from Two Heterogeneous Portfolios Following Exponentiated Weibull Distribution","authors":"Suheir Kareem Ramani, Habib Jafari, Ghobad Saadat Kia","doi":"10.3103/s1066530724700108","DOIUrl":"https://doi.org/10.3103/s1066530724700108","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>In actuarial science, it is often of interest to compare stochastically smallest claim amounts from heterogeneous portfolios. In this paper, we obtain the usual stochastic order between the smallest claim amounts when the matrix of parameters <span>((boldsymbol{alpha})</span>, <span>(boldsymbol{lambda}))</span> changes to another matrix in terms of chain majorization order. By using the Archimedean copula and weak majorization conceptions, we also obtain some conditions for comparison of smallest claim amounts in terms of usual stochastic order.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotic Properties of Extrema of Moving Sums of Independent Non-identically Distributed Variables 独立非同分布变量移动和的渐近特性
IF 0.5 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-15 DOI: 10.3103/s1066530724700091
Narayanaswamy Balakrishnan, Alexei Stepanov

Abstract

In this work, we discuss the asymptotic behavior of minima and maxima of moving sums of independent and non-identically distributed random variables. We first establish some theoretical results associated with the asymptotic behavior of minima and maxima. Then, we apply these results to exponential and normal models. We also derive strong limit results for the minima and maxima of moving sums taken from these two models.

摘要 在这项工作中,我们讨论了独立和非同分布随机变量移动总和的最小值和最大值的渐近行为。我们首先建立了一些与最小值和最大值渐近行为相关的理论结果。然后,我们将这些结果应用于指数模型和正态模型。我们还推导出了这两个模型中移动和的最小值和最大值的强极限结果。
{"title":"Asymptotic Properties of Extrema of Moving Sums of Independent Non-identically Distributed Variables","authors":"Narayanaswamy Balakrishnan, Alexei Stepanov","doi":"10.3103/s1066530724700091","DOIUrl":"https://doi.org/10.3103/s1066530724700091","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>In this work, we discuss the asymptotic behavior of minima and maxima of moving sums of independent and non-identically distributed random variables. We first establish some theoretical results associated with the asymptotic behavior of minima and maxima. Then, we apply these results to exponential and normal models. We also derive strong limit results for the minima and maxima of moving sums taken from these two models.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141717513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional Uniform-in-Bandwidth Moderate Deviation Principle for the Local Empirical Processes Involving Functional Data 涉及函数数据的局部经验过程的函数统一带宽适度偏差原理
IF 0.5 Q4 Mathematics Pub Date : 2024-04-25 DOI: 10.3103/s1066530724700030
Nour-Eddine Berrahou, Salim Bouzebda, Lahcen Douge

Abstract

Our research employs general empirical process methods to investigate and establish moderate deviation principles for kernel-type function estimators that rely on an infinite-dimensional covariate, subject to mild regularity conditions. In doing so, we introduce a valuable moderate deviation principle for a function-indexed process, utilizing intricate exponential contiguity arguments. The primary objective of this paper is to contribute to the existing literature on functional data analysis by establishing functional moderate deviation principles for both Nadaraya–Watson and conditional distribution processes. These principles serve as fundamental tools for analyzing and understanding the behavior of these processes in the context of functional data analysis. By extending the scope of moderate deviation principles to the realm of functional data analysis, we enhance our understanding of the statistical properties and limitations of kernel-type function estimators when dealing with infinite-dimensional covariates. Our findings provide valuable insights and contribute to the advancement of statistical methodology in functional data analysis.

摘要我们的研究采用了一般经验过程方法,研究并建立了依赖于无穷维协变量的核型函数估计器的适度偏差原则,但须满足温和的正则性条件。在此过程中,我们利用错综复杂的指数连续性论证,为函数索引过程引入了有价值的适度偏差原理。本文的主要目的是通过建立纳达拉亚-沃森和条件分布过程的函数适度偏差原理,为现有的函数数据分析文献做出贡献。这些原则是在函数数据分析中分析和理解这些过程行为的基本工具。通过将中等偏差原理的范围扩展到函数数据分析领域,我们加深了对核型函数估计器在处理无限维协变量时的统计特性和局限性的理解。我们的研究结果提供了宝贵的见解,有助于推动函数数据分析统计方法的发展。
{"title":"Functional Uniform-in-Bandwidth Moderate Deviation Principle for the Local Empirical Processes Involving Functional Data","authors":"Nour-Eddine Berrahou, Salim Bouzebda, Lahcen Douge","doi":"10.3103/s1066530724700030","DOIUrl":"https://doi.org/10.3103/s1066530724700030","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Our research employs general empirical process methods to investigate and establish moderate deviation principles for kernel-type function estimators that rely on an infinite-dimensional covariate, subject to mild regularity conditions. In doing so, we introduce a valuable moderate deviation principle for a function-indexed process, utilizing intricate exponential contiguity arguments. The primary objective of this paper is to contribute to the existing literature on functional data analysis by establishing functional moderate deviation principles for both Nadaraya–Watson and conditional distribution processes. These principles serve as fundamental tools for analyzing and understanding the behavior of these processes in the context of functional data analysis. By extending the scope of moderate deviation principles to the realm of functional data analysis, we enhance our understanding of the statistical properties and limitations of kernel-type function estimators when dealing with infinite-dimensional covariates. Our findings provide valuable insights and contribute to the advancement of statistical methodology in functional data analysis.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140803620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Controlling Separation in Generating Samples for Logistic Regression Models 在生成逻辑回归模型样本时控制分离度
IF 0.5 Q4 Mathematics Pub Date : 2024-04-25 DOI: 10.3103/s1066530724700017
Huong T. T. Pham, Hoa Pham

Abstract

Separation has a significant impact on parameter estimates for logistic regression models in frequentist approach and in Bayesian approach. When separation presents in a sample, the maximum likelihood estimation (MLE) does not exist through standard estimation methods. The existence of posterior means is affected by the presence of separation and also depended on the forms of prior distributions. Therefore, controlling the appearance of separation in generating samples from the logistic regression models has an important role for parameter estimation techniques. In this paper, we propose necessary and sufficient conditions for separation occurring in the logistic regression samples with two dimensional models and multiple dimensional models of independent variables. By using the technique of rotating Castesian coordinates of p dimensions, the characteristic of separation occurring in general cases is presented. Using these results, we propose algorithms to control the probability of separation appearance in generated samples for given sample sizes and multiple dimensional models of independent variables. The simulation studies show that the proposed algorithms can effectively generate the designed random samples with controlling the probability of separation appearance.

摘要 在频数法和贝叶斯法中,分离对逻辑回归模型的参数估计有重大影响。当样本中出现分离时,最大似然估计(MLE)就无法通过标准估计方法实现。后验均值的存在受到分离现象的影响,同时也取决于先验分布的形式。因此,在生成逻辑回归模型样本时控制分离的出现对参数估计技术具有重要作用。本文提出了自变量二维模型和多维模型的逻辑回归样本出现分离的必要条件和充分条件。通过使用 p 维旋转 Castesian 坐标技术,提出了一般情况下发生分离的特征。利用这些结果,我们提出了在给定样本大小和自变量多维模型的情况下,控制生成样本中出现分离的概率的算法。模拟研究表明,所提出的算法可以有效生成设计的随机样本,并控制分离出现的概率。
{"title":"Controlling Separation in Generating Samples for Logistic Regression Models","authors":"Huong T. T. Pham, Hoa Pham","doi":"10.3103/s1066530724700017","DOIUrl":"https://doi.org/10.3103/s1066530724700017","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Separation has a significant impact on parameter estimates for logistic regression models in frequentist approach and in Bayesian approach. When separation presents in a sample, the maximum likelihood estimation (MLE) does not exist through standard estimation methods. The existence of posterior means is affected by the presence of separation and also depended on the forms of prior distributions. Therefore, controlling the appearance of separation in generating samples from the logistic regression models has an important role for parameter estimation techniques. In this paper, we propose necessary and sufficient conditions for separation occurring in the logistic regression samples with two dimensional models and multiple dimensional models of independent variables. By using the technique of rotating Castesian coordinates of p dimensions, the characteristic of separation occurring in general cases is presented. Using these results, we propose algorithms to control the probability of separation appearance in generated samples for given sample sizes and multiple dimensional models of independent variables. The simulation studies show that the proposed algorithms can effectively generate the designed random samples with controlling the probability of separation appearance.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140803675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing Monotonicity: An Approach Based on Transformed Order Statistics 评估单调性:基于变换阶次统计的方法
IF 0.5 Q4 Mathematics Pub Date : 2024-04-25 DOI: 10.3103/s1066530724700054
Aleksandr Chen, Nadezhda Gribkova, Ričardas Zitikis

Abstract

In a number of research areas, such as non-convex optimization and machine learning, determining and assessing regions of monotonicity of functions is pivotal. Numerically, it can be done using the proportion of positive (or negative) increments of transformed ordered inputs. When the number of inputs grows, the proportion tends to an index of increase (or decrease) of the underlying function. In this paper, we introduce a most general index of monotonicity and provide its interpretation in all practically relevant scenarios, including those that arise when the distribution of inputs has jumps and flat regions, and when the function is only piecewise differentiable. This enables us to assess monotonicity of very general functions under particularly mild conditions on the inputs.

摘要 在非凸优化和机器学习等多个研究领域,确定和评估函数的单调性区域至关重要。在数值上,可以利用变换有序输入的正(或负)增量比例来实现。当输入的数量增加时,该比例就会趋向于基础函数的增加(或减少)指数。在本文中,我们引入了一种最通用的单调性指数,并在所有与实际相关的情况下对其进行了解释,包括输入分布存在跳跃和平坦区域时,以及函数仅为片断微分时出现的情况。这使我们能够在输入条件特别温和的情况下评估非常一般的函数的单调性。
{"title":"Assessing Monotonicity: An Approach Based on Transformed Order Statistics","authors":"Aleksandr Chen, Nadezhda Gribkova, Ričardas Zitikis","doi":"10.3103/s1066530724700054","DOIUrl":"https://doi.org/10.3103/s1066530724700054","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>In a number of research areas, such as non-convex optimization and machine learning, determining and assessing regions of monotonicity of functions is pivotal. Numerically, it can be done using the proportion of positive (or negative) increments of transformed ordered inputs. When the number of inputs grows, the proportion tends to an index of increase (or decrease) of the underlying function. In this paper, we introduce a most general index of monotonicity and provide its interpretation in all practically relevant scenarios, including those that arise when the distribution of inputs has jumps and flat regions, and when the function is only piecewise differentiable. This enables us to assess monotonicity of very general functions under particularly mild conditions on the inputs.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140803672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Truncated Estimators for a Precision Matrix 精确矩阵的截断估计器
IF 0.5 Q4 Mathematics Pub Date : 2024-04-25 DOI: 10.3103/s1066530724700029
Anis M. Haddouche, Dominique Fourdrinier

Abstract

In this paper, we estimate the precision matrix ({Sigma}^{-1}) of a Gaussian multivariate linear regression model through its canonical form (({Z}^{T},{U}^{T})^{T}) where (Z) and (U) are respectively an (mtimes p) and an (ntimes p) matrices. This problem is addressed under the data-based loss function (textrm{tr} [({hat{Sigma}}^{-1}-{Sigma}^{-1})S]^{2}), where ({hat{Sigma}}^{-1}) estimates ({Sigma}^{-1}), for any ordering of (m,n) and (p), in a unified approach. We derive estimators which, besides the information contained in the sample covariance matrix (S={U}^{T}U), use the information contained in the sample mean (Z). We provide conditions for which these estimators improve over the usual estimators (a{S}^{+}) where (a) is a positive constant and ({S}^{+}) is the Moore-Penrose inverse of (S). Thanks to the role of (Z), such estimators are also improved by their truncated version.

Abstract 在本文中,我们通过高斯多元线性回归模型的规范形式来估计其精度矩阵({({Z}^{T},{U}^{T})^{T}),其中(Z)和(U)分别是一个(m/times p) 矩阵和一个(n/times p) 矩阵。在基于数据的损失函数(textrm{tr} [({hat{Sigma}}^{-1}-{Sigma}^{-1})S]^{2}) 下,这个问题得到了解决,其中({hat{Sigma}}^{-1})以统一的方法估计了({Sigma}^{-1}),对于(m,n)和(p)的任意排序。除了样本协方差矩阵(S={U}^{T}U)中包含的信息外,我们还得出了使用样本平均值(Z)中包含的信息的估计值。我们提供了这些估计值优于通常估计值的条件,其中 (a)是一个正常数,({S}^{+})是(S)的摩尔-彭罗斯倒数。由于(Z)的作用,这些估计值也会通过它们的截断版本得到改进。
{"title":"Truncated Estimators for a Precision Matrix","authors":"Anis M. Haddouche, Dominique Fourdrinier","doi":"10.3103/s1066530724700029","DOIUrl":"https://doi.org/10.3103/s1066530724700029","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>In this paper, we estimate the precision matrix <span>({Sigma}^{-1})</span> of a Gaussian multivariate linear regression model through its canonical form <span>(({Z}^{T},{U}^{T})^{T})</span> where <span>(Z)</span> and <span>(U)</span> are respectively an <span>(mtimes p)</span> and an <span>(ntimes p)</span> matrices. This problem is addressed under the data-based loss function <span>(textrm{tr} [({hat{Sigma}}^{-1}-{Sigma}^{-1})S]^{2})</span>, where <span>({hat{Sigma}}^{-1})</span> estimates <span>({Sigma}^{-1})</span>, for any ordering of <span>(m,n)</span> and <span>(p)</span>, in a unified approach. We derive estimators which, besides the information contained in the sample covariance matrix <span>(S={U}^{T}U)</span>, use the information contained in the sample mean <span>(Z)</span>. We provide conditions for which these estimators improve over the usual estimators <span>(a{S}^{+})</span> where <span>(a)</span> is a positive constant and <span>({S}^{+})</span> is the Moore-Penrose inverse of <span>(S)</span>. Thanks to the role of <span>(Z)</span>, such estimators are also improved by their truncated version.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140803686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterizing Existence and Location of the ML Estimate in the Conway–Maxwell–Poisson Model 确定康威-麦克斯韦-泊松模型中 ML 估计值的存在性和位置
IF 0.5 Q4 Mathematics Pub Date : 2024-04-25 DOI: 10.3103/s1066530724700042
Stefan Bedbur, Anton Imm, Udo Kamps

Abstract

As a flexible extension of the common Poisson model, the Conway–Maxwell–Poisson distribution allows for describing under- and overdispersion in count data via an additional parameter. Estimation methods for two Conway–Maxwell–Poisson parameters are then required to specify the model. In this work, two characterization results are provided related to maximum likelihood estimation of the Conway–Maxwell–Poisson parameters. The first states that maximum likelihood estimation fails if and only if the range of the observations is less than two. Assuming that the maximum likelihood estimate exists, the second result then comprises a simple necessary and sufficient condition for the maximum likelihood estimate to be a solution of the likelihood equation; otherwise it lies on the boundary of the parameter set. A simulation study is carried out to investigate the accuracy of the maximum likelihood estimate in dependence of the range of the underlying observations.

摘要 作为普通泊松模型的灵活扩展,康威-麦克斯韦-泊松分布允许通过一个附加参数来描述计数数据的欠分散和过分散。因此,需要两个 Conway-Maxwell 泊松参数的估计方法来指定模型。在这项工作中,提供了两个与康威-麦克斯韦-泊松参数最大似然估计有关的特征结果。第一个结果表明,当且仅当观测值的范围小于两个时,最大似然估计才会失败。假设存在最大似然估计,那么第二个结果就包含了一个简单的必要条件和充分条件,即最大似然估计是似然方程的一个解;否则,它就位于参数集的边界上。我们进行了一项模拟研究,以探讨最大似然估计的准确性与基本观测值范围的关系。
{"title":"Characterizing Existence and Location of the ML Estimate in the Conway–Maxwell–Poisson Model","authors":"Stefan Bedbur, Anton Imm, Udo Kamps","doi":"10.3103/s1066530724700042","DOIUrl":"https://doi.org/10.3103/s1066530724700042","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>As a flexible extension of the common Poisson model, the Conway–Maxwell–Poisson distribution allows for describing under- and overdispersion in count data via an additional parameter. Estimation methods for two Conway–Maxwell–Poisson parameters are then required to specify the model. In this work, two characterization results are provided related to maximum likelihood estimation of the Conway–Maxwell–Poisson parameters. The first states that maximum likelihood estimation fails if and only if the range of the observations is less than two. Assuming that the maximum likelihood estimate exists, the second result then comprises a simple necessary and sufficient condition for the maximum likelihood estimate to be a solution of the likelihood equation; otherwise it lies on the boundary of the parameter set. A simulation study is carried out to investigate the accuracy of the maximum likelihood estimate in dependence of the range of the underlying observations.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.5,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140803669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Mathematical Methods of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1