Pub Date : 2024-05-30DOI: 10.1007/s00362-024-01578-6
Xuehu Zhu, Rongzhu Zhao, Dan Zeng, Qian Zhao, Jun Zhang
This paper introduces a novel Dimension Reduction-based Adaptive-to-model Semi-supervised Classification method, specifically designed for scenarios where the number of unlabeled samples significantly exceeds that of labeled samples. Leveraging the strengths of sufficient dimension reduction and non-parametric interpolation, the method significantly amplifies the value derived from unlabeled samples, thus enhancing the precision of the classification model. An iterative version is also presented to extract further insights from the interpolated unlabeled samples. Theoretical analyses and numerical studies demonstrate substantial improvements in classifier accuracy, particularly in the context of model misspecified. The effectiveness of the proposed method in enhancing classification accuracy is further substantiated through two empirical analyses: credit card application evaluations and coronary heart disease diagnostic assessments.
{"title":"Dimension reduction-based adaptive-to-model semi-supervised classification","authors":"Xuehu Zhu, Rongzhu Zhao, Dan Zeng, Qian Zhao, Jun Zhang","doi":"10.1007/s00362-024-01578-6","DOIUrl":"https://doi.org/10.1007/s00362-024-01578-6","url":null,"abstract":"<p>This paper introduces a novel Dimension Reduction-based Adaptive-to-model Semi-supervised Classification method, specifically designed for scenarios where the number of unlabeled samples significantly exceeds that of labeled samples. Leveraging the strengths of sufficient dimension reduction and non-parametric interpolation, the method significantly amplifies the value derived from unlabeled samples, thus enhancing the precision of the classification model. An iterative version is also presented to extract further insights from the interpolated unlabeled samples. Theoretical analyses and numerical studies demonstrate substantial improvements in classifier accuracy, particularly in the context of model misspecified. The effectiveness of the proposed method in enhancing classification accuracy is further substantiated through two empirical analyses: credit card application evaluations and coronary heart disease diagnostic assessments.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"83 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-25DOI: 10.1007/s00362-024-01568-8
Morteza Mohammadi, Majid Hashempour
This paper introduces the concept of dynamic cumulative residual extropy inaccuracy (DCREI) by expanding on the existing dynamic cumulative residual extropy (DCRE) measure and proposes a weighted version of it. The paper then investigates a characterization problem for the proposed weighted dynamic extropy inaccuracy measure under the proportional hazard model and characterizes some well-known lifetime distributions using the weighted dynamic cumulative residual extropy inaccuracy (WDCREI) measure. Additionally, the study discusses the stochastic ordering of WDCREI and certain results based on it. Non-parametric estimations of the proposed measures based on kernel and empirical estimators are suggested. Results of a simulation study show that the kernel-based estimators perform better than the empirical-based estimator. Finally, applications of the proposed measures on model selection are provided.
{"title":"On weighted version of dynamic cumulative residual inaccuracy measure based on extropy","authors":"Morteza Mohammadi, Majid Hashempour","doi":"10.1007/s00362-024-01568-8","DOIUrl":"https://doi.org/10.1007/s00362-024-01568-8","url":null,"abstract":"<p>This paper introduces the concept of dynamic cumulative residual extropy inaccuracy (DCREI) by expanding on the existing dynamic cumulative residual extropy (DCRE) measure and proposes a weighted version of it. The paper then investigates a characterization problem for the proposed weighted dynamic extropy inaccuracy measure under the proportional hazard model and characterizes some well-known lifetime distributions using the weighted dynamic cumulative residual extropy inaccuracy (WDCREI) measure. Additionally, the study discusses the stochastic ordering of WDCREI and certain results based on it. Non-parametric estimations of the proposed measures based on kernel and empirical estimators are suggested. Results of a simulation study show that the kernel-based estimators perform better than the empirical-based estimator. Finally, applications of the proposed measures on model selection are provided.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"9 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141153701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-20DOI: 10.1007/s00362-024-01563-z
Guangyu Wu, Anders Lindquist
Probability density estimation is a core problem in statistics and data science. Moment methods are an important means of density estimation, but they are generally strongly dependent on the choice of feasible functions, which severely affects the performance. In this paper, we propose a non-classical parametrization for density estimation using sample moments, which does not require the choice of such functions. The parametrization is induced by the squared Hellinger distance, and the solution minimizing it, which is proved to exist and be unique subject to a simple prior that does not depend on data, and which can be obtained by convex optimization. Statistical properties of the density estimator, together with an asymptotic error upper bound, are proposed for the estimator by power moments. Simulation results validate the performance of the estimator by a comparison to several prevailing methods. The convergence rate of the proposed estimator is proved to be (m^{-1/2}) (m being the number of data samples), which is the optimal convergence rate for parametric estimators and exceeds that of the nonparametric estimators. To the best of our knowledge, the proposed estimator is the first one in the literature for which the power moments up to an arbitrary even order exactly match the sample moments, while the true density is not assumed to fall within specific function classes.
{"title":"A non-classical parameterization for density estimation using sample moments","authors":"Guangyu Wu, Anders Lindquist","doi":"10.1007/s00362-024-01563-z","DOIUrl":"https://doi.org/10.1007/s00362-024-01563-z","url":null,"abstract":"<p>Probability density estimation is a core problem in statistics and data science. Moment methods are an important means of density estimation, but they are generally strongly dependent on the choice of feasible functions, which severely affects the performance. In this paper, we propose a non-classical parametrization for density estimation using sample moments, which does not require the choice of such functions. The parametrization is induced by the squared Hellinger distance, and the solution minimizing it, which is proved to exist and be unique subject to a simple prior that does not depend on data, and which can be obtained by convex optimization. Statistical properties of the density estimator, together with an asymptotic error upper bound, are proposed for the estimator by power moments. Simulation results validate the performance of the estimator by a comparison to several prevailing methods. The convergence rate of the proposed estimator is proved to be <span>(m^{-1/2})</span> (<i>m</i> being the number of data samples), which is the optimal convergence rate for parametric estimators and exceeds that of the nonparametric estimators. To the best of our knowledge, the proposed estimator is the first one in the literature for which the power moments up to an arbitrary even order exactly match the sample moments, while the true density is not assumed to fall within specific function classes.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"57 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141153313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-20DOI: 10.1007/s00362-024-01564-y
Monika S. Dhull, Arun Kumar
In this article, we discuss some geometric infinitely divisible (gid) random variables using the Laplace exponents which are Bernstein functions and study their properties. The distributional properties and limiting behavior of the probability densities of these gid random variables at (0^{+}) are studied. The autoregressive (AR) models with gid marginals are introduced. Further, the first order AR process is generalized to kth order AR process. We also provide the parameter estimation method based on conditional least square and method of moments for the introduced AR(1) process. We also apply the introduced AR(1) model with geometric inverse Gaussian marginals on the household energy usage data which provide a good fit as compared to normal AR(1) data.
本文利用伯恩斯坦函数的拉普拉斯指数讨论了一些几何无限可分(gid)随机变量,并研究了它们的性质。研究了这些gid随机变量在(0^{+})处的概率密度的分布性质和极限行为。引入了具有 gid 边值的自回归(AR)模型。此外,还将一阶 AR 过程泛化为 kth 阶 AR 过程。我们还为引入的 AR(1) 过程提供了基于条件最小二乘法和矩法的参数估计方法。我们还将引入的具有几何反高斯边际的 AR(1) 模型应用于家庭能源使用数据,与普通 AR(1) 数据相比,该模型具有良好的拟合效果。
{"title":"Geometric infinitely divisible autoregressive models","authors":"Monika S. Dhull, Arun Kumar","doi":"10.1007/s00362-024-01564-y","DOIUrl":"https://doi.org/10.1007/s00362-024-01564-y","url":null,"abstract":"<p>In this article, we discuss some geometric infinitely divisible (gid) random variables using the Laplace exponents which are Bernstein functions and study their properties. The distributional properties and limiting behavior of the probability densities of these gid random variables at <span>(0^{+})</span> are studied. The autoregressive (AR) models with gid marginals are introduced. Further, the first order AR process is generalized to <i>k</i>th order AR process. We also provide the parameter estimation method based on conditional least square and method of moments for the introduced AR(1) process. We also apply the introduced AR(1) model with geometric inverse Gaussian marginals on the household energy usage data which provide a good fit as compared to normal AR(1) data.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"5 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141153300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-10DOI: 10.1007/s00362-024-01561-1
Ping Sun, Ze-Chun Hu, Wei Sun
Let X be a random variable with finite second moment. We investigate the inequality: (P{|X-textrm{E}[X]|le sqrt{textrm{Var}(X)}}ge P{|Z|le 1}), where Z is a standard normal random variable. We prove that this inequality holds for many familiar infinitely divisible continuous distributions including the Laplace, Gumbel, Logistic, Pareto, infinitely divisible Weibull, Log-normal, Student’s t and Inverse Gaussian distributions. Numerical results are given to show that the inequality with continuity correction also holds for some infinitely divisible discrete distributions.
假设 X 是一个具有有限第二矩的随机变量。我们研究不等式(P{|X-textrm{E}[X]|le sqrt{textrm{Var}(X)}}ge P{|Z|le 1}),其中 Z 是标准正态随机变量。我们证明了这个不等式对许多熟悉的无限可分连续分布都成立,包括拉普拉斯分布、甘贝尔分布、对数分布、帕累托分布、无限可分韦布尔分布、对数正态分布、Student's t 分布和反高斯分布。给出的数值结果表明,带连续性修正的不等式也适用于某些无限可分离散分布。
{"title":"Variation comparison between infinitely divisible distributions and the normal distribution","authors":"Ping Sun, Ze-Chun Hu, Wei Sun","doi":"10.1007/s00362-024-01561-1","DOIUrl":"https://doi.org/10.1007/s00362-024-01561-1","url":null,"abstract":"<p>Let <i>X</i> be a random variable with finite second moment. We investigate the inequality: <span>(P{|X-textrm{E}[X]|le sqrt{textrm{Var}(X)}}ge P{|Z|le 1})</span>, where <i>Z</i> is a standard normal random variable. We prove that this inequality holds for many familiar infinitely divisible continuous distributions including the Laplace, Gumbel, Logistic, Pareto, infinitely divisible Weibull, Log-normal, Student’s <i>t</i> and Inverse Gaussian distributions. Numerical results are given to show that the inequality with continuity correction also holds for some infinitely divisible discrete distributions.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"3 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-07DOI: 10.1007/s00362-024-01558-w
Tanmay Sahoo, Nil Kamal Hazra, Narayanaswamy Balakrishnan
Sequential order statistics (SOS) are useful tools for modeling the lifetimes of systems wherein the failure of a component has a significant impact on the lifetimes of the remaining surviving components. The SOS model is a general model that contains most of the existing models for ordered random variables. In this paper, we consider the SOS model with non-identical components and then discuss various univariate and multivariate stochastic comparison results in both one-and two-sample scenarios.
序列有序统计(SOS)是建立系统寿命模型的有用工具,其中一个组件的失效会对其余存活组件的寿命产生重大影响。SOS 模型是一个通用模型,包含了大多数现有的有序随机变量模型。在本文中,我们考虑了具有非相同组件的 SOS 模型,然后讨论了单样本和双样本情况下的各种单变量和多变量随机比较结果。
{"title":"Multivariate stochastic comparisons of sequential order statistics with non-identical components","authors":"Tanmay Sahoo, Nil Kamal Hazra, Narayanaswamy Balakrishnan","doi":"10.1007/s00362-024-01558-w","DOIUrl":"https://doi.org/10.1007/s00362-024-01558-w","url":null,"abstract":"<p>Sequential order statistics (SOS) are useful tools for modeling the lifetimes of systems wherein the failure of a component has a significant impact on the lifetimes of the remaining surviving components. The SOS model is a general model that contains most of the existing models for ordered random variables. In this paper, we consider the SOS model with non-identical components and then discuss various univariate and multivariate stochastic comparison results in both one-and two-sample scenarios.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"128 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-06DOI: 10.1007/s00362-024-01559-9
Martial Longla
We characterize absolutely continuous symmetric copulas with square integrable densities in this paper. This characterization is used to create new copula families, that are perturbations of the independence copula. The full study of mixing properties of Markov chains generated by these copula families is conducted. An extension that includes the Farlie–Gumbel–Morgenstern family of copulas is proposed. We propose some examples of copulas that generate non-mixing Markov chains, but whose convex combinations generate (psi )-mixing Markov chains. Some general results on (psi )-mixing are given. The Spearman’s correlation (rho _S) and Kendall’s (tau ) are provided for the created copula families. Some general remarks are provided for (rho _S) and (tau ). A central limit theorem is provided for parameter estimators in one example. A simulation study is conducted to support derived asymptotic distributions for some examples.
{"title":"New copula families and mixing properties","authors":"Martial Longla","doi":"10.1007/s00362-024-01559-9","DOIUrl":"https://doi.org/10.1007/s00362-024-01559-9","url":null,"abstract":"<p>We characterize absolutely continuous symmetric copulas with square integrable densities in this paper. This characterization is used to create new copula families, that are perturbations of the independence copula. The full study of mixing properties of Markov chains generated by these copula families is conducted. An extension that includes the Farlie–Gumbel–Morgenstern family of copulas is proposed. We propose some examples of copulas that generate non-mixing Markov chains, but whose convex combinations generate <span>(psi )</span>-mixing Markov chains. Some general results on <span>(psi )</span>-mixing are given. The Spearman’s correlation <span>(rho _S)</span> and Kendall’s <span>(tau )</span> are provided for the created copula families. Some general remarks are provided for <span>(rho _S)</span> and <span>(tau )</span>. A central limit theorem is provided for parameter estimators in one example. A simulation study is conducted to support derived asymptotic distributions for some examples.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"32 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-06DOI: 10.1007/s00362-023-01507-z
Jianbo Xu
There is often a presence of random change points (RCPs) with varying timing of hazard rate change among patients in survival analysis within oncology trials. This is in contrast to fixed change points in piecewise constant hazard models, where the timing of hazard rate change remains the same for all subjects. However, currently there is a lack of appropriate statistical methods to effectively tackle this particular issue. This article presents novel statistical methods that aim to characterize these complex survival models. These methods allow for the estimation of important features such as the probability of an event occurring and being censored, and the expected number of events within the clinical trial, prior to any specific time, and within specific time intervals. They also derive expected survival time and parametric expected survival and hazard functions for subjects with any finite number of RCPs. Simulation studies validate these methods and demonstrate their reliability and effectiveness. Real clinical data from an oncology trial is also used to apply these methods. The applications of these methods in oncology trials are extensive, including estimating hazard rates and rate parameters of RCPs, assessing treatment switching, delayed onset of immunotherapy, and subsequent anticancer therapies. They also have value in clinical trial planning, monitoring, and sample size adjustment. The expected parametric survival and hazard functions provide a thorough understanding of the behaviors and effects of RCPs in complex survival models.
{"title":"Multiple random change points in survival analysis with applications to clinical trials","authors":"Jianbo Xu","doi":"10.1007/s00362-023-01507-z","DOIUrl":"https://doi.org/10.1007/s00362-023-01507-z","url":null,"abstract":"<p>There is often a presence of random change points (RCPs) with varying timing of hazard rate change among patients in survival analysis within oncology trials. This is in contrast to fixed change points in piecewise constant hazard models, where the timing of hazard rate change remains the same for all subjects. However, currently there is a lack of appropriate statistical methods to effectively tackle this particular issue. This article presents novel statistical methods that aim to characterize these complex survival models. These methods allow for the estimation of important features such as the probability of an event occurring and being censored, and the expected number of events within the clinical trial, prior to any specific time, and within specific time intervals. They also derive expected survival time and parametric expected survival and hazard functions for subjects with any finite number of RCPs. Simulation studies validate these methods and demonstrate their reliability and effectiveness. Real clinical data from an oncology trial is also used to apply these methods. The applications of these methods in oncology trials are extensive, including estimating hazard rates and rate parameters of RCPs, assessing treatment switching, delayed onset of immunotherapy, and subsequent anticancer therapies. They also have value in clinical trial planning, monitoring, and sample size adjustment. The expected parametric survival and hazard functions provide a thorough understanding of the behaviors and effects of RCPs in complex survival models.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"26 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-06DOI: 10.1007/s00362-024-01556-y
Xiaodi Wang, Hengzhen Huang
Symmetrical global sensitivity analysis (SGSA) can aid practitioners in reducing the model complexity by identifying symmetries within the model. In this paper, we propose a nested symmetrical Latin hypercube design (NSLHD) for implementing SGSA in a sequential manner. By combining the strengths of the nested Latin hypercube design and symmetrical design, the proposed design allows for the implementation of SGSA without the need to pre-determine the sample size of the experiment. We develop a random sampling procedure and an efficient sequential optimization algorithm to construct flexible NSLHDs in terms of runs and factors. Sampling properties of the constructed designs are studied. Numerical examples are given to demonstrate the effectiveness of the NSLHD for designing sequential sensitivity analysis.
{"title":"Nested symmetrical Latin hypercube designs","authors":"Xiaodi Wang, Hengzhen Huang","doi":"10.1007/s00362-024-01556-y","DOIUrl":"https://doi.org/10.1007/s00362-024-01556-y","url":null,"abstract":"<p>Symmetrical global sensitivity analysis (SGSA) can aid practitioners in reducing the model complexity by identifying symmetries within the model. In this paper, we propose a nested symmetrical Latin hypercube design (NSLHD) for implementing SGSA in a sequential manner. By combining the strengths of the nested Latin hypercube design and symmetrical design, the proposed design allows for the implementation of SGSA without the need to pre-determine the sample size of the experiment. We develop a random sampling procedure and an efficient sequential optimization algorithm to construct flexible NSLHDs in terms of runs and factors. Sampling properties of the constructed designs are studied. Numerical examples are given to demonstrate the effectiveness of the NSLHD for designing sequential sensitivity analysis.</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"112 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-02DOI: 10.1007/s00362-024-01555-z
Lisa Parveen, Ruhul Ali Khan, Murari Mitra
Comparison of variability or dispersion of two distributions is the major focus of this work. To this end, we consider a two sample testing problem for detecting dominance in dispersive order and develop a test based on U-statistic approach. We also explore a link between the two measures of variability, viz. dispersive order and Gini’s mean difference (GMD). We exploit methodologies based on jackknife empirical likelihood (JEL) and adjusted JEL in order to overcome certain practical difficulties. The performance of the proposed test is assessed by means of a simulation study. Finally, we apply our test in the context of several real life situations including medical studies and insurance data.
比较两个分布的变异性或分散性是这项工作的重点。为此,我们考虑了一个检测分散秩支配性的双样本检验问题,并开发了一种基于 U 统计的检验方法。我们还探讨了两种可变性测量之间的联系,即分散秩序和基尼均值差(GMD)。为了克服某些实际困难,我们利用了基于杰克刀经验似然法(JEL)和调整 JEL 的方法。我们通过模拟研究来评估所提出的检验方法的性能。最后,我们将我们的检验方法应用于包括医学研究和保险数据在内的几种实际情况中。
{"title":"A two sample nonparametric test for variability via empirical likelihood methods","authors":"Lisa Parveen, Ruhul Ali Khan, Murari Mitra","doi":"10.1007/s00362-024-01555-z","DOIUrl":"https://doi.org/10.1007/s00362-024-01555-z","url":null,"abstract":"<p>Comparison of variability or dispersion of two distributions is the major focus of this work. To this end, we consider a two sample testing problem for detecting dominance in dispersive order and develop a test based on <i>U</i>-statistic approach. We also explore a link between the two measures of variability, viz. dispersive order and Gini’s mean difference (GMD). We exploit methodologies based on jackknife empirical likelihood (JEL) and adjusted JEL in order to overcome certain practical difficulties. The performance of the proposed test is assessed by means of a simulation study. Finally, we apply our test in the context of several real life situations including medical studies and insurance data.\u0000</p>","PeriodicalId":51166,"journal":{"name":"Statistical Papers","volume":"194 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140883340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}