首页 > 最新文献

Journal of Multivariate Analysis最新文献

英文 中文
A general framework to extend sufficient dimension reductions to the cases of the mixture multivariate elliptical distributions 给出了将足够降维扩展到混合多元椭圆分布的一般框架
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-28 DOI: 10.1016/j.jmva.2025.105551
Wenjuan Li , Hongming Pei , Ali Jiang , Fei Chen
In the sufficient dimension reduction (SDR), many methods depend on some assumptions on the distribution of predictor vector, such as the linear design condition (L.D.C.), the assumption of constant conditional variance, and so on. The mixture distributions emerge frequently in practice, but they may not satisfy the above assumptions. In this article, a general framework is proposed to extend various SDR methods to the cases where the predictor vector follows the mixture elliptical distributions, together with the asymptotic property for the consistency of the kernel matrix estimators. For illustration, the extensions of several classical SDR approaches under the proposed framework are detailed. Moreover, a method to estimate the structural dimension is given, together with a procedure to check an assumption called homogeneity. The proposed methodology is illustrated by simulated and real examples.
在充分降维(SDR)中,许多方法依赖于对预测向量分布的一些假设,如线性设计条件(L.D.C.)、条件方差恒定假设等。混合分布在实践中经常出现,但它们可能不满足上述假设。本文提出了一个一般框架,将各种SDR方法扩展到预测向量服从混合椭圆分布的情况,并给出了核矩阵估计量相合性的渐近性质。为了说明这一点,详细介绍了几种经典SDR方法在该框架下的扩展。此外,还给出了一种估计结构尺寸的方法,以及一种检验均匀性假设的方法。通过仿真和实际算例说明了所提出的方法。
{"title":"A general framework to extend sufficient dimension reductions to the cases of the mixture multivariate elliptical distributions","authors":"Wenjuan Li ,&nbsp;Hongming Pei ,&nbsp;Ali Jiang ,&nbsp;Fei Chen","doi":"10.1016/j.jmva.2025.105551","DOIUrl":"10.1016/j.jmva.2025.105551","url":null,"abstract":"<div><div>In the sufficient dimension reduction (SDR), many methods depend on some assumptions on the distribution of predictor vector, such as the linear design condition (L.D.C.), the assumption of constant conditional variance, and so on. The mixture distributions emerge frequently in practice, but they may not satisfy the above assumptions. In this article, a general framework is proposed to extend various SDR methods to the cases where the predictor vector follows the mixture elliptical distributions, together with the asymptotic property for the consistency of the kernel matrix estimators. For illustration, the extensions of several classical SDR approaches under the proposed framework are detailed. Moreover, a method to estimate the structural dimension is given, together with a procedure to check an assumption called homogeneity. The proposed methodology is illustrated by simulated and real examples.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105551"},"PeriodicalIF":1.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145683764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Varying-coefficient quantile regression with effect under panel data and missing observation 在面板数据和缺失观测下的变系数分位数回归
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-28 DOI: 10.1016/j.jmva.2025.105561
Shu-Yu Li , Han-Ying Liang , Bao-Hua Wang
We, in this paper, focus on partial linear varying-coefficient quantile regression with fixed effects under panel data and missing observations, where the missing observations include either responses or covariates are missing at random. Under independent setting, we define estimators of the unknown parameter vector, varying-coefficient function and effect in the model, and discuss their large number properties. To use the information from within-subject correlations, we propose weighted estimators for the unknown amounts, where the weights are chosen based on mean and quantile regressions, respectively, by quadratic inference technique and empirical likelihood method. Under dependent assumption, we establish asymptotic normality of the weighted estimators. Meanwhile, we study hypothesis tests of the parameter, varying-coefficient function and effect, and prove asymptotic distributions of restricted estimators and test statistics of the parameter under null hypothesis and local alternative hypothesis, respectively. Also, oracle property of the parameter is considered. Simulation study and real data analysis are conducted to evaluate the performance of the proposed methods.
本文主要研究面板数据和缺失观测值下固定效应的偏线性变系数分位数回归,其中缺失观测值包括随机缺失的响应或随机缺失的协变量。在独立设定下,定义了模型中未知参数向量、变系数函数和效应的估计量,并讨论了它们的大数性质。为了利用主体内相关性的信息,我们提出了未知数量的加权估计,其中权重分别基于均值和分位数回归,通过二次推理技术和经验似然方法选择。在相关假设下,我们建立了加权估计量的渐近正态性。同时,研究了参数、变系数函数和效应的假设检验,分别证明了零假设和局部备择假设下的约束估计量的渐近分布和参数的统计量检验。同时,还考虑了参数的oracle属性。通过仿真研究和实际数据分析,对所提方法的性能进行了评价。
{"title":"Varying-coefficient quantile regression with effect under panel data and missing observation","authors":"Shu-Yu Li ,&nbsp;Han-Ying Liang ,&nbsp;Bao-Hua Wang","doi":"10.1016/j.jmva.2025.105561","DOIUrl":"10.1016/j.jmva.2025.105561","url":null,"abstract":"<div><div>We, in this paper, focus on partial linear varying-coefficient quantile regression with fixed effects under panel data and missing observations, where the missing observations include either responses or covariates are missing at random. Under independent setting, we define estimators of the unknown parameter vector, varying-coefficient function and effect in the model, and discuss their large number properties. To use the information from within-subject correlations, we propose weighted estimators for the unknown amounts, where the weights are chosen based on mean and quantile regressions, respectively, by quadratic inference technique and empirical likelihood method. Under dependent assumption, we establish asymptotic normality of the weighted estimators. Meanwhile, we study hypothesis tests of the parameter, varying-coefficient function and effect, and prove asymptotic distributions of restricted estimators and test statistics of the parameter under null hypothesis and local alternative hypothesis, respectively. Also, oracle property of the parameter is considered. Simulation study and real data analysis are conducted to evaluate the performance of the proposed methods.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105561"},"PeriodicalIF":1.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Jump detection in single-index models with measurement error 具有测量误差的单指标模型的跳跃检测
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-28 DOI: 10.1016/j.jmva.2025.105574
Yuan Liu , Yan-Yong Zhao , Noriszura Ismail , Razik Ridzuan Mohd Tajuddin , Yuchun Zhang
Measurement error regression is widely used in statistical modeling. When the regression function is discontinuous, the estimation and inference have become challenging. In this paper, we develop a jump detection framework for a single index model with measurement error. First, for the single index model with measurement error, the consistent estimator of the index coefficient is obtained by using both the SIMEX (simulation extrapolation) and estimation equation methods. Then, the one-sided kernel local linear method is used to construct the estimator of the nonparametric function and the estimator of the jump point. Under some regularity assumptions, the asymptotic properties of the resultant estimators are established. The finite sample performance of our methodologies is evaluated by numerical simulation, and finally they are used to analyze the effect of serum cholesterol level and age on male blood.
测量误差回归在统计建模中得到了广泛的应用。当回归函数不连续时,估计和推理就变得很有挑战性。在本文中,我们开发了一个具有测量误差的单指标模型的跳跃检测框架。首先,针对具有测量误差的单指标模型,采用模拟外推法和估计方程法得到指标系数的一致估计量;然后,利用单侧核局部线性方法构造了非参数函数的估计量和跳点的估计量。在一些正则性假设下,给出了合成估计量的渐近性质。通过数值模拟对方法的有限样本性能进行了评价,最后应用该方法分析了血清胆固醇水平和年龄对男性血液的影响。
{"title":"Jump detection in single-index models with measurement error","authors":"Yuan Liu ,&nbsp;Yan-Yong Zhao ,&nbsp;Noriszura Ismail ,&nbsp;Razik Ridzuan Mohd Tajuddin ,&nbsp;Yuchun Zhang","doi":"10.1016/j.jmva.2025.105574","DOIUrl":"10.1016/j.jmva.2025.105574","url":null,"abstract":"<div><div>Measurement error regression is widely used in statistical modeling. When the regression function is discontinuous, the estimation and inference have become challenging. In this paper, we develop a jump detection framework for a single index model with measurement error. First, for the single index model with measurement error, the consistent estimator of the index coefficient is obtained by using both the SIMEX (simulation extrapolation) and estimation equation methods. Then, the one-sided kernel local linear method is used to construct the estimator of the nonparametric function and the estimator of the jump point. Under some regularity assumptions, the asymptotic properties of the resultant estimators are established. The finite sample performance of our methodologies is evaluated by numerical simulation, and finally they are used to analyze the effect of serum cholesterol level and age on male blood.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105574"},"PeriodicalIF":1.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145616473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing multivariate normality for two-level structural equation models 两级结构方程模型的多元正态性检验
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-28 DOI: 10.1016/j.jmva.2025.105562
Jiajuan Liang , Peter M. Bentler , Yiwen Cao
Multivariate normality is a common assumption in the maximum likelihood analysis of two-level structural equation models. Under the normal assumption, the independence condition on level-1 observations is no longer satisfied. As a result, existing statistics for testing multivariate normality based independent observations cannot be directly used for the same purpose in two-level structural equation models. In this paper we tackle this problem by employing the theory of spherical matrix distributions and some properties of invariant statistics. A series of necessary tests are constructed from some existing invariant statistics with a balanced level-1 sample design. These necessary tests are applicable without requiring a large level-1 or level-2 sample size. A Monte Carlo study is carried out to demonstrate the performance of the proposed tests in the aspects of controlling type I error rates, the power against a departure from multivariate normality for level-1 variables, and the power against a departure from multivariate normality for level-2 variables. An application of the necessary tests to a practical data set is illustrated.
多元正态性是两级结构方程模型最大似然分析中常见的假设。在正态假设下,一级观测值的独立性不再满足。因此,现有的用于检验基于独立观测的多元正态性的统计量不能直接用于两层结构方程模型的相同目的。本文利用球矩阵分布理论和不变统计的一些性质来解决这一问题。在平衡的一级样本设计下,从现有的一些不变统计量构造了一系列必要的检验。这些必要的测试适用于不需要大的一级或二级样本量。通过蒙特卡罗研究证明了所提出的测试在控制I型错误率、1级变量的抗偏离多元正态性的能力以及2级变量的抗偏离多元正态性的能力方面的性能。并举例说明了必要的测试在实际数据集上的应用。
{"title":"Testing multivariate normality for two-level structural equation models","authors":"Jiajuan Liang ,&nbsp;Peter M. Bentler ,&nbsp;Yiwen Cao","doi":"10.1016/j.jmva.2025.105562","DOIUrl":"10.1016/j.jmva.2025.105562","url":null,"abstract":"<div><div>Multivariate normality is a common assumption in the maximum likelihood analysis of two-level structural equation models. Under the normal assumption, the independence condition on level-1 observations is no longer satisfied. As a result, existing statistics for testing multivariate normality based independent observations cannot be directly used for the same purpose in two-level structural equation models. In this paper we tackle this problem by employing the theory of spherical matrix distributions and some properties of invariant statistics. A series of necessary tests are constructed from some existing invariant statistics with a balanced level-1 sample design. These necessary tests are applicable without requiring a large level-1 or level-2 sample size. A Monte Carlo study is carried out to demonstrate the performance of the proposed tests in the aspects of controlling type I error rates, the power against a departure from multivariate normality for level-1 variables, and the power against a departure from multivariate normality for level-2 variables. An application of the necessary tests to a practical data set is illustrated.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105562"},"PeriodicalIF":1.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145616475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fréchet kNN-based sufficient dimension reduction 基于knn的fresamet充分降维
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-28 DOI: 10.1016/j.jmva.2025.105566
Xueyan Huang, Rui Qiu, Zhou Yu
In this paper, we introduce two Fréchet inverse regression methods with kernel bandwidths determined by k nearest neighbors, designed to achieve sufficient dimension reduction for a metric space-valued response and Euclidean predictors. A key advantage of the proposals lies in their ability to effectively preserve the intrinsic information of the metric space-valued response. We establish the asymptotic normality of these methods through rigorous theoretical proofs. Additionally, simulations and a real data example are provided to validate the performance and practical applicability of the proposed methods.
在本文中,我们介绍了两种核带宽由k近邻决定的fracimet逆回归方法,旨在为度量空间值响应和欧几里得预测实现足够的降维。这些建议的一个关键优势在于它们能够有效地保留度量空间值响应的内在信息。通过严格的理论证明,建立了这些方法的渐近正态性。最后,通过仿真和一个实际数据算例验证了所提方法的性能和实用性。
{"title":"Fréchet kNN-based sufficient dimension reduction","authors":"Xueyan Huang,&nbsp;Rui Qiu,&nbsp;Zhou Yu","doi":"10.1016/j.jmva.2025.105566","DOIUrl":"10.1016/j.jmva.2025.105566","url":null,"abstract":"<div><div>In this paper, we introduce two Fréchet inverse regression methods with kernel bandwidths determined by <span><math><mi>k</mi></math></span> nearest neighbors, designed to achieve sufficient dimension reduction for a metric space-valued response and Euclidean predictors. A key advantage of the proposals lies in their ability to effectively preserve the intrinsic information of the metric space-valued response. We establish the asymptotic normality of these methods through rigorous theoretical proofs. Additionally, simulations and a real data example are provided to validate the performance and practical applicability of the proposed methods.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105566"},"PeriodicalIF":1.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Type I multivariate Pólya-Aeppli distributions with applications 带应用程序的I型多变量Pólya-Aeppli发行版
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-28 DOI: 10.1016/j.jmva.2025.105556
Claire Geldenhuys, Rene Ehlers, Andriette Bekker
An extensive body of literature exists that specifically addresses the univariate case of zero-inflated count models. In contrast, research pertaining to multivariate models is notably less developed. We propose two new parsimonious multivariate models that can be used to model correlated multivariate overdispersed count data. Furthermore, for different parameter settings and sample sizes, various simulations are performed. In conclusion, we demonstrate the performance of the newly proposed multivariate candidates on two benchmark datasets, which surpasses that of several alternative approaches.
有大量的文献专门论述了零膨胀计数模型的单变量情况。相比之下,与多变量模型有关的研究明显欠发达。我们提出了两个新的简化的多元模型,可以用来模拟相关的多元过分散计数数据。此外,对于不同的参数设置和样本量,进行了各种模拟。总之,我们证明了新提出的多变量候选算法在两个基准数据集上的性能优于几种替代方法。
{"title":"Type I multivariate Pólya-Aeppli distributions with applications","authors":"Claire Geldenhuys,&nbsp;Rene Ehlers,&nbsp;Andriette Bekker","doi":"10.1016/j.jmva.2025.105556","DOIUrl":"10.1016/j.jmva.2025.105556","url":null,"abstract":"<div><div>An extensive body of literature exists that specifically addresses the univariate case of zero-inflated count models. In contrast, research pertaining to multivariate models is notably less developed. We propose two new parsimonious multivariate models that can be used to model correlated multivariate overdispersed count data. Furthermore, for different parameter settings and sample sizes, various simulations are performed. In conclusion, we demonstrate the performance of the newly proposed multivariate candidates on two benchmark datasets, which surpasses that of several alternative approaches.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105556"},"PeriodicalIF":1.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the two-sample Behrens–Fisher problem for high-dimensional data 高维数据的双样本Behrens-Fisher问题
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-28 DOI: 10.1016/j.jmva.2025.105572
Yongshuai Chen , Gongming Shi , Xiaomeng Yan, Baoxue Zhang
In this paper, we study the limiting distribution of Chen-Qin’s test statistic and propose a novel weighted bootstrap test procedure for the high-dimensional two-sample Behrens–Fisher problem. We first show that the test statistic has an asymptotic null that is a mixture of a chi-square-type mixture distribution and a normal distribution, without imposing either the normal assumption or a factor-like model assumption on the underlying distributions. To gain insight into the asymptotic null distribution of the test statistic, we show that under stronger restrictions on the covariance matrices and the null hypothesis, the test statistic is either asymptotically normal or a chi-square-type mixture distribution. The power properties of the test are evaluated asymptotically under the high-dimensional local and fixed alternative hypothesis. We also derive that the proposed weighted bootstrap test procedure has correct test level asymptotically. Two simulation studies and a real data example show that the new weighted bootstrap procedure significantly outperforms other benchmarks in terms of size control and is comparable in terms of power.
本文研究了Chen-Qin检验统计量的极限分布,提出了一种新的高维双样本Behrens-Fisher问题的加权自举检验方法。我们首先证明检验统计量有一个渐近零值,它是卡方型混合分布和正态分布的混合物,而没有对底层分布施加正态假设或类因子模型假设。为了深入了解检验统计量的渐近零分布,我们表明,在对协方差矩阵和零假设的更强限制下,检验统计量要么是渐近正态分布,要么是卡方型混合分布。在高维局部定备假设下,渐近地评价了检验的幂性质。并渐近地证明了所提出的加权自举检验方法具有正确的检验水平。两个仿真研究和一个实际数据示例表明,新的加权自举过程在大小控制方面明显优于其他基准程序,并且在功率方面具有可比性。
{"title":"On the two-sample Behrens–Fisher problem for high-dimensional data","authors":"Yongshuai Chen ,&nbsp;Gongming Shi ,&nbsp;Xiaomeng Yan,&nbsp;Baoxue Zhang","doi":"10.1016/j.jmva.2025.105572","DOIUrl":"10.1016/j.jmva.2025.105572","url":null,"abstract":"<div><div>In this paper, we study the limiting distribution of Chen-Qin’s test statistic and propose a novel weighted bootstrap test procedure for the high-dimensional two-sample Behrens–Fisher problem. We first show that the test statistic has an asymptotic null that is a mixture of a chi-square-type mixture distribution and a normal distribution, without imposing either the normal assumption or a factor-like model assumption on the underlying distributions. To gain insight into the asymptotic null distribution of the test statistic, we show that under stronger restrictions on the covariance matrices and the null hypothesis, the test statistic is either asymptotically normal or a chi-square-type mixture distribution. The power properties of the test are evaluated asymptotically under the high-dimensional local and fixed alternative hypothesis. We also derive that the proposed weighted bootstrap test procedure has correct test level asymptotically. Two simulation studies and a real data example show that the new weighted bootstrap procedure significantly outperforms other benchmarks in terms of size control and is comparable in terms of power.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105572"},"PeriodicalIF":1.4,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variable selection in mixture regression for longitudinal data based on joint mean–covariance model 基于联合均值-协方差模型的纵向数据混合回归变量选择
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-19 DOI: 10.1016/j.jmva.2025.105548
Jing Yu , Jianxin Pan
A large number of explanatory variables may be measured with the collection of longitudinal data, of which some may not be influential for modeling of heterogeneous longitudinal data. For such complex data, not only their mean but also covariances may be affected by various explanatory variables. A data-driven approach is proposed to model the mean and covariance structures, simultaneously, together with selecting influential explanatory variables. A penalized maximum likelihood method for the joint mean and covariance model is developed within the framework of finite Gaussian mixture regression. EM algorithm is employed for the numerical calculation. The parameter estimators obtained are shown to be consistent and asymptotically normally distributed, and have oracle properties with proper choices of penalty function and tuning parameter. Simulation studies show that the proposed method works very well and provides accurate and effective parameter estimators by conducting variable selection. For illustration, real data analysis on clustering COVID-19 infected cases for European countries in terms of governmental policy effects is made to demonstrate the usefulness of the proposed method.
通过收集纵向数据可以测量大量的解释变量,其中一些变量可能对异构纵向数据的建模没有影响。对于这种复杂的数据,不仅其均值,而且协方差都可能受到各种解释变量的影响。提出了一种数据驱动的方法,同时对均值和协方差结构进行建模,并选择有影响的解释变量。在有限高斯混合回归的框架内,对联合均值和协方差模型提出了一种惩罚极大似然法。采用EM算法进行数值计算。得到的参数估计量是一致的、渐近正态分布的,并且在适当选择惩罚函数和调优参数的情况下具有oracle性质。仿真研究表明,该方法通过变量选择提供了准确有效的参数估计。为了说明,从政府政策效应的角度对欧洲国家的聚集性COVID-19感染病例进行了实际数据分析,以证明所提出方法的有效性。
{"title":"Variable selection in mixture regression for longitudinal data based on joint mean–covariance model","authors":"Jing Yu ,&nbsp;Jianxin Pan","doi":"10.1016/j.jmva.2025.105548","DOIUrl":"10.1016/j.jmva.2025.105548","url":null,"abstract":"<div><div>A large number of explanatory variables may be measured with the collection of longitudinal data, of which some may not be influential for modeling of heterogeneous longitudinal data. For such complex data, not only their mean but also covariances may be affected by various explanatory variables. A data-driven approach is proposed to model the mean and covariance structures, simultaneously, together with selecting influential explanatory variables. A penalized maximum likelihood method for the joint mean and covariance model is developed within the framework of finite Gaussian mixture regression. EM algorithm is employed for the numerical calculation. The parameter estimators obtained are shown to be consistent and asymptotically normally distributed, and have oracle properties with proper choices of penalty function and tuning parameter. Simulation studies show that the proposed method works very well and provides accurate and effective parameter estimators by conducting variable selection. For illustration, real data analysis on clustering COVID-19 infected cases for European countries in terms of governmental policy effects is made to demonstrate the usefulness of the proposed method.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105548"},"PeriodicalIF":1.4,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parametric convergence rate of a non-parametric estimator in multivariate mixtures of power series distributions under conditional independence 条件无关下幂级数分布多元混合中非参数估计量的参数收敛速率
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-19 DOI: 10.1016/j.jmva.2025.105542
Fadoua Balabdaoui , Harald Besdziek , Yong Wang
The conditional independence assumption has recently appeared in a growing body of literature on the estimation of multivariate mixtures. We consider here conditionally independent multivariate mixtures of power series distributions with infinite support, to which belong Poisson, Geometric or Negative Binomial mixtures. We show that for all these mixtures, the non-parametric maximum likelihood estimator converges to the truth at the rate (ln(nd))1+d/2n1/2 in the Hellinger distance, where n denotes the size of the observed sample and d represents the dimension of the mixture. Using this result, we then construct a new non-parametric estimator based on the maximum likelihood estimator that converges with the parametric rate n1/2 in all p-distances, for p1. These convergences rates are supported by simulations and the theory is illustrated using the famous Vélib dataset of the bike sharing system of Paris. We also introduce a testing procedure for whether the conditional independence assumption is satisfied for a given sample. This testing procedure is applied for several multivariate mixtures, with varying levels of dependence, and is thereby shown to distinguish well between conditionally independent and dependent mixtures. Finally, we use this testing procedure to investigate whether conditional independence holds for Vélib dataset.
条件独立假设最近出现在越来越多关于多元混合估计的文献中。本文考虑具有无限支持的幂级数分布的条件独立多元混合,它们属于泊松混合、几何混合或负二项式混合。我们证明,对于所有这些混合物,非参数极大似然估计在海灵格距离中以(ln(nd))1+d/2n−1/2的速率收敛于真值,其中n表示观察样本的大小,d表示混合物的尺寸。利用这一结果,我们构造了一个新的基于极大似然估计量的非参数估计量,当p≥1时,该估计量在所有的p距离上以参数速率n−1/2收敛。这些收敛速度得到了仿真的支持,并用著名的巴黎共享单车系统vsamublb数据集说明了该理论。我们还介绍了对给定样本是否满足条件独立假设的检验过程。该测试程序适用于几个多变量混合物,具有不同程度的依赖,因此可以很好地区分条件独立和依赖混合物。最后,我们使用这个测试过程来调查vsamlib数据集是否条件独立。
{"title":"Parametric convergence rate of a non-parametric estimator in multivariate mixtures of power series distributions under conditional independence","authors":"Fadoua Balabdaoui ,&nbsp;Harald Besdziek ,&nbsp;Yong Wang","doi":"10.1016/j.jmva.2025.105542","DOIUrl":"10.1016/j.jmva.2025.105542","url":null,"abstract":"<div><div>The conditional independence assumption has recently appeared in a growing body of literature on the estimation of multivariate mixtures. We consider here conditionally independent multivariate mixtures of power series distributions with infinite support, to which belong Poisson, Geometric or Negative Binomial mixtures. We show that for all these mixtures, the non-parametric maximum likelihood estimator converges to the truth at the rate <span><math><mrow><msup><mrow><mrow><mo>(</mo><mo>ln</mo><mrow><mo>(</mo><mi>n</mi><mi>d</mi><mo>)</mo></mrow><mo>)</mo></mrow></mrow><mrow><mn>1</mn><mo>+</mo><mi>d</mi><mo>/</mo><mn>2</mn></mrow></msup><msup><mrow><mi>n</mi></mrow><mrow><mo>−</mo><mn>1</mn><mo>/</mo><mn>2</mn></mrow></msup></mrow></math></span> in the Hellinger distance, where <span><math><mi>n</mi></math></span> denotes the size of the observed sample and <span><math><mi>d</mi></math></span> represents the dimension of the mixture. Using this result, we then construct a new non-parametric estimator based on the maximum likelihood estimator that converges with the parametric rate <span><math><msup><mrow><mi>n</mi></mrow><mrow><mo>−</mo><mn>1</mn><mo>/</mo><mn>2</mn></mrow></msup></math></span> in all <span><math><msub><mrow><mi>ℓ</mi></mrow><mrow><mi>p</mi></mrow></msub></math></span>-distances, for <span><math><mrow><mi>p</mi><mo>≥</mo><mn>1</mn></mrow></math></span>. These convergences rates are supported by simulations and the theory is illustrated using the famous Vélib dataset of the bike sharing system of Paris. We also introduce a testing procedure for whether the conditional independence assumption is satisfied for a given sample. This testing procedure is applied for several multivariate mixtures, with varying levels of dependence, and is thereby shown to distinguish well between conditionally independent and dependent mixtures. Finally, we use this testing procedure to investigate whether conditional independence holds for Vélib dataset.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105542"},"PeriodicalIF":1.4,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation for partially time-varying spatial autoregressive panel data model under linear constraints 线性约束下部分时变空间自回归面板数据模型的估计
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-11-19 DOI: 10.1016/j.jmva.2025.105547
Lingling Tian , Chuanhua Wei , Bing Sun , Mixia Wu
This paper investigates a constrained spatial autoregressive panel data model with fixed effects, partially linear time-varying coefficients, and time-varying spatial dependence. We propose a constrained profile two-stage least squares estimator and establish its asymptotic properties. Furthermore, a statistical test is constructed to examine whether the constant coefficients satisfy pre-specified linear constraints. Monte Carlo simulations under both independent and α-mixing error structures demonstrate the finite-sample performance of the proposed estimators and testing procedure. A real data example is provided to illustrate the practical applicability of the method. In addition, when the time dimension T is relatively small, a Block Bootstrap procedure is proposed to compute the p-value for the test.
本文研究了具有固定效应、部分线性时变系数和时变空间依赖性的约束性空间自回归面板数据模型。提出了一种约束轮廓两阶段最小二乘估计,并建立了它的渐近性质。此外,构造了一个统计检验来检验常系数是否满足预先规定的线性约束。在独立误差结构和α-混合误差结构下的蒙特卡罗模拟验证了所提估计器和测试方法的有限样本性能。通过一个实际的数据算例说明了该方法的实用性。此外,当时间维T较小时,提出了Block Bootstrap方法来计算检验的p值。
{"title":"Estimation for partially time-varying spatial autoregressive panel data model under linear constraints","authors":"Lingling Tian ,&nbsp;Chuanhua Wei ,&nbsp;Bing Sun ,&nbsp;Mixia Wu","doi":"10.1016/j.jmva.2025.105547","DOIUrl":"10.1016/j.jmva.2025.105547","url":null,"abstract":"<div><div>This paper investigates a constrained spatial autoregressive panel data model with fixed effects, partially linear time-varying coefficients, and time-varying spatial dependence. We propose a constrained profile two-stage least squares estimator and establish its asymptotic properties. Furthermore, a statistical test is constructed to examine whether the constant coefficients satisfy pre-specified linear constraints. Monte Carlo simulations under both independent and <span><math><mi>α</mi></math></span>-mixing error structures demonstrate the finite-sample performance of the proposed estimators and testing procedure. A real data example is provided to illustrate the practical applicability of the method. In addition, when the time dimension <span><math><mi>T</mi></math></span> is relatively small, a Block Bootstrap procedure is proposed to compute the <span><math><mi>p</mi></math></span>-value for the test.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105547"},"PeriodicalIF":1.4,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Multivariate Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1