{"title":"On GEE for Mean-Variance-Correlation Models: Variance Estimation and Model Selection.","authors":"Zhenyu Xu, Jason P Fine, Wenling Song, Jun Yan","doi":"10.1002/sim.10271","DOIUrl":null,"url":null,"abstract":"<p><p>Generalized estimating equations (GEE) are of great importance in analyzing clustered data without full specification of multivariate distributions. A recent approach by Luo and Pan jointly models the mean, variance, and correlation coefficients of clustered data through three sets of regressions. We note that it represents a specific case of the more general estimating equations proposed by Yan and Fine which further allow the variance to depend on the mean through a variance function. In certain scenarios, the proposed variance estimators for the variance and correlation parameters in Luo and Pan may face challenges due to the subtle dependence induced by the nested structure of the estimating equations. We characterize specific model settings where their variance estimation approach may encounter limitations and illustrate how the variance estimators in Yan and Fine can correctly account for such dependencies. In addition, we introduce a novel model selection criterion that enables the simultaneous selection of the mean-scale-correlation model. The sandwich variance estimator and the proposed model selection criterion are tested by several simulation studies and real data analysis, which validate its effectiveness in variance estimation and model selection. Our work also extends the R package geepack with the flexibility to apply different working covariance matrices for the variance and correlation structures.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":"e10271"},"PeriodicalIF":1.8000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/sim.10271","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/12 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Generalized estimating equations (GEE) are of great importance in analyzing clustered data without full specification of multivariate distributions. A recent approach by Luo and Pan jointly models the mean, variance, and correlation coefficients of clustered data through three sets of regressions. We note that it represents a specific case of the more general estimating equations proposed by Yan and Fine which further allow the variance to depend on the mean through a variance function. In certain scenarios, the proposed variance estimators for the variance and correlation parameters in Luo and Pan may face challenges due to the subtle dependence induced by the nested structure of the estimating equations. We characterize specific model settings where their variance estimation approach may encounter limitations and illustrate how the variance estimators in Yan and Fine can correctly account for such dependencies. In addition, we introduce a novel model selection criterion that enables the simultaneous selection of the mean-scale-correlation model. The sandwich variance estimator and the proposed model selection criterion are tested by several simulation studies and real data analysis, which validate its effectiveness in variance estimation and model selection. Our work also extends the R package geepack with the flexibility to apply different working covariance matrices for the variance and correlation structures.
期刊介绍:
The journal aims to influence practice in medicine and its associated sciences through the publication of papers on statistical and other quantitative methods. Papers will explain new methods and demonstrate their application, preferably through a substantive, real, motivating example or a comprehensive evaluation based on an illustrative example. Alternatively, papers will report on case-studies where creative use or technical generalizations of established methodology is directed towards a substantive application. Reviews of, and tutorials on, general topics relevant to the application of statistics to medicine will also be published. The main criteria for publication are appropriateness of the statistical methods to a particular medical problem and clarity of exposition. Papers with primarily mathematical content will be excluded. The journal aims to enhance communication between statisticians, clinicians and medical researchers.