首页 > 最新文献

Statistical Methods in Medical Research最新文献

英文 中文
Joint regression analysis of clustered current status data with latent variables. 对带有潜变量的聚类现状数据进行联合回归分析。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-23 DOI: 10.1177/09622802241280792
Yanqin Feng, Sijie Wu, Jieli Ding

Clustered current status data frequently occur in many fields of survival studies. Some potential factors related to the hazards of interest cannot be directly observed but are characterized through multiple correlated observable surrogates. In this article, we propose a joint modeling method for regression analysis of clustered current status data with latent variables and potentially informative cluster sizes. The proposed models consist of a factor analysis model to characterize latent variables through their multiple surrogates and an additive hazards frailty model to investigate covariate effects on the failure time and incorporate intra-cluster correlations. We develop an estimation procedure that combines the expectation-maximization algorithm and the weighted estimating equations. The consistency and asymptotic normality of the proposed estimators are established. The finite-sample performance of the proposed method is assessed via a series of simulation studies. This procedure is applied to analyze clustered current status data from the National Toxicology Program on a tumorigenicity study given by the United States Department of Health and Human Services.

在许多领域的生存研究中,经常会出现聚类现状数据。有些与相关危害相关的潜在因素无法直接观察到,但可以通过多个相关的可观察代用指标来描述。在本文中,我们提出了一种联合建模方法,用于对具有潜变量和潜在信息聚类大小的聚类现状数据进行回归分析。我们提出的模型包括一个因子分析模型和一个加性危险虚弱模型,前者通过多个代理变量来描述潜变量的特征,后者则用于研究协变量对失败时间的影响,并纳入聚类内部的相关性。我们开发了一种结合期望最大化算法和加权估计方程的估计程序。我们建立了所提出估计器的一致性和渐近正态性。通过一系列模拟研究评估了所提方法的有限样本性能。该程序被应用于分析美国卫生与公众服务部提供的国家毒理学计划关于肿瘤致病性研究的聚类现状数据。
{"title":"Joint regression analysis of clustered current status data with latent variables.","authors":"Yanqin Feng, Sijie Wu, Jieli Ding","doi":"10.1177/09622802241280792","DOIUrl":"https://doi.org/10.1177/09622802241280792","url":null,"abstract":"<p><p>Clustered current status data frequently occur in many fields of survival studies. Some potential factors related to the hazards of interest cannot be directly observed but are characterized through multiple correlated observable surrogates. In this article, we propose a joint modeling method for regression analysis of clustered current status data with latent variables and potentially informative cluster sizes. The proposed models consist of a factor analysis model to characterize latent variables through their multiple surrogates and an additive hazards frailty model to investigate covariate effects on the failure time and incorporate intra-cluster correlations. We develop an estimation procedure that combines the expectation-maximization algorithm and the weighted estimating equations. The consistency and asymptotic normality of the proposed estimators are established. The finite-sample performance of the proposed method is assessed via a series of simulation studies. This procedure is applied to analyze clustered current status data from the National Toxicology Program on a tumorigenicity study given by the United States Department of Health and Human Services.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802241280792"},"PeriodicalIF":1.6,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142508325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graphical methods to illustrate the nature of the relation between a continuous variable and the outcome when using restricted cubic splines with a Cox proportional hazards model. 在使用限制性三次样条和 Cox 比例危险模型时,用图形方法说明连续变量和结果之间关系的性质。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-21 DOI: 10.1177/09622802241287707
Peter C Austin

Restricted cubic splines (RCS) allow analysts to model nonlinear relations between continuous covariates and the outcome in a regression model. When using RCS with the Cox proportional hazards model, there is no longer a single hazard ratio for the continuous variable. Instead, the hazard ratio depends on the values of the covariate for the two individuals being compared. Thus, using age as an example, when one assumes a linear relation between age and the log-hazard of the outcome there is a single hazard ratio comparing any two individuals whose age differs by 1 year. However, when allowing for a nonlinear relation between age and the log-hazard of the outcome, the hazard ratio comparing the hazard of the outcome between a 31- and a 30-year-old may differ from the hazard ratio comparing the hazard of the outcome between an 81- and an 80-year-old. We describe four methods to describe graphically the relation between a continuous variable and the outcome when using RCS with a Cox model. These graphical methods are based on plots of relative hazard ratios, cumulative incidence, hazards, and cumulative hazards against the continuous variable. Using a case study of patients presenting to hospital with heart failure and a series of mathematical derivations, we illustrate that the four methods will produce qualitatively similar conclusions about the nature of the relation between a continuous variable and the outcome. Use of these methods will allow for an intuitive communication of the nature of the relation between the variable and the outcome.

受限三次样条(RCS)允许分析师在回归模型中模拟连续协变量与结果之间的非线性关系。将 RCS 与 Cox 比例危险模型结合使用时,连续变量的危险比不再是单一的。相反,危险比取决于被比较的两个个体的协变量值。因此,以年龄为例,如果假定年龄与结果的对数危险度之间存在线性关系,那么在年龄相差 1 岁的两个人之间进行比较,就会得出单一的危险比。但是,如果考虑到年龄与结果危害对数之间的非线性关系,那么比较 31 岁与 30 岁之间结果危害的危害比可能不同于比较 81 岁与 80 岁之间结果危害的危害比。我们介绍了四种在使用 RCS 和 Cox 模型时用图形描述连续变量和结果之间关系的方法。这些图形方法基于相对危险比、累积发病率、危险度和累积危险度与连续变量的关系图。通过对心力衰竭入院患者的病例研究和一系列数学推导,我们说明了这四种方法对连续变量和结果之间关系性质的定性结论是相似的。使用这些方法可以直观地了解变量与结果之间关系的性质。
{"title":"Graphical methods to illustrate the nature of the relation between a continuous variable and the outcome when using restricted cubic splines with a Cox proportional hazards model.","authors":"Peter C Austin","doi":"10.1177/09622802241287707","DOIUrl":"https://doi.org/10.1177/09622802241287707","url":null,"abstract":"<p><p>Restricted cubic splines (RCS) allow analysts to model nonlinear relations between continuous covariates and the outcome in a regression model. When using RCS with the Cox proportional hazards model, there is no longer a single hazard ratio for the continuous variable. Instead, the hazard ratio depends on the values of the covariate for the two individuals being compared. Thus, using age as an example, when one assumes a linear relation between age and the log-hazard of the outcome there is a single hazard ratio comparing any two individuals whose age differs by 1 year. However, when allowing for a nonlinear relation between age and the log-hazard of the outcome, the hazard ratio comparing the hazard of the outcome between a 31- and a 30-year-old may differ from the hazard ratio comparing the hazard of the outcome between an 81- and an 80-year-old. We describe four methods to describe graphically the relation between a continuous variable and the outcome when using RCS with a Cox model. These graphical methods are based on plots of relative hazard ratios, cumulative incidence, hazards, and cumulative hazards against the continuous variable. Using a case study of patients presenting to hospital with heart failure and a series of mathematical derivations, we illustrate that the four methods will produce qualitatively similar conclusions about the nature of the relation between a continuous variable and the outcome. Use of these methods will allow for an intuitive communication of the nature of the relation between the variable and the outcome.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802241287707"},"PeriodicalIF":1.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142475114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Delayed kernels for longitudinal survival analysis and dynamic prediction. 用于纵向生存分析和动态预测的延迟核。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 Epub Date: 2024-08-30 DOI: 10.1177/09622802241275382
Annabel Louisa Davies, Anthony Cc Coolen, Tobias Galla

Predicting patient survival probabilities based on observed covariates is an important assessment in clinical practice. These patient-specific covariates are often measured over multiple follow-up appointments. It is then of interest to predict survival based on the history of these longitudinal measurements, and to update predictions as more observations become available. The standard approaches to these so-called 'dynamic prediction' assessments are joint models and landmark analysis. Joint models involve high-dimensional parameterizations, and their computational complexity often prohibits including multiple longitudinal covariates. Landmark analysis is simpler, but discards a proportion of the available data at each 'landmark time'. In this work, we propose a 'delayed kernel' approach to dynamic prediction that sits somewhere in between the two standard methods in terms of complexity. By conditioning hazard rates directly on the covariate measurements over the observation time frame, we define a model that takes into account the full history of covariate measurements but is more practical and parsimonious than joint modelling. Time-dependent association kernels describe the impact of covariate changes at earlier times on the patient's hazard rate at later times. Under the constraints that our model (a) reduces to the standard Cox model for time-independent covariates, and (b) contains the instantaneous Cox model as a special case, we derive two natural kernel parameterizations. Upon application to three clinical data sets, we find that the predictive accuracy of the delayed kernel approach is comparable to that of the two existing standard methods.

根据观察到的协变量预测患者的生存概率是临床实践中的一项重要评估。这些患者特定的协变量通常是在多次随访中测量的。因此,根据这些纵向测量结果的历史预测生存率,并在获得更多观察结果后更新预测结果,是一项重要的工作。这些所谓 "动态预测 "评估的标准方法是联合模型和地标分析。联合模型涉及高维参数化,其计算复杂性往往使其无法包含多个纵向协变量。地标分析较为简单,但会在每个 "地标时间 "放弃一部分可用数据。在这项工作中,我们提出了一种 "延迟核 "动态预测方法,其复杂程度介于这两种标准方法之间。通过将危险率直接与观测时间框架内的协变量测量值挂钩,我们定义了一个模型,该模型考虑到了协变量测量值的全部历史,但比联合建模更实用、更简洁。与时间相关的关联核描述了协变量在早期的变化对患者后期危险率的影响。我们的模型(a)简化为时间无关协变量的标准 Cox 模型,(b)包含作为特例的瞬时 Cox 模型,在这两个约束条件下,我们得出了两个自然的核参数。通过对三个临床数据集的应用,我们发现延迟核方法的预测准确性与现有的两种标准方法相当。
{"title":"Delayed kernels for longitudinal survival analysis and dynamic prediction.","authors":"Annabel Louisa Davies, Anthony Cc Coolen, Tobias Galla","doi":"10.1177/09622802241275382","DOIUrl":"10.1177/09622802241275382","url":null,"abstract":"<p><p>Predicting patient survival probabilities based on observed covariates is an important assessment in clinical practice. These patient-specific covariates are often measured over multiple follow-up appointments. It is then of interest to predict survival based on the history of these longitudinal measurements, and to update predictions as more observations become available. The standard approaches to these so-called 'dynamic prediction' assessments are joint models and landmark analysis. Joint models involve high-dimensional parameterizations, and their computational complexity often prohibits including multiple longitudinal covariates. Landmark analysis is simpler, but discards a proportion of the available data at each 'landmark time'. In this work, we propose a 'delayed kernel' approach to dynamic prediction that sits somewhere in between the two standard methods in terms of complexity. By conditioning hazard rates directly on the covariate measurements over the observation time frame, we define a model that takes into account the full history of covariate measurements but is more practical and parsimonious than joint modelling. Time-dependent association kernels describe the impact of covariate changes at earlier times on the patient's hazard rate at later times. Under the constraints that our model (a) reduces to the standard Cox model for time-independent covariates, and (b) contains the instantaneous Cox model as a special case, we derive two natural kernel parameterizations. Upon application to three clinical data sets, we find that the predictive accuracy of the delayed kernel approach is comparable to that of the two existing standard methods.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1836-1858"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577694/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142112180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A tight fit of the SIR dynamic epidemic model to daily cases of COVID-19 reported during the 2021-2022 Omicron surge in New York City: A novel approach. 将 SIR 动态流行病模型与纽约市 2021-2022 年 Omicron 疫情激增期间报告的 COVID-19 每日病例紧密拟合:一种新方法。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 DOI: 10.1177/09622802241277956
Jeffrey E Harris

We describe a novel approach for recovering the underlying parameters of the SIR dynamic epidemic model from observed data on case incidence. We formulate a discrete-time approximation of the original continuous-time model and search for the parameter vector that minimizes the standard least squares criterion function. We show that the gradient vector and matrix of second-order derivatives of the criterion function with respect to the parameters adhere to their own systems of difference equations and thus can be exactly calculated iteratively. Applying our new approach, we estimated a four-parameter SIR model from daily reported cases of COVID-19 during the SARS-CoV-2 Omicron/BA.1 surge of December 2021-March 2022 in New York City. The estimated SIR model showed a tight fit to the observed data, but less so when we excluded residual cases attributable to the Delta variant during the initial upswing of the wave in December. Our analyses of both the real-world COVID-19 data and simulated case incidence data revealed an important problem of weak parameter identification. While our methods permitted for the separate estimation of the infection transmission parameter and the infection persistence parameter, only a linear combination of these two key parameters could be estimated with precision. The SIR model appears to be an adequate reduced-form description of the Omicron surge, but it is not necessarily the correct structural model. Prior information above and beyond case incidence data may be required to sharply identify the parameters and thus distinguish between alternative epidemic models.

我们介绍了一种从病例发生率观测数据中恢复 SIR 动态流行病模型基本参数的新方法。我们提出了原始连续时间模型的离散时间近似值,并寻找能使标准最小二乘法准则函数最小化的参数向量。我们证明,判据函数相对于参数的梯度向量和二阶导数矩阵遵循各自的差分方程组,因此可以通过迭代精确计算。应用我们的新方法,我们从 2021 年 12 月至 2022 年 3 月纽约市 SARS-CoV-2 Omicron/BA.1 高峰期间每日报告的 COVID-19 病例中估算出了一个四参数 SIR 模型。估算出的 SIR 模型与观察到的数据非常吻合,但当我们剔除了 12 月病例潮最初上升阶段的三角洲变异体残留病例后,与观察到的数据的吻合程度就降低了。我们对实际 COVID-19 数据和模拟病例发病率数据的分析表明了一个重要的问题,即参数识别能力较弱。虽然我们的方法允许对感染传播参数和感染持续参数进行单独估计,但只能对这两个关键参数的线性组合进行精确估计。SIR 模型似乎是对奥米克龙激增的充分简化描述,但并不一定是正确的结构模型。除了病例发生率数据之外,可能还需要其他先验信息来精确确定参数,从而区分其他流行病模型。
{"title":"A tight fit of the SIR dynamic epidemic model to daily cases of COVID-19 reported during the 2021-2022 Omicron surge in New York City: A novel approach.","authors":"Jeffrey E Harris","doi":"10.1177/09622802241277956","DOIUrl":"10.1177/09622802241277956","url":null,"abstract":"<p><p>We describe a novel approach for recovering the underlying parameters of the SIR dynamic epidemic model from observed data on case incidence. We formulate a discrete-time approximation of the original continuous-time model and search for the parameter vector that minimizes the standard least squares criterion function. We show that the gradient vector and matrix of second-order derivatives of the criterion function with respect to the parameters adhere to their own systems of difference equations and thus can be exactly calculated iteratively. Applying our new approach, we estimated a four-parameter SIR model from daily reported cases of COVID-19 during the SARS-CoV-2 Omicron/BA.1 surge of December 2021-March 2022 in New York City. The estimated SIR model showed a tight fit to the observed data, but less so when we excluded residual cases attributable to the Delta variant during the initial upswing of the wave in December. Our analyses of both the real-world COVID-19 data and simulated case incidence data revealed an important problem of weak parameter identification. While our methods permitted for the separate estimation of the infection transmission parameter and the infection persistence parameter, only a linear combination of these two key parameters could be estimated with precision. The SIR model appears to be an adequate reduced-form description of the Omicron surge, but it is not necessarily the correct structural model. Prior information above and beyond case incidence data may be required to sharply identify the parameters and thus distinguish between alternative epidemic models.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":"33 10","pages":"1877-1898"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577707/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142669223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint modeling of zero-inflated longitudinal measurements and time-to-event outcomes with applications to dynamic prediction. 零膨胀纵向测量和时间到事件结果的联合建模,并应用于动态预测。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 Epub Date: 2024-10-07 DOI: 10.1177/09622802241268466
Mojtaba Ganjali, Taban Baghfalaki, Narayanaswamy Balakrishnan

In this article, we present a joint modeling approach for zero-inflated longitudinal count measurements and time-to-event outcomes. For the longitudinal sub-model, a mixed effects Hurdle model is utilized, incorporating various distributional assumptions such as zero-inflated Poisson, zero-inflated negative binomial, or zero-inflated generalized Poisson. For the time-to-event sub-model, a Cox proportional hazard model is applied. For the functional form linking the longitudinal outcome history to the hazard of the event, a linear combination is used. This combination is derived from the current values of the linear predictors of Hurdle mixed effects. Some other forms are also considered, including a linear combination of the current slopes of the linear predictors of Hurdle mixed effects as well as the shared random effects. A Markov chain Monte Carlo method is implemented for Bayesian parameter estimation. Dynamic prediction using joint modeling is highly valuable in personalized medicine, as discussed here for joint modeling of zero-inflated longitudinal count measurements and time-to-event outcomes. We assess and demonstrate the effectiveness of the proposed joint models through extensive simulation studies, with a specific emphasis on parameter estimation and dynamic predictions for both over-dispersed and under-dispersed data. We finally apply the joint model to longitudinal microbiome pregnancy and HIV data sets.

在本文中,我们介绍了一种针对零膨胀纵向计数测量和时间到事件结果的联合建模方法。在纵向子模型中,我们采用了混合效应飓风模型,并结合了零膨胀泊松、零膨胀负二项或零膨胀广义泊松等各种分布假设。在时间到事件子模型中,采用了考克斯比例危险模型。对于将纵向结果历史与事件危害联系起来的函数形式,采用的是线性组合。该组合由赫尔德混合效应线性预测因子的当前值推导得出。还考虑了一些其他形式,包括赫尔德混合效应线性预测因子当前斜率的线性组合以及共享随机效应。采用马尔科夫链蒙特卡罗方法进行贝叶斯参数估计。使用联合建模进行动态预测在个性化医疗中具有很高的价值,本文讨论了零膨胀纵向计数测量和时间到事件结果的联合建模。我们通过大量的模拟研究评估并证明了所提出的联合模型的有效性,特别强调了参数估计以及对过度分散和过度分散数据的动态预测。最后,我们将联合模型应用于纵向妊娠微生物组和艾滋病数据集。
{"title":"Joint modeling of zero-inflated longitudinal measurements and time-to-event outcomes with applications to dynamic prediction.","authors":"Mojtaba Ganjali, Taban Baghfalaki, Narayanaswamy Balakrishnan","doi":"10.1177/09622802241268466","DOIUrl":"10.1177/09622802241268466","url":null,"abstract":"<p><p>In this article, we present a joint modeling approach for zero-inflated longitudinal count measurements and time-to-event outcomes. For the longitudinal sub-model, a mixed effects Hurdle model is utilized, incorporating various distributional assumptions such as zero-inflated Poisson, zero-inflated negative binomial, or zero-inflated generalized Poisson. For the time-to-event sub-model, a Cox proportional hazard model is applied. For the functional form linking the longitudinal outcome history to the hazard of the event, a linear combination is used. This combination is derived from the current values of the linear predictors of Hurdle mixed effects. Some other forms are also considered, including a linear combination of the current slopes of the linear predictors of Hurdle mixed effects as well as the shared random effects. A Markov chain Monte Carlo method is implemented for Bayesian parameter estimation. Dynamic prediction using joint modeling is highly valuable in personalized medicine, as discussed here for joint modeling of zero-inflated longitudinal count measurements and time-to-event outcomes. We assess and demonstrate the effectiveness of the proposed joint models through extensive simulation studies, with a specific emphasis on parameter estimation and dynamic predictions for both over-dispersed and under-dispersed data. We finally apply the joint model to longitudinal microbiome pregnancy and HIV data sets.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1731-1767"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142381665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Average treatment effect on the treated, under lack of positivity. 在缺乏积极性的情况下,对被治疗者的平均治疗效果。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 Epub Date: 2024-09-09 DOI: 10.1177/09622802241269646
Yi Liu, Huiyue Li, Yunji Zhou, Roland A Matsouaka

The use of propensity score methods has become ubiquitous in causal inference. At the heart of these methods is the positivity assumption. Violation of the positivity assumption leads to the presence of extreme propensity score weights when estimating average causal effects, which affects statistical inference. To circumvent this issue, trimming or truncating methods have been widely used. Unfortunately, these methods require that we pre-specify a threshold. There are a number of alternative methods to deal with the lack of positivity when we estimate the average treatment effect (ATE). However, no other methods exist beyond trimming and truncation to deal with the same issue when the goal is to estimate the average treatment effect on the treated (ATT). In this article, we propose a propensity score weight-based alternative for the ATT, called overlap weighted average treatment effect on the treated. The appeal of our proposed method lies in its ability to obtain similar or even better results than trimming and truncation while relaxing the constraint to choose an a priori threshold (or related measures). The performance of the proposed method is illustrated via a series of Monte Carlo simulations and a data analysis on racial disparities in health care expenditures.

在因果推断中,倾向得分法的使用已变得无处不在。这些方法的核心是积极性假设。违反积极性假设会导致在估计平均因果效应时出现极端倾向得分权重,从而影响统计推断。为了规避这一问题,修剪或截断方法得到了广泛应用。遗憾的是,这些方法要求我们预先指定一个阈值。当我们估计平均治疗效果(ATE)时,有许多替代方法可以解决缺乏正向性的问题。然而,当我们的目标是估计平均治疗效果(ATT)时,除了修剪和截断外,还没有其他方法来解决同样的问题。在本文中,我们提出了一种基于倾向得分权重的 ATT 替代方法,称为 "重叠加权平均治疗效果"。我们提出的方法的吸引力在于,它能够获得与修剪和截断相似甚至更好的结果,同时放宽了选择先验阈值(或相关测量方法)的限制。我们通过一系列蒙特卡罗模拟和医疗支出种族差异的数据分析来说明所提方法的性能。
{"title":"Average treatment effect on the treated, under lack of positivity.","authors":"Yi Liu, Huiyue Li, Yunji Zhou, Roland A Matsouaka","doi":"10.1177/09622802241269646","DOIUrl":"10.1177/09622802241269646","url":null,"abstract":"<p><p>The use of propensity score methods has become ubiquitous in causal inference. At the heart of these methods is the positivity assumption. Violation of the positivity assumption leads to the presence of extreme propensity score weights when estimating average causal effects, which affects statistical inference. To circumvent this issue, trimming or truncating methods have been widely used. Unfortunately, these methods require that we pre-specify a threshold. There are a number of alternative methods to deal with the lack of positivity when we estimate the average treatment effect (ATE). However, no other methods exist beyond trimming and truncation to deal with the same issue when the goal is to estimate the average treatment effect on the treated (ATT). In this article, we propose a propensity score weight-based alternative for the ATT, called overlap weighted average treatment effect on the treated. The appeal of our proposed method lies in its ability to obtain similar or even better results than trimming and truncation while relaxing the constraint to choose an a priori threshold (or related measures). The performance of the proposed method is illustrated via a series of Monte Carlo simulations and a data analysis on racial disparities in health care expenditures.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1689-1717"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142154972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible joint model for time-to-event and non-Gaussian longitudinal outcomes. 时间到事件和非高斯纵向结果的灵活联合模型。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 Epub Date: 2024-09-09 DOI: 10.1177/09622802241269010
Hortense Doms, Philippe Lambert, Catherine Legrand

In medical studies, repeated measurements of biomarkers and time-to-event data are often collected during the follow-up period. To assess the association between these two outcomes, joint models are frequently considered. The most common approach uses a linear mixed model for the longitudinal part and a proportional hazard model for the survival part. The latter assumes a linear relationship between the survival covariates and the log hazard. In this work, we propose an extension allowing the inclusion of nonlinear covariate effects in the survival model using Bayesian penalized B-splines. Our model is valid for non-Gaussian longitudinal responses since we use a generalized linear mixed model for the longitudinal process. A simulation study shows that our method gives good statistical performance and highlights the importance of taking into account the possible nonlinear effects of certain survival covariates. Data from patients with a first progression of glioblastoma are analysed to illustrate the method.

在医学研究中,通常会在随访期间重复测量生物标志物和时间到事件数据。为了评估这两种结果之间的关联,通常会考虑联合模型。最常见的方法是纵向部分使用线性混合模型,生存部分使用比例危险模型。后者假定生存协变量与对数危险之间存在线性关系。在这项工作中,我们提出了一种扩展方法,允许在使用贝叶斯惩罚性 B 样条的生存模型中加入非线性协变量效应。我们的模型适用于非高斯纵向响应,因为我们对纵向过程使用了广义线性混合模型。模拟研究表明,我们的方法具有良好的统计性能,并强调了考虑某些生存协变量可能产生的非线性效应的重要性。我们对胶质母细胞瘤首次进展期患者的数据进行了分析,以说明该方法。
{"title":"Flexible joint model for time-to-event and non-Gaussian longitudinal outcomes.","authors":"Hortense Doms, Philippe Lambert, Catherine Legrand","doi":"10.1177/09622802241269010","DOIUrl":"10.1177/09622802241269010","url":null,"abstract":"<p><p>In medical studies, repeated measurements of biomarkers and time-to-event data are often collected during the follow-up period. To assess the association between these two outcomes, joint models are frequently considered. The most common approach uses a linear mixed model for the longitudinal part and a proportional hazard model for the survival part. The latter assumes a linear relationship between the survival covariates and the log hazard. In this work, we propose an extension allowing the inclusion of nonlinear covariate effects in the survival model using Bayesian penalized B-splines. Our model is valid for non-Gaussian longitudinal responses since we use a generalized linear mixed model for the longitudinal process. A simulation study shows that our method gives good statistical performance and highlights the importance of taking into account the possible nonlinear effects of certain survival covariates. Data from patients with a first progression of glioblastoma are analysed to illustrate the method.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1783-1799"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142154973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic survival analysis: Modelling the hazard function via ordinary differential equations. 动态生存分析:通过常微分方程建立危险函数模型
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 Epub Date: 2024-08-20 DOI: 10.1177/09622802241268504
J Andres Christen, F Javier Rubio

The hazard function represents one of the main quantities of interest in the analysis of survival data. We propose a general approach for parametrically modelling the dynamics of the hazard function using systems of autonomous ordinary differential equations (ODEs). This modelling approach can be used to provide qualitative and quantitative analyses of the evolution of the hazard function over time. Our proposal capitalises on the extensive literature on ODEs which, in particular, allows for establishing basic rules or laws on the dynamics of the hazard function via the use of autonomous ODEs. We show how to implement the proposed modelling framework in cases where there is an analytic solution to the system of ODEs or where an ODE solver is required to obtain a numerical solution. We focus on the use of a Bayesian modelling approach, but the proposed methodology can also be coupled with maximum likelihood estimation. A simulation study is presented to illustrate the performance of these models and the interplay of sample size and censoring. Two case studies using real data are presented to illustrate the use of the proposed approach and to highlight the interpretability of the corresponding models. We conclude with a discussion on potential extensions of our work and strategies to include covariates into our framework. Although we focus on examples of Medical Statistics, the proposed framework is applicable in any context where the interest lies in estimating and interpreting the dynamics of the hazard function.

危害函数是生存数据分析中的主要关注量之一。我们提出了一种使用自主常微分方程(ODE)系统对危险函数动态进行参数化建模的通用方法。这种建模方法可用于对危害函数随时间的演变进行定性和定量分析。我们的建议利用了有关 ODE 的大量文献,特别是通过使用自主 ODE,可以建立有关危害函数动态的基本规则或规律。我们展示了如何在 ODEs 系统有解析解或需要使用 ODEs 求解器获得数值解的情况下实施所建议的建模框架。我们将重点放在贝叶斯建模方法的使用上,但所提出的方法也可以与最大似然估计相结合。我们介绍了一项模拟研究,以说明这些模型的性能以及样本大小和删减的相互作用。我们还介绍了两个使用真实数据的案例研究,以说明建议方法的使用情况,并强调相应模型的可解释性。最后,我们讨论了我们工作的潜在扩展以及将协变量纳入我们框架的策略。虽然我们侧重于医学统计方面的例子,但所提出的框架适用于任何对估计和解释危险函数动态感兴趣的情况。
{"title":"Dynamic survival analysis: Modelling the hazard function via ordinary differential equations.","authors":"J Andres Christen, F Javier Rubio","doi":"10.1177/09622802241268504","DOIUrl":"10.1177/09622802241268504","url":null,"abstract":"<p><p>The hazard function represents one of the main quantities of interest in the analysis of survival data. We propose a general approach for parametrically modelling the <i>dynamics</i> of the hazard function using systems of autonomous ordinary differential equations (ODEs). This modelling approach can be used to provide qualitative and quantitative analyses of the evolution of the hazard function over time. Our proposal capitalises on the extensive literature on ODEs which, in particular, allows for establishing basic rules or laws on the dynamics of the hazard function via the use of autonomous ODEs. We show how to implement the proposed modelling framework in cases where there is an analytic solution to the system of ODEs or where an ODE solver is required to obtain a numerical solution. We focus on the use of a Bayesian modelling approach, but the proposed methodology can also be coupled with maximum likelihood estimation. A simulation study is presented to illustrate the performance of these models and the interplay of sample size and censoring. Two case studies using real data are presented to illustrate the use of the proposed approach and to highlight the interpretability of the corresponding models. We conclude with a discussion on potential extensions of our work and strategies to include covariates into our framework. Although we focus on examples of Medical Statistics, the proposed framework is applicable in any context where the interest lies in estimating and interpreting the dynamics of the hazard function.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1768-1782"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11577698/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142005270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian beta-binomial piecewise growth mixture model for longitudinal overdispersed binomial data. 针对纵向过度分散二项数据的贝叶斯贝塔-二项式片断增长混合模型。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 Epub Date: 2024-10-07 DOI: 10.1177/09622802241279109
Chun-Che Wen, Nathaniel Baker, Rajib Paul, Elizabeth Hill, Kelly Hunt, Hong Li, Kevin Gray, Brian Neelon

In a recent 12-week smoking cessation trial, varenicline tartrate failed to show significant improvements in enhancing end-of-treatment abstinence when compared with placebo among adolescents and young adults. The original analysis aimed to assess the average effect across the entire population using timeline followback methods, which typically involve overdispersed binomial counts. We instead propose to investigate treatment effect heterogeneity among latent classes of participants using a Bayesian beta-binomial piecewise linear growth mixture model specifically designed to address longitudinal overdispersed binomial responses. Within each class, we fit a piecewise linear beta-binomial mixed model with random changepoints for each study group to detect critical windows of treatment efficacy. Using this model, we can cluster subjects who share similar characteristics, estimate the class-specific mean abstinence trends for each study group, and quantify the treatment effect over time within each class. Our analysis identified two classes of subjects: one comprising high-abstinent individuals, typically young adults and light smokers, in which varenicline led to improved abstinence; and another comprising low-abstinent individuals for whom varenicline showed no discernible effect. These findings highlight the importance of tailoring varenicline to specific participant subgroups, thereby advancing precision medicine in smoking cessation studies.

在最近一项为期 12 周的戒烟试验中,与安慰剂相比,酒石酸伐尼克兰在提高青少年和年轻成人治疗结束后的戒烟率方面没有显示出明显的改善。最初的分析旨在使用时间轴回溯法评估整个人群的平均效果,这种方法通常涉及过度分散的二项式计数。相反,我们建议使用贝叶斯β-二叉片断线性增长混合模型来研究潜在参与者类别之间的治疗效果异质性,该模型是专门为解决纵向过度分散二叉反应而设计的。在每个类别中,我们拟合了一个片断线性贝塔-二叉混合模型,每个研究组都有随机变化点,以检测治疗效果的临界窗口。利用该模型,我们可以对具有相似特征的受试者进行分组,估算出每个研究组的特定班级平均戒断趋势,并量化每个班级随时间变化的治疗效果。我们的分析确定了两类受试者:一类是高戒断率人群,通常是年轻人和轻度吸烟者,伐伦克林提高了他们的戒断率;另一类是低戒断率人群,伐伦克林对他们没有明显效果。这些发现凸显了针对特定参与者亚群定制伐尼克兰的重要性,从而推动了戒烟研究中的精准医疗。
{"title":"A Bayesian beta-binomial piecewise growth mixture model for longitudinal overdispersed binomial data.","authors":"Chun-Che Wen, Nathaniel Baker, Rajib Paul, Elizabeth Hill, Kelly Hunt, Hong Li, Kevin Gray, Brian Neelon","doi":"10.1177/09622802241279109","DOIUrl":"10.1177/09622802241279109","url":null,"abstract":"<p><p>In a recent 12-week smoking cessation trial, varenicline tartrate failed to show significant improvements in enhancing end-of-treatment abstinence when compared with placebo among adolescents and young adults. The original analysis aimed to assess the average effect across the entire population using timeline followback methods, which typically involve overdispersed binomial counts. We instead propose to investigate treatment effect heterogeneity among latent classes of participants using a Bayesian beta-binomial piecewise linear growth mixture model specifically designed to address longitudinal overdispersed binomial responses. Within each class, we fit a piecewise linear beta-binomial mixed model with random changepoints for each study group to detect critical windows of treatment efficacy. Using this model, we can cluster subjects who share similar characteristics, estimate the class-specific mean abstinence trends for each study group, and quantify the treatment effect over time within each class. Our analysis identified two classes of subjects: one comprising high-abstinent individuals, typically young adults and light smokers, in which varenicline led to improved abstinence; and another comprising low-abstinent individuals for whom varenicline showed no discernible effect. These findings highlight the importance of tailoring varenicline to specific participant subgroups, thereby advancing precision medicine in smoking cessation studies.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1859-1876"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142381663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the estimation of population size-A comparison of capture-recapture and multiplier-benchmark methods. 关于种群规模的估计--捕获-再捕获法与乘数基准法的比较。
IF 1.6 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2024-10-01 Epub Date: 2024-09-30 DOI: 10.1177/09622802241275413
Jianing Wang, David M Kline, Laura Forsberg White

Approaches to population size estimation are of importance across a wide spectrum of disciplines, especially when census and simple random sampling are impractical. The capture-recapture method and the multiplier-benchmark method are two commonly used approaches that use data that partially capture the target population and overlap in a known way. Due to similarities in required data structures, the approaches are often used interchangeably without a critical appraisal of the underlying assumptions, especially in the two-sample case. Here, we describe the similarities and differences of the sampling mechanisms and assumptions underlying both approaches. We emphasize that the capture-recapture method assumes data sources as random samples and describes two-way inclusion histories, while in multiplier-benchmark method, one source captures a fixed sub-population, and the one-way inclusion histories are modeled. We also discuss the implications of these differences through simulation and real data to guide the choice of method in practice. A careful study of the data structures, relationships, and data generation processes is crucial for assessing the appropriateness of using these methods.

人口规模估算方法在众多学科中都很重要,尤其是在人口普查和简单随机抽样不切实际的情况下。捕获-再捕获法和乘数-基准法是两种常用的方法,它们使用的数据部分捕获了目标人口,并以已知的方式重叠。由于所需的数据结构相似,这两种方法经常被交替使用,而不对其基本假设进行批判性评估,尤其是在两个样本的情况下。在此,我们将介绍这两种方法的取样机制和基本假设的异同。我们强调,捕获-再捕获法将数据源假定为随机样本,并描述了双向包含历史;而在乘数基准法中,一个数据源捕获一个固定的子总体,并对单向包含历史进行建模。我们还通过模拟和真实数据讨论了这些差异的影响,以指导实践中方法的选择。仔细研究数据结构、关系和数据生成过程对于评估使用这些方法是否合适至关重要。
{"title":"On the estimation of population size-A comparison of capture-recapture and multiplier-benchmark methods.","authors":"Jianing Wang, David M Kline, Laura Forsberg White","doi":"10.1177/09622802241275413","DOIUrl":"10.1177/09622802241275413","url":null,"abstract":"<p><p>Approaches to population size estimation are of importance across a wide spectrum of disciplines, especially when census and simple random sampling are impractical. The capture-recapture method and the multiplier-benchmark method are two commonly used approaches that use data that partially capture the target population and overlap in a known way. Due to similarities in required data structures, the approaches are often used interchangeably without a critical appraisal of the underlying assumptions, especially in the two-sample case. Here, we describe the similarities and differences of the sampling mechanisms and assumptions underlying both approaches. We emphasize that the capture-recapture method assumes data sources as random samples and describes two-way inclusion histories, while in multiplier-benchmark method, one source captures a fixed sub-population, and the one-way inclusion histories are modeled. We also discuss the implications of these differences through simulation and real data to guide the choice of method in practice. A careful study of the data structures, relationships, and data generation processes is crucial for assessing the appropriateness of using these methods.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"1818-1835"},"PeriodicalIF":1.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142354183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Methods in Medical Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1