首页 > 最新文献

Statistical Methods in Medical Research最新文献

英文 中文
Rank-based methods for assessing equivalence/non-inferiority with assay sensitivity in a three-arm trial with ordinal endpoints. 在一项顺序终点的三组试验中评估等效性/非劣效性的基于秩的方法。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-02-11 DOI: 10.1177/09622802261417216
Shi-Fang Qiu, Dai-Min Li, Wai-Yin Poon

Various approaches have been developed to assess equivalence/non-inferiority with assay sensitivity in a three-arm trial with continuous or discrete endpoints. However, there is little work done on ordinal endpoints. Ordinal data do not have metric information, the method for analyzing metric endpoints can systematically lead to errors for ordinal observations. The win probability that a subject receiving one treatment achieves a better outcome (or "wins" against) compared to a subject receiving the other treatment, is developed to quantify the treatment effect. In this article, the equivalence/non-inferiority with assay sensitivity in a three-arm trial are assessed by the win probabilities from the perspective of simultaneous confidence intervals (SCIs). The proposed methods can be applied to studies with ordinal or continuous outcomes without making parametric assumptions. Empirical results show that the Fisher-z transformation-based SCI, the method of variance estimates recovery SCIs combing with logit transformation, logit with arcsinh transformation confidence limits perform well in the sense that their empirical coverage probabilities are pretty close to the nominal confidence level. Sample size determination for achieving the pre-specified power is also investigated according to the duality of hypothesis testing and interval estimation. An example taken from the study of prophylaxis of postoperative nausea and vomiting is used to illustrate the proposed methods.

在连续或离散终点的三组试验中,已经开发了各种方法来评估等效性/非劣效性。然而,在有序端点上做的工作很少。序数数据不包含度量信息,分析度量端点的方法会系统性地导致序数观测的误差。与接受另一种治疗的受试者相比,接受一种治疗的受试者获得更好结果(或“赢”)的获胜概率是用来量化治疗效果的。在本文中,从同时置信区间(SCIs)的角度,通过获胜概率来评估三组试验中具有检测敏感性的等效性/非劣效性。所提出的方法可以应用于具有有序或连续结果的研究,而无需进行参数假设。实证结果表明,基于Fisher-z变换的SCI、方差估计法与logit变换相结合的恢复SCI、logit与arcsinh变换置信限相结合的SCI表现良好,它们的经验覆盖概率非常接近名义置信水平。根据假设检验和区间估计的对偶性,研究了实现预定功率的样本量确定。从研究预防术后恶心和呕吐的一个例子是用来说明所提出的方法。
{"title":"Rank-based methods for assessing equivalence/non-inferiority with assay sensitivity in a three-arm trial with ordinal endpoints.","authors":"Shi-Fang Qiu, Dai-Min Li, Wai-Yin Poon","doi":"10.1177/09622802261417216","DOIUrl":"https://doi.org/10.1177/09622802261417216","url":null,"abstract":"<p><p>Various approaches have been developed to assess equivalence/non-inferiority with assay sensitivity in a three-arm trial with continuous or discrete endpoints. However, there is little work done on ordinal endpoints. Ordinal data do not have metric information, the method for analyzing metric endpoints can systematically lead to errors for ordinal observations. The win probability that a subject receiving one treatment achieves a better outcome (or \"wins\" against) compared to a subject receiving the other treatment, is developed to quantify the treatment effect. In this article, the equivalence/non-inferiority with assay sensitivity in a three-arm trial are assessed by the win probabilities from the perspective of simultaneous confidence intervals (SCIs). The proposed methods can be applied to studies with ordinal or continuous outcomes without making parametric assumptions. Empirical results show that the Fisher-z transformation-based SCI, the method of variance estimates recovery SCIs combing with logit transformation, logit with arcsinh transformation confidence limits perform well in the sense that their empirical coverage probabilities are pretty close to the nominal confidence level. Sample size determination for achieving the pre-specified power is also investigated according to the duality of hypothesis testing and interval estimation. An example taken from the study of prophylaxis of postoperative nausea and vomiting is used to illustrate the proposed methods.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802261417216"},"PeriodicalIF":1.9,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146166870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid prior Bayesian method for combining domestic real-world data and overseas data in global drug development. 结合国内外药物研发数据的混合先验贝叶斯方法。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-02-06 DOI: 10.1177/09622802251414586
Keer Chen, Zengyue Zheng, Pengfei Zhu, Shuping Jiang, Nan Li, Jumin Deng, Pingyan Chen, Zhenyu Wu, Ying Wu

BackgroundHybrid clinical trial design integrates traditional randomized controlled trials (RCTs) with real-world data (RWD), aiming to enhance trial efficiency through dynamic incorporation of external data (External trial data and RWD). However, existing methods, such as the Meta-Analytic Predictive (MAP) Prior, exhibit serious limitations in controlling data heterogeneity, adjusting baseline discrepancies, and optimizing dynamic borrowing proportions. These limitations often introduce external bias or compromise evidence reliability, hindering their application in complex analyses like bridging trials and multi-regional clinical trials (MRCTs).ObjectiveThis study proposes a novel hybrid Bayesian framework, EQPS Robust MAP (rMAP), to address heterogeneity and bias in multi-source data integration. Its feasibility and robustness are validated through systematic simulations and retrospective case analyses, using two independent datasets to evaluate the effect of Risankizumab in patients with moderate-to-severe plaque psoriasis.Design and MethodsThe EQPS-rMAP method operates in three stages: (1) Eliminating baseline covariate discrepancies through propensity score stratification; (2) constructing stratum-specific MAP priors to dynamically adjust weights for external data; and (3) introducing equivalence probability weights to quantify data conflict risks. The study evaluates the method's performance across six simulated analyses (heterogeneity differences, baseline shifts, etc.), comparing it with traditional methods (MAP, PSMAP, Empirical Bayes MAP) in terms of estimation bias, type I error control, and sample size requirements. Real-world case analyses further validate its applicability.ResultsSimulations demonstrate that EQPS-rMAP maintains estimation robustness under considerable heterogeneity while reducing sample size demands and enhancing trial efficiency. Case analyses confirm its ability to control external bias while preserving high estimation accuracy compared to conventional approaches.ConclusionThe EQPS-rMAP method provides empirical evidence for the feasibility of hybrid clinical designs. Its methodological advancements-resolving baseline and heterogeneity conflicts through adaptive mechanisms-offer broader applicability for integrating external and RWD across diverse analyses, including bridging trials, MRCTs, and post-marketing studies.

混合临床试验设计将传统的随机对照试验(RCTs)与真实世界数据(RWD)相结合,旨在通过外部数据(外部试验数据和RWD)的动态结合来提高试验效率。然而,现有的方法,如Meta-Analytic Predictive (MAP) Prior,在控制数据异质性、调整基线差异和优化动态借贷比例方面存在严重局限性。这些限制通常会引入外部偏倚或损害证据可靠性,阻碍了它们在桥接试验和多区域临床试验(mrct)等复杂分析中的应用。目的提出一种新的混合贝叶斯框架EQPS鲁棒MAP (rMAP),以解决多源数据集成中的异质性和偏倚问题。通过系统模拟和回顾性病例分析验证其可行性和稳健性,使用两个独立的数据集评估Risankizumab对中重度斑块型银屑病患者的疗效。设计与方法EQPS-rMAP方法分为三个阶段:(1)通过倾向评分分层消除基线协变量差异;(2)构建分层MAP先验,动态调整外部数据权值;(3)引入等价概率权重,量化数据冲突风险。本研究通过六种模拟分析(异质性差异、基线偏移等)评估了该方法的性能,并将其与传统方法(MAP、PSMAP、Empirical Bayes MAP)在估计偏差、I型误差控制和样本量要求方面进行了比较。实际案例分析进一步验证了其适用性。结果仿真结果表明,EQPS-rMAP在较大异质性下保持了估计稳健性,同时减少了样本量需求,提高了试验效率。案例分析证实了与传统方法相比,该方法能够控制外部偏差,同时保持较高的估计精度。结论EQPS-rMAP方法为混合临床设计的可行性提供了经验依据。其方法上的进步——通过适应性机制解决基线和异质性冲突——为整合外部和RWD的不同分析提供了更广泛的适用性,包括桥接试验、mrct和上市后研究。
{"title":"A hybrid prior Bayesian method for combining domestic real-world data and overseas data in global drug development.","authors":"Keer Chen, Zengyue Zheng, Pengfei Zhu, Shuping Jiang, Nan Li, Jumin Deng, Pingyan Chen, Zhenyu Wu, Ying Wu","doi":"10.1177/09622802251414586","DOIUrl":"https://doi.org/10.1177/09622802251414586","url":null,"abstract":"<p><p>BackgroundHybrid clinical trial design integrates traditional randomized controlled trials (RCTs) with real-world data (RWD), aiming to enhance trial efficiency through dynamic incorporation of external data (External trial data and RWD). However, existing methods, such as the Meta-Analytic Predictive (MAP) Prior, exhibit serious limitations in controlling data heterogeneity, adjusting baseline discrepancies, and optimizing dynamic borrowing proportions. These limitations often introduce external bias or compromise evidence reliability, hindering their application in complex analyses like bridging trials and multi-regional clinical trials (MRCTs).ObjectiveThis study proposes a novel hybrid Bayesian framework, EQPS Robust MAP (rMAP), to address heterogeneity and bias in multi-source data integration. Its feasibility and robustness are validated through systematic simulations and retrospective case analyses, using two independent datasets to evaluate the effect of Risankizumab in patients with moderate-to-severe plaque psoriasis.Design and MethodsThe EQPS-rMAP method operates in three stages: (1) Eliminating baseline covariate discrepancies through propensity score stratification; (2) constructing stratum-specific MAP priors to dynamically adjust weights for external data; and (3) introducing equivalence probability weights to quantify data conflict risks. The study evaluates the method's performance across six simulated analyses (heterogeneity differences, baseline shifts, etc.), comparing it with traditional methods (MAP, PSMAP, Empirical Bayes MAP) in terms of estimation bias, type I error control, and sample size requirements. Real-world case analyses further validate its applicability.ResultsSimulations demonstrate that EQPS-rMAP maintains estimation robustness under considerable heterogeneity while reducing sample size demands and enhancing trial efficiency. Case analyses confirm its ability to control external bias while preserving high estimation accuracy compared to conventional approaches.ConclusionThe EQPS-rMAP method provides empirical evidence for the feasibility of hybrid clinical designs. Its methodological advancements-resolving baseline and heterogeneity conflicts through adaptive mechanisms-offer broader applicability for integrating external and RWD across diverse analyses, including bridging trials, MRCTs, and post-marketing studies.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251414586"},"PeriodicalIF":1.9,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146132856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monitoring time to event in registry data using CUSUMs based on relative survival models. 使用基于相对生存模型的CUSUMs监视注册表数据中的时间到事件。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-02-05 DOI: 10.1177/09622802251411540
Jimmy Huy Tran, Jan Terje Kvaløy, Hartwig Kørner

An aspect of interest in surveillance of diseases is whether the survival time distribution changes over time. By following data in health registries over time, this can be monitored, either in real time or retrospectively. With relevant risk factors registered, these can be taken into account in the monitoring as well. A challenge in monitoring survival times based on registry data is that the information related to cause of death might either be missing or uncertain. To quantify the burden of disease in such cases, relative survival methods can be used, where the total hazard is modelled as the population hazard plus the excess hazard due to the disease.We propose a cumulative sum (CUSUM) procedure for monitoring for changes in the survival time distribution in cases where the use of excess hazard models is relevant. The CUSUM chart is based on a survival log-likelihood ratio and extends previously suggested methods for monitoring of time to event data to the excess hazard setting. The procedure takes into account changes in the population risk over time, as well as changes in the excess hazard which is explained by observed covariates. Properties, challenges and an application to cancer registry data will be presented.

对疾病监测感兴趣的一个方面是生存时间分布是否随时间而变化。通过长期跟踪健康登记处的数据,可以实时或回顾性地监测这一点。在登记了相关的风险因素后,这些因素也可以在监测中加以考虑。根据登记数据监测生存时间的一个挑战是,与死亡原因有关的信息可能缺失或不确定。为了量化这种情况下的疾病负担,可以使用相对生存法,其中将总危险建模为人口危险加上疾病造成的超额危险。我们提出了一个累积和(CUSUM)程序,用于监测在使用过量风险模型相关的情况下生存时间分布的变化。CUSUM图表基于生存对数似然比,并将以前建议的监测时间到事件数据的方法扩展到过量危险设置。该程序考虑到人口风险随时间的变化,以及由观察到的协变量解释的超额危险的变化。将介绍癌症注册数据的特性、挑战和应用。
{"title":"Monitoring time to event in registry data using CUSUMs based on relative survival models.","authors":"Jimmy Huy Tran, Jan Terje Kvaløy, Hartwig Kørner","doi":"10.1177/09622802251411540","DOIUrl":"https://doi.org/10.1177/09622802251411540","url":null,"abstract":"<p><p>An aspect of interest in surveillance of diseases is whether the survival time distribution changes over time. By following data in health registries over time, this can be monitored, either in real time or retrospectively. With relevant risk factors registered, these can be taken into account in the monitoring as well. A challenge in monitoring survival times based on registry data is that the information related to cause of death might either be missing or uncertain. To quantify the burden of disease in such cases, relative survival methods can be used, where the total hazard is modelled as the population hazard plus the excess hazard due to the disease.We propose a cumulative sum (CUSUM) procedure for monitoring for changes in the survival time distribution in cases where the use of excess hazard models is relevant. The CUSUM chart is based on a survival log-likelihood ratio and extends previously suggested methods for monitoring of time to event data to the excess hazard setting. The procedure takes into account changes in the population risk over time, as well as changes in the excess hazard which is explained by observed covariates. Properties, challenges and an application to cancer registry data will be presented.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251411540"},"PeriodicalIF":1.9,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146126543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A non-proportional hazards cure model with an application to gastric cancer data analysis. 非比例风险治愈模型及其在胃癌数据分析中的应用。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-01-30 DOI: 10.1177/09622802251414429
N Balakrishnan, M Mar Fenoy, M Carmen Pardo

In many practical situations, some subjects may never experience the event of interest in their lifetime. These subjects are referred to as the cured or non-susceptible subjects. In the context of chronic disease treatment, this is referred to as a cure fraction. In this work, we extend the generalized time-dependent logistic (GTDL) model proposed by MacKenzie (1996) to a flexible family of models which accommodates not only non-proportional hazards, but also long-term survivors. Inferential methods are then developed for the proposed model and a Monte Carlo simulation study is also carried out to evaluate the performance of the model as well as the inferential method developed here. A real data example on gastric cancer is then used to illustrate the usefulness of the proposed model.

在许多实际情况下,一些受试者可能一辈子都没有经历过感兴趣的事件。这些受试者被称为治愈或非易感受试者。在慢性病治疗的背景下,这被称为治愈部分。在这项工作中,我们将MacKenzie(1996)提出的广义时间依赖逻辑(GTDL)模型扩展到一个灵活的模型家族,该模型不仅适用于非比例风险,而且适用于长期幸存者。然后为所提出的模型开发了推理方法,并进行了蒙特卡罗模拟研究,以评估模型的性能以及这里开发的推理方法。然后用一个真实的胃癌数据例子来说明所提出模型的有效性。
{"title":"A non-proportional hazards cure model with an application to gastric cancer data analysis.","authors":"N Balakrishnan, M Mar Fenoy, M Carmen Pardo","doi":"10.1177/09622802251414429","DOIUrl":"https://doi.org/10.1177/09622802251414429","url":null,"abstract":"<p><p>In many practical situations, some subjects may never experience the event of interest in their lifetime. These subjects are referred to as the cured or non-susceptible subjects. In the context of chronic disease treatment, this is referred to as a cure fraction. In this work, we extend the generalized time-dependent logistic (GTDL) model proposed by MacKenzie (1996) to a flexible family of models which accommodates not only non-proportional hazards, but also long-term survivors. Inferential methods are then developed for the proposed model and a Monte Carlo simulation study is also carried out to evaluate the performance of the model as well as the inferential method developed here. A real data example on gastric cancer is then used to illustrate the usefulness of the proposed model.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251414429"},"PeriodicalIF":1.9,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146094060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parametric and nonparametric propensity score weighting analysis with subgroup covariate balance. 亚组协变量平衡的参数和非参数倾向得分加权分析。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-01-29 DOI: 10.1177/09622802251415157
Yan Li, Yong-Fang Kuo, Liang Li

Estimating the causal treatment effects by subgroups is important in observational studies when the treatment effect heterogeneity is present. Existing propensity score methods rely on a correctly specified propensity score model. Model misspecification results in biased treatment effect estimation and covariate imbalance. We proposed a method for the propensity score analysis with controlled subgroup balance (G-SBPS) to achieve covariate mean balance in all subgroups. We further incorporated nonparametric kernel regression for the propensity scores and developed a kernelized G-SBPS (kG-SBPS) to improve the subgroup mean balance of covariate transformations in a rich functional class. This extension increased robustness to propensity score model misspecification. Extensive numerical studies showed that G-SBPS and kG-SBPS improve both subgroup covariate balance and subgroup treatment effect estimation, compared to existing approaches. For illustration, we applied G-SBPS and kG-SBPS to a dataset on right heart catheterization to estimate the subgroup average treatment effects on the hospital length of stay and a dataset on diabetes self-management training to estimate the subgroup average treatment effects for the treated on the hospitalization rate.

在观察性研究中,当治疗效果存在异质性时,按亚组估计因果治疗效果是很重要的。现有的倾向评分方法依赖于正确指定的倾向评分模型。模型不规范导致治疗效果估计偏倚和协变量失衡。我们提出了一种使用控制亚组平衡(G-SBPS)的倾向评分分析方法,以实现所有亚组的协变量均值平衡。我们进一步将非参数核回归纳入倾向得分,并开发了核化G-SBPS (kG-SBPS),以改善富函数类中协变量变换的子组平均平衡。这种扩展增加了对倾向评分模型错误规范的稳健性。大量的数值研究表明,与现有方法相比,G-SBPS和kG-SBPS改善了亚组协变量平衡和亚组治疗效果估计。举例来说,我们将G-SBPS和kG-SBPS应用于右心导管数据集,以估计亚组平均治疗效果对住院时间的影响,并将G-SBPS应用于糖尿病自我管理培训数据集,以估计亚组平均治疗效果对住院率的影响。
{"title":"Parametric and nonparametric propensity score weighting analysis with subgroup covariate balance.","authors":"Yan Li, Yong-Fang Kuo, Liang Li","doi":"10.1177/09622802251415157","DOIUrl":"https://doi.org/10.1177/09622802251415157","url":null,"abstract":"<p><p>Estimating the causal treatment effects by subgroups is important in observational studies when the treatment effect heterogeneity is present. Existing propensity score methods rely on a correctly specified propensity score model. Model misspecification results in biased treatment effect estimation and covariate imbalance. We proposed a method for the propensity score analysis with controlled subgroup balance (G-SBPS) to achieve covariate mean balance in all subgroups. We further incorporated nonparametric kernel regression for the propensity scores and developed a kernelized G-SBPS (kG-SBPS) to improve the subgroup mean balance of covariate transformations in a rich functional class. This extension increased robustness to propensity score model misspecification. Extensive numerical studies showed that G-SBPS and kG-SBPS improve both subgroup covariate balance and subgroup treatment effect estimation, compared to existing approaches. For illustration, we applied G-SBPS and kG-SBPS to a dataset on right heart catheterization to estimate the subgroup average treatment effects on the hospital length of stay and a dataset on diabetes self-management training to estimate the subgroup average treatment effects for the treated on the hospitalization rate.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251415157"},"PeriodicalIF":1.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian feature selection in joint models with application to a cardiovascular disease cohort study. 联合模型中的贝叶斯特征选择及其在心血管疾病队列研究中的应用
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-01-29 DOI: 10.1177/09622802251414939
Mirajul Islam, Michael J Daniels, Zeynab Aghabazaz, Juned Siddique

Cardiovascular disease (CVD) cohorts collect data longitudinally to study the association between CVD risk factors and event times. An important area of scientific research is to better understand what features of CVD risk factor trajectories are associated with CVD. We develop methods for feature selection in joint models where feature selection is viewed as a bi-level variable selection problem with multiple features nested within multiple longitudinal risk factors. We modify a previously proposed Bayesian sparse group selection (BSGS) prior, which has not been implemented in joint models until now, to better represent prior beliefs when selecting features both at the group level (longitudinal risk factor) and within group (features of a longitudinal risk factor). One of the advantages of our method over the BSGS method is its ability to account for correlation among the features within a risk factor. As a result, it selects important features similarly, but excludes unimportant features within risk factors more efficiently than the BSGS prior. We evaluate our prior via simulations and apply our method to data from the Atherosclerosis Risk in Communities (ARIC) study, a population-based, prospective cohort study consisting of over 15,000 men and women aged 45-64 at baseline who were measured six additional times. We evaluate which CVD risk factors and which characteristics of their trajectories (features) are associated with death from CVD. We find that systolic and diastolic blood pressure, glucose, and total cholesterol are important risk factors with different important features associated with CVD death in both men and women.

心血管疾病(CVD)队列纵向收集数据,研究CVD危险因素与事件时间之间的关系。科学研究的一个重要领域是更好地了解与心血管疾病相关的心血管疾病危险因素轨迹的特征。我们开发了联合模型中的特征选择方法,其中特征选择被视为双水平变量选择问题,多个特征嵌套在多个纵向风险因素中。我们修改了先前提出的贝叶斯稀疏组选择(BSGS)先验,该先验目前尚未在联合模型中实现,以便在选择组级(纵向风险因素)和组内(纵向风险因素的特征)特征时更好地表示先验信念。与BSGS方法相比,我们的方法的优点之一是它能够考虑风险因素中特征之间的相关性。因此,它同样选择了重要的特征,但比先前的BSGS更有效地排除了风险因素中的不重要特征。我们通过模拟评估我们的先验,并将我们的方法应用于社区动脉粥样硬化风险(ARIC)研究的数据,这是一项基于人群的前瞻性队列研究,包括超过15,000名年龄在45-64岁的男性和女性,他们在基线时被测量了6次。我们评估了哪些心血管疾病危险因素及其轨迹特征(特征)与心血管疾病死亡相关。我们发现收缩压和舒张压、葡萄糖和总胆固醇是与男性和女性CVD死亡相关的重要危险因素,具有不同的重要特征。
{"title":"Bayesian feature selection in joint models with application to a cardiovascular disease cohort study.","authors":"Mirajul Islam, Michael J Daniels, Zeynab Aghabazaz, Juned Siddique","doi":"10.1177/09622802251414939","DOIUrl":"https://doi.org/10.1177/09622802251414939","url":null,"abstract":"<p><p>Cardiovascular disease (CVD) cohorts collect data longitudinally to study the association between CVD risk factors and event times. An important area of scientific research is to better understand what features of CVD risk factor trajectories are associated with CVD. We develop methods for feature selection in joint models where feature selection is viewed as a bi-level variable selection problem with multiple features nested within multiple longitudinal risk factors. We modify a previously proposed Bayesian sparse group selection (BSGS) prior, which has not been implemented in joint models until now, to better represent prior beliefs when selecting features both at the group level (longitudinal risk factor) and within group (features of a longitudinal risk factor). One of the advantages of our method over the BSGS method is its ability to account for correlation among the features within a risk factor. As a result, it selects important features similarly, but excludes unimportant features within risk factors more efficiently than the BSGS prior. We evaluate our prior via simulations and apply our method to data from the Atherosclerosis Risk in Communities (ARIC) study, a population-based, prospective cohort study consisting of over 15,000 men and women aged 45-64 at baseline who were measured six additional times. We evaluate which CVD risk factors and which characteristics of their trajectories (features) are associated with death from CVD. We find that systolic and diastolic blood pressure, glucose, and total cholesterol are important risk factors with different important features associated with CVD death in both men and women.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251414939"},"PeriodicalIF":1.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discrimination performance in illness-death models with interval-censored disease data. 区间截除疾病数据的疾病-死亡模型的判别性能。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-01-29 DOI: 10.1177/09622802251412855
Marta Spreafico, Anja J Rueten-Budde, Hein Putter, Marta Fiocco

In clinical studies, the illness-death model is often used to describe disease progression. A subject starts disease-free, may develop the disease and then die, or die directly. In clinical practice, disease can only be diagnosed at pre-specified follow-up visits, so the exact time of disease onset is often unknown, resulting in interval-censored data. This study examines the impact of ignoring this interval-censored nature of disease data on the discrimination performance of illness-death models, focusing on the time-specific area under the receiver operating characteristic curve in both incident/dynamic and cumulative/dynamic definitions. A simulation study with data simulated from Weibull transition hazards and disease state censored at regular intervals is conducted. Estimates are derived using different methods: the Cox model with a time-dependent binary disease marker, which ignores interval-censoring, and the illness-death model for interval-censored data estimated with three implementations-the piecewise-constant model from the msm package, the Weibull and M-spline models from the SmoothHazard package. These methods are also applied to a dataset of 2232 patients with high-grade soft tissue sarcoma, where the interval-censored disease state is the post-operative development of distant metastases. The results suggest that, in the presence of interval-censored disease times, it is important to account for interval-censoring not only when estimating the parameters of the model but also when evaluating the discrimination performance of the disease.

在临床研究中,疾病-死亡模型常用于描述疾病进展。一个实验对象开始时没有疾病,可能会发展成疾病,然后死亡,或者直接死亡。在临床实践中,疾病只能在预先指定的随访中诊断出来,因此疾病发病的确切时间往往是未知的,导致数据的间隔审查。本研究考察了忽略疾病数据的这种间隔审查性质对疾病-死亡模型的判别性能的影响,重点关注在事件/动态和累积/动态定义中接收者工作特征曲线下的特定时间区域。利用威布尔过渡危害和疾病状态定期剔除的模拟数据进行了仿真研究。估计是使用不同的方法得出的:Cox模型具有时间依赖的二元疾病标记,它忽略了区间审查,疾病-死亡模型用于区间审查数据的估计,使用三种实现-来自msm包的分段常数模型,来自SmoothHazard包的威布尔和m样条模型。这些方法也应用于2232例高级别软组织肉瘤患者的数据集,其中间隔审查的疾病状态是手术后远处转移的发展。结果表明,在存在间隔审查的疾病时间的情况下,不仅在估计模型参数时,而且在评估疾病的识别性能时,考虑间隔审查是很重要的。
{"title":"Discrimination performance in illness-death models with interval-censored disease data.","authors":"Marta Spreafico, Anja J Rueten-Budde, Hein Putter, Marta Fiocco","doi":"10.1177/09622802251412855","DOIUrl":"https://doi.org/10.1177/09622802251412855","url":null,"abstract":"<p><p>In clinical studies, the illness-death model is often used to describe disease progression. A subject starts disease-free, may develop the disease and then die, or die directly. In clinical practice, disease can only be diagnosed at pre-specified follow-up visits, so the exact time of disease onset is often unknown, resulting in interval-censored data. This study examines the impact of ignoring this interval-censored nature of disease data on the discrimination performance of illness-death models, focusing on the time-specific area under the receiver operating characteristic curve in both incident/dynamic and cumulative/dynamic definitions. A simulation study with data simulated from Weibull transition hazards and disease state censored at regular intervals is conducted. Estimates are derived using different methods: the Cox model with a time-dependent binary disease marker, which ignores interval-censoring, and the illness-death model for interval-censored data estimated with three implementations-the piecewise-constant model from the <i>msm</i> package, the Weibull and M-spline models from the <i>SmoothHazard</i> package. These methods are also applied to a dataset of 2232 patients with high-grade soft tissue sarcoma, where the interval-censored disease state is the post-operative development of distant metastases. The results suggest that, in the presence of interval-censored disease times, it is important to account for interval-censoring not only when estimating the parameters of the model but also when evaluating the discrimination performance of the disease.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251412855"},"PeriodicalIF":1.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical methods for clustered competing risk data when the event types are only available in a training dataset. 当事件类型仅在训练数据集中可用时,聚类竞争风险数据的统计方法。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-01-29 DOI: 10.1177/09622802251415022
Yujie Wu, Ce Yang, Molin Wang

We develop methods to analyze clustered competing risks data when the event types are only available in a training dataset and are missing in the main study. We propose to estimate the exposure effects through the cause-specific proportional hazards frailty model where random effects are introduced into the model to account for the within-cluster correlation. We propose a weighted penalized partial likelihood method where the weights represent the probabilities of the occurrence of events, and the weights can be obtained by fitting a classification model for the event types on the training dataset. Alternatively, we propose an imputation approach where the missing event types are imputed based on the predictions from the classification model. We derive the analytical variances, and evaluate the finite sample properties of our methods in an extensive simulation study. As an illustrative example, we apply our methods to estimate the associations between tinnitus and metabolic, sensory and metabolic+sensory hearing loss in the Conservation of Hearing Study Audiology Assessment Arm.

当事件类型仅在训练数据集中可用而在主要研究中缺失时,我们开发了分析聚类竞争风险数据的方法。我们建议通过特定原因的比例风险脆弱性模型来估计暴露效应,其中将随机效应引入模型以解释簇内相关性。我们提出了一种加权惩罚部分似然方法,其中权重表示事件发生的概率,并且可以通过在训练数据集上拟合事件类型的分类模型来获得权重。或者,我们提出了一种估算方法,其中缺失的事件类型根据分类模型的预测进行估算。我们推导了分析方差,并在广泛的模拟研究中评估了我们的方法的有限样本性质。作为一个例子,我们在听力保护研究听力学评估部门应用我们的方法来估计耳鸣与代谢性、感觉性和代谢性+感觉性听力损失之间的关系。
{"title":"Statistical methods for clustered competing risk data when the event types are only available in a training dataset.","authors":"Yujie Wu, Ce Yang, Molin Wang","doi":"10.1177/09622802251415022","DOIUrl":"https://doi.org/10.1177/09622802251415022","url":null,"abstract":"<p><p>We develop methods to analyze clustered competing risks data when the event types are only available in a training dataset and are missing in the main study. We propose to estimate the exposure effects through the cause-specific proportional hazards frailty model where random effects are introduced into the model to account for the within-cluster correlation. We propose a weighted penalized partial likelihood method where the weights represent the probabilities of the occurrence of events, and the weights can be obtained by fitting a classification model for the event types on the training dataset. Alternatively, we propose an imputation approach where the missing event types are imputed based on the predictions from the classification model. We derive the analytical variances, and evaluate the finite sample properties of our methods in an extensive simulation study. As an illustrative example, we apply our methods to estimate the associations between tinnitus and metabolic, sensory and metabolic+sensory hearing loss in the Conservation of Hearing Study Audiology Assessment Arm.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251415022"},"PeriodicalIF":1.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Truncated Gaussian copula principal component analysis with application to pediatric acute lymphoblastic leukemia patients' gut microbiome. 截断高斯联结主成分分析在小儿急性淋巴细胞白血病患者肠道菌群中的应用。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-01-23 DOI: 10.1177/09622802251412844
Lei Wang, Yang Ni, Irina Gaynanova

Increasing epidemiologic evidence suggests that the diversity and composition of the gut microbiome can predict infection risk in cancer patients. Infections remain a major cause of morbidity and mortality during chemotherapy. Analyzing microbiome data to identify associations with infection pathogenesis for proactive treatment has become a critical research focus. However, the high-dimensional nature of the data necessitates the use of dimension-reduction methods to facilitate inference and interpretation. Traditional dimension reduction methods, which assume Gaussianity, perform poorly with skewed and zero-inflated microbiome data. To address these challenges, we propose a semiparametric principal component analysis method based on a truncated latent Gaussian copula model that accommodates both skewness and zero inflation. Simulation studies demonstrate that the proposed method outperforms existing approaches by providing more accurate estimates of scores and loadings across various copula transformation settings. We apply our method, along with competing approaches, to gut microbiome data from pediatric patients with acute lymphoblastic leukemia. The principal scores derived from the proposed method reveal the strongest associations between pre-chemotherapy microbiome composition and adverse events during subsequent chemotherapy, offering valuable insights for improving patient outcomes.

越来越多的流行病学证据表明,肠道微生物组的多样性和组成可以预测癌症患者的感染风险。感染仍然是化疗期间发病和死亡的主要原因。分析微生物组数据,以确定与感染发病机制的关联,积极治疗已成为一个关键的研究重点。然而,数据的高维性质需要使用降维方法来促进推理和解释。传统的降维方法假设了高斯性,对于扭曲和零膨胀的微生物组数据表现不佳。为了解决这些挑战,我们提出了一种基于截断隐高斯copula模型的半参数主成分分析方法,该方法可以同时适应偏度和零膨胀。仿真研究表明,所提出的方法优于现有的方法,提供了更准确的估计分数和负载在各种copula转换设置。我们将我们的方法,连同竞争的方法,应用于急性淋巴细胞白血病儿科患者的肠道微生物组数据。该方法得出的主要评分揭示了化疗前微生物组组成与随后化疗期间不良事件之间的最强关联,为改善患者预后提供了有价值的见解。
{"title":"Truncated Gaussian copula principal component analysis with application to pediatric acute lymphoblastic leukemia patients' gut microbiome.","authors":"Lei Wang, Yang Ni, Irina Gaynanova","doi":"10.1177/09622802251412844","DOIUrl":"https://doi.org/10.1177/09622802251412844","url":null,"abstract":"<p><p>Increasing epidemiologic evidence suggests that the diversity and composition of the gut microbiome can predict infection risk in cancer patients. Infections remain a major cause of morbidity and mortality during chemotherapy. Analyzing microbiome data to identify associations with infection pathogenesis for proactive treatment has become a critical research focus. However, the high-dimensional nature of the data necessitates the use of dimension-reduction methods to facilitate inference and interpretation. Traditional dimension reduction methods, which assume Gaussianity, perform poorly with skewed and zero-inflated microbiome data. To address these challenges, we propose a semiparametric principal component analysis method based on a truncated latent Gaussian copula model that accommodates both skewness and zero inflation. Simulation studies demonstrate that the proposed method outperforms existing approaches by providing more accurate estimates of scores and loadings across various copula transformation settings. We apply our method, along with competing approaches, to gut microbiome data from pediatric patients with acute lymphoblastic leukemia. The principal scores derived from the proposed method reveal the strongest associations between pre-chemotherapy microbiome composition and adverse events during subsequent chemotherapy, offering valuable insights for improving patient outcomes.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251412844"},"PeriodicalIF":1.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146030825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A fast integrative clustering and feature selection approach for high-dimensional multiview data. 高维多视图数据快速集成聚类和特征选择方法。
IF 1.9 3区 医学 Q3 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-01-21 DOI: 10.1177/09622802251406584
Abdalkarim Alnajjar, Helen Bian, Zihang Lu

Cluster analysis has been widely used in biomedical studies for disaggregating heterogeneous diseases and identifying disease subtypes that may inform clinical decisions. In the era of advanced data science and engineering, cluster analysis faces new challenges due to high dimensionality, multimodality and computational complexity. In the present study, we propose a fast integrative clustering approach based on variational Bayesian inference, called iClusterVB. The iClusterVB enables the integration of multiple datasets into the clustering process while performing feature selection in high-dimensional settings for mixed data types, including continuous, categorical, and count data. Simulation studies are performed to compare the performance of iClusterVB with six competing methods and highlight its advantages. Additionally, iClusterVB is applied to three real-life studies to demonstrate its utility in identifying important features and cancer subtypes that are associated with distinct survival probabilities. A user-friendly R package iClusterVB and a tutorial are developed to implement the proposed approach.

聚类分析已广泛应用于生物医学研究中,用于分解异质性疾病和识别疾病亚型,从而为临床决策提供信息。在先进的数据科学与工程时代,聚类分析因其高维数、多模态和计算复杂性而面临新的挑战。在本研究中,我们提出了一种基于变分贝叶斯推理的快速综合聚类方法,称为iClusterVB。iClusterVB支持将多个数据集集成到聚类过程中,同时在高维设置中为混合数据类型(包括连续数据、分类数据和计数数据)执行特征选择。通过仿真研究,比较了iClusterVB与六种竞争方法的性能,突出了其优势。此外,iClusterVB应用于三个现实生活中的研究,以证明其在识别与不同生存概率相关的重要特征和癌症亚型方面的实用性。开发了一个用户友好的R包iClusterVB和一个教程来实现所提出的方法。
{"title":"A fast integrative clustering and feature selection approach for high-dimensional multiview data.","authors":"Abdalkarim Alnajjar, Helen Bian, Zihang Lu","doi":"10.1177/09622802251406584","DOIUrl":"https://doi.org/10.1177/09622802251406584","url":null,"abstract":"<p><p>Cluster analysis has been widely used in biomedical studies for disaggregating heterogeneous diseases and identifying disease subtypes that may inform clinical decisions. In the era of advanced data science and engineering, cluster analysis faces new challenges due to high dimensionality, multimodality and computational complexity. In the present study, we propose a fast integrative clustering approach based on variational Bayesian inference, called iClusterVB. The iClusterVB enables the integration of multiple datasets into the clustering process while performing feature selection in high-dimensional settings for mixed data types, including continuous, categorical, and count data. Simulation studies are performed to compare the performance of iClusterVB with six competing methods and highlight its advantages. Additionally, iClusterVB is applied to three real-life studies to demonstrate its utility in identifying important features and cancer subtypes that are associated with distinct survival probabilities. A user-friendly R package iClusterVB and a tutorial are developed to implement the proposed approach.</p>","PeriodicalId":22038,"journal":{"name":"Statistical Methods in Medical Research","volume":" ","pages":"9622802251406584"},"PeriodicalIF":1.9,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146019726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Methods in Medical Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1