首页 > 最新文献

Biometrical Journal最新文献

英文 中文
Random Survival Forests With Competing Events: A Subdistribution-Based Imputation Approach 具有竞争事件的随机生存森林:基于子分布的估算方法
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-20 DOI: 10.1002/bimj.202400014
Charlotte Behning, Alexander Bigerl, Marvin N. Wright, Peggy Sekula, Moritz Berger, Matthias Schmid

Random survival forests (RSF) can be applied to many time-to-event research questions and are particularly useful in situations where the relationship between the independent variables and the event of interest is rather complex. However, in many clinical settings, the occurrence of the event of interest is affected by competing events, which means that a patient can experience an outcome other than the event of interest. Neglecting the competing event (i.e., regarding competing events as censoring) will typically result in biased estimates of the cumulative incidence function (CIF). A popular approach for competing events is Fine and Gray's subdistribution hazard model, which directly estimates the CIF by fitting a single-event model defined on a subdistribution timescale. Here, we integrate concepts from the subdistribution hazard modeling approach into the RSF. We develop several imputation strategies that use weights as in a discrete-time subdistribution hazard model to impute censoring times in cases where a competing event is observed. Our simulations show that the CIF is well estimated if the imputation already takes place outside the forest on the overall dataset. Especially in settings with a low rate of the event of interest or a high censoring rate, competing events must not be neglected, that is, treated as censoring. When applied to a real-world epidemiological dataset on chronic kidney disease, the imputation approach resulted in highly plausible predictor–response relationships and CIF estimates of renal events.

随机生存森林(RSF)可应用于许多从时间到事件的研究问题,尤其适用于自变量与相关事件之间关系相当复杂的情况。然而,在许多临床环境中,相关事件的发生会受到竞争事件的影响,这意味着患者可能会经历除相关事件之外的其他结果。忽略竞争事件(即把竞争事件视为普查)通常会导致对累积发病率函数(CIF)的估计出现偏差。针对竞争事件的一种流行方法是 Fine 和 Gray 的子分布危险模型,该模型通过拟合定义在子分布时间尺度上的单一事件模型来直接估计 CIF。在此,我们将亚分布危害建模方法的概念整合到 RSF 中。我们开发了几种估算策略,在观测到竞争事件的情况下,使用离散时间子分布危害模型中的权重来估算删减时间。我们的模拟结果表明,如果在整个数据集上的森林外已经进行了估算,那么 CIF 就能得到很好的估计。特别是在相关事件发生率较低或剔除率较高的情况下,竞争事件不应被忽视,即应被视为剔除事件。在应用于真实世界的慢性肾病流行病学数据集时,估算方法得出了高度可信的预测因子-响应关系和肾病事件的 CIF 估计值。
{"title":"Random Survival Forests With Competing Events: A Subdistribution-Based Imputation Approach","authors":"Charlotte Behning,&nbsp;Alexander Bigerl,&nbsp;Marvin N. Wright,&nbsp;Peggy Sekula,&nbsp;Moritz Berger,&nbsp;Matthias Schmid","doi":"10.1002/bimj.202400014","DOIUrl":"10.1002/bimj.202400014","url":null,"abstract":"<p>Random survival forests (RSF) can be applied to many time-to-event research questions and are particularly useful in situations where the relationship between the independent variables and the event of interest is rather complex. However, in many clinical settings, the occurrence of the event of interest is affected by competing events, which means that a patient can experience an outcome other than the event of interest. Neglecting the competing event (i.e., regarding competing events as censoring) will typically result in biased estimates of the cumulative incidence function (CIF). A popular approach for competing events is Fine and Gray's subdistribution hazard model, which directly estimates the CIF by fitting a single-event model defined on a subdistribution timescale. Here, we integrate concepts from the subdistribution hazard modeling approach into the RSF. We develop several imputation strategies that use weights as in a discrete-time subdistribution hazard model to impute censoring times in cases where a competing event is observed. Our simulations show that the CIF is well estimated if the imputation already takes place outside the forest on the overall dataset. Especially in settings with a low rate of the event of interest or a high censoring rate, competing events must not be neglected, that is, treated as censoring. When applied to a real-world epidemiological dataset on chronic kidney disease, the imputation approach resulted in highly plausible predictor–response relationships and CIF estimates of renal events.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202400014","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142005984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric Additive Modeling of the Restricted Mean Survival Time 受限平均生存时间的半参数加法模型
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-16 DOI: 10.1002/bimj.202200371
Yuan Zhang, Douglas E. Schaubel

Analysis of the restricted mean survival time (RMST) has become increasingly common in biomedical studies during the last decade as a means of estimating treatment or covariate effects on survival. Advantages of RMST over the hazard ratio (HR) include increased interpretability and lack of reliance on the often tenuous proportional hazards assumption. Some authors have argued that RMST regression should generally be the frontline analysis as opposed to methods based on counting process increments. However, in order for the use of the RMST to be more mainstream, it is necessary to broaden the range of data structures to which pertinent methods can be applied. In this report, we address this issue from two angles. First, most of existing methodological development for directly modeling RMST has focused on multiplicative models. An additive model may be preferred due to goodness of fit and/or parameter interpretation. Second, many settings encountered nowadays feature high-dimensional categorical (nuisance) covariates, for which parameter estimation is best avoided. Motivated by these considerations, we propose stratified additive models for direct RMST analysis. The proposed methods feature additive covariate effects. Moreover, nuisance factors can be factored out of the estimation, akin to stratification in Cox regression, such that focus can be appropriately awarded to the parameters of chief interest. Large-sample properties of the proposed estimators are derived, and a simulation study is performed to assess finite-sample performance. In addition, we provide techniques for evaluating a fitted model with respect to risk discrimination and predictive accuracy. The proposed methods are then applied to liver transplant data to estimate the effects of donor characteristics on posttransplant survival time.

在过去的十年中,受限平均生存时间(RMST)分析在生物医学研究中越来越普遍,成为估计治疗或共变量对生存影响的一种手段。与危险比(HR)相比,RMST 的优点包括可解释性更强,而且不依赖于往往很脆弱的比例危险假设。一些学者认为,RMST 回归通常应作为一线分析方法,而不是基于计数过程增量的方法。然而,为了使 RMST 的使用更加主流化,有必要扩大可应用相关方法的数据结构的范围。在本报告中,我们将从两个角度探讨这一问题。首先,现有的直接建立 RMST 模型的方法大多集中在乘法模型上。出于拟合度和/或参数解释的考虑,加法模型可能更受欢迎。其次,目前遇到的许多情况都具有高维分类(滋扰)协变量,最好避免对其进行参数估计。基于这些考虑,我们提出了用于直接 RMST 分析的分层加法模型。所提出的方法具有协变量的加性效应。此外,干扰因素可以从估计中剔除,类似于 Cox 回归中的分层,这样就可以将重点适当地放在主要相关参数上。我们推导出了所建议估计器的大样本特性,并进行了模拟研究以评估有限样本性能。此外,我们还提供了评估拟合模型的风险判别和预测准确性的技术。然后将所提出的方法应用于肝脏移植数据,以估计捐赠者特征对移植后存活时间的影响。
{"title":"Semiparametric Additive Modeling of the Restricted Mean Survival Time","authors":"Yuan Zhang,&nbsp;Douglas E. Schaubel","doi":"10.1002/bimj.202200371","DOIUrl":"10.1002/bimj.202200371","url":null,"abstract":"<div>\u0000 \u0000 <p>Analysis of the restricted mean survival time (RMST) has become increasingly common in biomedical studies during the last decade as a means of estimating treatment or covariate effects on survival. Advantages of RMST over the hazard ratio (HR) include increased interpretability and lack of reliance on the often tenuous proportional hazards assumption. Some authors have argued that RMST regression should generally be the frontline analysis as opposed to methods based on counting process increments. However, in order for the use of the RMST to be more mainstream, it is necessary to broaden the range of data structures to which pertinent methods can be applied. In this report, we address this issue from two angles. First, most of existing methodological development for directly modeling RMST has focused on multiplicative models. An additive model may be preferred due to goodness of fit and/or parameter interpretation. Second, many settings encountered nowadays feature high-dimensional categorical (nuisance) covariates, for which parameter estimation is best avoided. Motivated by these considerations, we propose stratified additive models for direct RMST analysis. The proposed methods feature additive covariate effects. Moreover, nuisance factors can be factored out of the estimation, akin to stratification in Cox regression, such that focus can be appropriately awarded to the parameters of chief interest. Large-sample properties of the proposed estimators are derived, and a simulation study is performed to assess finite-sample performance. In addition, we provide techniques for evaluating a fitted model with respect to risk discrimination and predictive accuracy. The proposed methods are then applied to liver transplant data to estimate the effects of donor characteristics on posttransplant survival time.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141989558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample Size Calculation Under Nonproportional Hazards Using Average Hazard Ratios 使用平均危险比计算非比例危险下的样本量。
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-12 DOI: 10.1002/bimj.202300271
Ina Dormuth, Markus Pauly, Geraldine Rauch, Carolin Herrmann

Many clinical trials assess time-to-event endpoints. To describe the difference between groups in terms of time to event, we often employ hazard ratios. However, the hazard ratio is only informative in the case of proportional hazards (PHs) over time. There exist many other effect measures that do not require PHs. One of them is the average hazard ratio (AHR). Its core idea is to utilize a time-dependent weighting function that accounts for time variation. Though propagated in methodological research papers, the AHR is rarely used in practice. To facilitate its application, we unfold approaches for sample size calculation of an AHR test. We assess the reliability of the sample size calculation by extensive simulation studies covering various survival and censoring distributions with proportional as well as nonproportional hazards (N-PHs). The findings suggest that a simulation-based sample size calculation approach can be useful for designing clinical trials with N-PHs. Using the AHR can result in increased statistical power to detect differences between groups with more efficient sample sizes.

许多临床试验都会评估从时间到事件的终点。为了描述不同组别在事件发生时间上的差异,我们通常采用危险比。然而,危害比只有在时间比例危害(PHs)的情况下才有参考价值。还有许多其他不需要 PH 的效应测量方法。平均危险比(AHR)就是其中之一。其核心理念是利用随时间变化的加权函数来考虑时间变化。虽然 AHR 在方法论研究论文中广为传播,但在实践中却很少使用。为了便于应用,我们展开了 AHR 检验的样本量计算方法。我们通过广泛的模拟研究评估了样本量计算的可靠性,这些模拟研究涵盖了各种生存和剔除分布,以及比例和非比例危害(N-PHs)。研究结果表明,基于模拟的样本量计算方法可用于设计 N-PHs 临床试验。使用 AHR 可以提高统计能力,以更有效的样本量发现组间差异。
{"title":"Sample Size Calculation Under Nonproportional Hazards Using Average Hazard Ratios","authors":"Ina Dormuth,&nbsp;Markus Pauly,&nbsp;Geraldine Rauch,&nbsp;Carolin Herrmann","doi":"10.1002/bimj.202300271","DOIUrl":"10.1002/bimj.202300271","url":null,"abstract":"<p>Many clinical trials assess time-to-event endpoints. To describe the difference between groups in terms of time to event, we often employ hazard ratios. However, the hazard ratio is only informative in the case of proportional hazards (PHs) over time. There exist many other effect measures that do not require PHs. One of them is the average hazard ratio (AHR). Its core idea is to utilize a time-dependent weighting function that accounts for time variation. Though propagated in methodological research papers, the AHR is rarely used in practice. To facilitate its application, we unfold approaches for sample size calculation of an AHR test. We assess the reliability of the sample size calculation by extensive simulation studies covering various survival and censoring distributions with proportional as well as nonproportional hazards (N-PHs). The findings suggest that a simulation-based sample size calculation approach can be useful for designing clinical trials with N-PHs. Using the AHR can result in increased statistical power to detect differences between groups with more efficient sample sizes.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300271","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141918192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Multiple Comparisons With the Best 最佳自适应多重比较。
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-10 DOI: 10.1002/bimj.202300242
Haoyu Chen, Werner Brannath, Andreas Futschik

Subset selection methods aim to choose a nonempty subset of populations including a best population with some prespecified probability. An example application involves location parameters that quantify yields in agriculture to select the best wheat variety. This is quite different from variable selection problems, for instance, in regression.

Unfortunately, subset selection methods can become very conservative when the parameter configuration is not least favorable. This will lead to a selection of many non-best populations, making the set of selected populations less informative. To solve this issue, we propose less conservative adaptive approaches based on estimating the number of best populations. We also discuss variants of our adaptive approaches that are applicable when the sample sizes and/or variances differ between populations. Using simulations, we show that our methods yield a desirable performance. As an illustration of potential gains, we apply them to two real datasets, one on the yield of wheat varieties and the other obtained via genome sequencing of repeated samples.

子集选择方法的目的是选择一个非空的种群子集,其中包括具有某种预设概率的最佳种群。一个应用实例涉及农业中量化产量的位置参数,以选择最佳的小麦品种。这与回归等变量选择问题截然不同。遗憾的是,当参数配置不是最有利时,子集选择方法会变得非常保守。这将导致选择许多非最佳种群,从而使所选种群集的信息量减少。为了解决这个问题,我们提出了基于估计最佳种群数量的不太保守的自适应方法。我们还讨论了适应性方法的变体,这些变体适用于样本大小和/或种群间方差不同的情况。通过模拟,我们证明我们的方法具有理想的性能。为了说明潜在的收益,我们将这些方法应用于两个真实数据集,一个是关于小麦品种产量的数据集,另一个是通过重复样本的基因组测序获得的数据集。
{"title":"Adaptive Multiple Comparisons With the Best","authors":"Haoyu Chen,&nbsp;Werner Brannath,&nbsp;Andreas Futschik","doi":"10.1002/bimj.202300242","DOIUrl":"10.1002/bimj.202300242","url":null,"abstract":"<p>Subset selection methods aim to choose a nonempty subset of populations including a best population with some prespecified probability. An example application involves location parameters that quantify yields in agriculture to select the best wheat variety. This is quite different from variable selection problems, for instance, in regression.</p><p>Unfortunately, subset selection methods can become very conservative when the parameter configuration is not least favorable. This will lead to a selection of many non-best populations, making the set of selected populations less informative. To solve this issue, we propose less conservative adaptive approaches based on estimating the number of best populations. We also discuss variants of our adaptive approaches that are applicable when the sample sizes and/or variances differ between populations. Using simulations, we show that our methods yield a desirable performance. As an illustration of potential gains, we apply them to two real datasets, one on the yield of wheat varieties and the other obtained via genome sequencing of repeated samples.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300242","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141914669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Mixture Model With Cure Rate Applied to Breast Cancer Data 应用于乳腺癌数据的带有治愈率的新混合模型。
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-05 DOI: 10.1002/bimj.202300257
Diego I. Gallardo, Márcia Brandão, Jeremias Leão, Marcelo Bourguignon, Vinicius Calsavara

We introduce a new modelling for long-term survival models, assuming that the number of competing causes follows a mixture of Poisson and the Birnbaum-Saunders distribution. In this context, we present some statistical properties of our model and demonstrate that the promotion time model emerges as a limiting case. We delve into detailed discussions of specific models within this class. Notably, we examine the expected number of competing causes, which depends on covariates. This allows for direct modeling of the cure rate as a function of covariates. We present an Expectation-Maximization (EM) algorithm for parameter estimation, to discuss the estimation via maximum likelihood (ML) and provide insights into parameter inference for this model. Additionally, we outline sufficient conditions for ensuring the consistency and asymptotic normal distribution of ML estimators. To evaluate the performance of our estimation method, we conduct a Monte Carlo simulation to provide asymptotic properties and a power study of LR test by contrasting our methodology against the promotion time model. To demonstrate the practical applicability of our model, we apply it to a real medical dataset from a population-based study of incidence of breast cancer in São Paulo, Brazil. Our results illustrate that the proposed model can outperform traditional approaches in terms of model fitting, highlighting its potential utility in real-world scenarios.

我们为长期生存模型引入了一种新的建模方法,假定竞争原因的数量服从泊松分布和伯恩鲍姆-桑德斯分布的混合分布。在此背景下,我们介绍了模型的一些统计特性,并证明晋升时间模型是一种极限情况。我们将详细讨论该类模型中的具体模型。值得注意的是,我们研究了竞争原因的预期数量,这取决于协变量。这样就可以将治愈率作为协变量的函数直接建模。我们提出了一种用于参数估计的期望最大化(EM)算法,以讨论通过最大似然法(ML)进行的估计,并为该模型的参数推断提供见解。此外,我们还概述了确保最大似然估计值一致性和渐近正态分布的充分条件。为了评估我们的估计方法的性能,我们进行了蒙特卡罗模拟,以提供渐近特性,并通过将我们的方法与晋升时间模型进行对比,对 LR 检验进行了功率研究。为了证明模型的实际应用性,我们将其应用于巴西圣保罗乳腺癌发病率人群研究的真实医疗数据集。我们的结果表明,所提出的模型在模型拟合方面优于传统方法,突出了其在现实世界中的潜在用途。
{"title":"A New Mixture Model With Cure Rate Applied to Breast Cancer Data","authors":"Diego I. Gallardo,&nbsp;Márcia Brandão,&nbsp;Jeremias Leão,&nbsp;Marcelo Bourguignon,&nbsp;Vinicius Calsavara","doi":"10.1002/bimj.202300257","DOIUrl":"10.1002/bimj.202300257","url":null,"abstract":"<div>\u0000 \u0000 <p>We introduce a new modelling for long-term survival models, assuming that the number of competing causes follows a mixture of Poisson and the Birnbaum-Saunders distribution. In this context, we present some statistical properties of our model and demonstrate that the promotion time model emerges as a limiting case. We delve into detailed discussions of specific models within this class. Notably, we examine the expected number of competing causes, which depends on covariates. This allows for direct modeling of the cure rate as a function of covariates. We present an Expectation-Maximization (EM) algorithm for parameter estimation, to discuss the estimation via maximum likelihood (ML) and provide insights into parameter inference for this model. Additionally, we outline sufficient conditions for ensuring the consistency and asymptotic normal distribution of ML estimators. To evaluate the performance of our estimation method, we conduct a Monte Carlo simulation to provide asymptotic properties and a power study of LR test by contrasting our methodology against the promotion time model. To demonstrate the practical applicability of our model, we apply it to a real medical dataset from a population-based study of incidence of breast cancer in São Paulo, Brazil. Our results illustrate that the proposed model can outperform traditional approaches in terms of model fitting, highlighting its potential utility in real-world scenarios.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141894971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Health Care Provider Clustering Using Fusion Penalty in Quasi-Likelihood 利用准可能性中的融合惩罚对医疗服务提供者进行聚类。
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-05 DOI: 10.1002/bimj.202300185
Lili Liu, Kevin He, Di Wang, Shujie Ma, Annie Qu, Yihui Luan, J. Philip Miller, Yizhe Song, Lei Liu

There has been growing research interest in developing methodology to evaluate the health care providers' performance with respect to a patient outcome. Random and fixed effects models are traditionally used for such a purpose. We propose a new method, using a fusion penalty to cluster health care providers based on quasi-likelihood. Without any priori knowledge of grouping information, our method provides a desirable data-driven approach for automatically clustering health care providers into different groups based on their performance. Further, the quasi-likelihood is more flexible and robust than the regular likelihood in that no distributional assumption is needed. An efficient alternating direction method of multipliers algorithm is developed to implement the proposed method. We show that the proposed method enjoys the oracle properties; namely, it performs as well as if the true group structure were known in advance. The consistency and asymptotic normality of the estimators are established. Simulation studies and analysis of the national kidney transplant registry data demonstrate the utility and validity of our method.

研究人员对开发评估医疗服务提供者在患者治疗结果方面表现的方法越来越感兴趣。传统上,随机效应和固定效应模型被用于此目的。我们提出了一种新方法,使用融合惩罚来根据准可能性对医疗服务提供者进行分组。在不预先了解分组信息的情况下,我们的方法提供了一种理想的数据驱动方法,可根据医疗服务提供者的表现自动将其分为不同的组别。此外,准似然法比常规似然法更灵活、更稳健,因为它不需要分布假设。为了实现所提出的方法,我们开发了一种高效的交替方向乘法算法。我们证明了所提出的方法具有神谕特性,即它的性能与事先已知的真实群体结构一样好。我们还确定了估计值的一致性和渐近正态性。模拟研究和全国肾移植登记数据分析证明了我们方法的实用性和有效性。
{"title":"Health Care Provider Clustering Using Fusion Penalty in Quasi-Likelihood","authors":"Lili Liu,&nbsp;Kevin He,&nbsp;Di Wang,&nbsp;Shujie Ma,&nbsp;Annie Qu,&nbsp;Yihui Luan,&nbsp;J. Philip Miller,&nbsp;Yizhe Song,&nbsp;Lei Liu","doi":"10.1002/bimj.202300185","DOIUrl":"10.1002/bimj.202300185","url":null,"abstract":"<div>\u0000 \u0000 <p>There has been growing research interest in developing methodology to evaluate the health care providers' performance with respect to a patient outcome. Random and fixed effects models are traditionally used for such a purpose. We propose a new method, using a fusion penalty to cluster health care providers based on quasi-likelihood. Without any priori knowledge of grouping information, our method provides a desirable data-driven approach for automatically clustering health care providers into different groups based on their performance. Further, the quasi-likelihood is more flexible and robust than the regular likelihood in that no distributional assumption is needed. An efficient alternating direction method of multipliers algorithm is developed to implement the proposed method. We show that the proposed method enjoys the oracle properties; namely, it performs as well as if the true group structure were known in advance. The consistency and asymptotic normality of the estimators are established. Simulation studies and analysis of the national kidney transplant registry data demonstrate the utility and validity of our method.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141890997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Nonconcurrent Controls in Adaptive Platform Trials: Separating Randomized and Nonrandomized Information 自适应平台试验中的非同期对照分析:分离随机和非随机信息。
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-08-05 DOI: 10.1002/bimj.202300334
Ian C. Marschner, I. Manjula Schou

Adaptive platform trials allow treatments to be added or dropped during the study, meaning that the control arm may be active for longer than the experimental arms. This leads to nonconcurrent controls, which provide nonrandomized information that may increase efficiency but may introduce bias from temporal confounding and other factors. Various methods have been proposed to control confounding from nonconcurrent controls, based on adjusting for time period. We demonstrate that time adjustment is insufficient to prevent bias in some circumstances where nonconcurrent controls are present in adaptive platform trials, and we propose a more general analytical framework that accounts for nonconcurrent controls in such circumstances. We begin by defining nonconcurrent controls using the concept of a concurrently randomized cohort, which is a subgroup of participants all subject to the same randomized design. We then use cohort adjustment rather than time adjustment. Due to flexibilities in platform trials, more than one randomized design may be in force at any time, meaning that cohort-adjusted and time-adjusted analyses may be quite different. Using simulation studies, we demonstrate that time-adjusted analyses may be biased while cohort-adjusted analyses remove this bias. We also demonstrate that the cohort-adjusted analysis may be interpreted as a synthesis of randomized and indirect comparisons analogous to mixed treatment comparisons in network meta-analysis. This allows the use of network meta-analysis methodology to separate the randomized and nonrandomized components and to assess their consistency. Whenever nonconcurrent controls are used in platform trials, the separate randomized and indirect contributions to the treatment effect should be presented.

自适应平台试验允许在研究过程中增加或减少治疗,这意味着对照组的活动时间可能长于实验组。这就产生了非同期对照组,它们提供的非随机信息可能会提高效率,但也可能带来时间混杂和其他因素造成的偏差。人们提出了各种方法来控制来自非同期对照的混杂因素,这些方法基于对时间段的调整。我们证明,在自适应平台试验中存在非同期对照的某些情况下,时间调整不足以防止偏差,因此我们提出了一个更通用的分析框架,在这种情况下考虑非同期对照。首先,我们使用同期随机队列的概念来定义非同期对照组,即采用相同随机设计的参试者子群。然后,我们使用队列调整而不是时间调整。由于平台试验的灵活性,任何时候都可能有一种以上的随机设计在实施,这意味着队列调整分析和时间调整分析可能会有很大不同。通过模拟研究,我们证明了时间调整分析可能存在偏差,而队列调整分析可以消除这种偏差。我们还证明,队列调整分析可解释为随机和间接比较的综合,类似于网络荟萃分析中的混合治疗比较。这样就可以使用网络荟萃分析方法将随机和非随机成分分开,并评估其一致性。只要在平台试验中使用了非同期对照,就应分别列出随机和间接对治疗效果的贡献。
{"title":"Analysis of Nonconcurrent Controls in Adaptive Platform Trials: Separating Randomized and Nonrandomized Information","authors":"Ian C. Marschner,&nbsp;I. Manjula Schou","doi":"10.1002/bimj.202300334","DOIUrl":"10.1002/bimj.202300334","url":null,"abstract":"<p>Adaptive platform trials allow treatments to be added or dropped during the study, meaning that the control arm may be active for longer than the experimental arms. This leads to nonconcurrent controls, which provide nonrandomized information that may increase efficiency but may introduce bias from temporal confounding and other factors. Various methods have been proposed to control confounding from nonconcurrent controls, based on adjusting for time period. We demonstrate that time adjustment is insufficient to prevent bias in some circumstances where nonconcurrent controls are present in adaptive platform trials, and we propose a more general analytical framework that accounts for nonconcurrent controls in such circumstances. We begin by defining nonconcurrent controls using the concept of a concurrently randomized cohort, which is a subgroup of participants all subject to the same randomized design. We then use cohort adjustment rather than time adjustment. Due to flexibilities in platform trials, more than one randomized design may be in force at any time, meaning that cohort-adjusted and time-adjusted analyses may be quite different. Using simulation studies, we demonstrate that time-adjusted analyses may be biased while cohort-adjusted analyses remove this bias. We also demonstrate that the cohort-adjusted analysis may be interpreted as a synthesis of randomized and indirect comparisons analogous to mixed treatment comparisons in network meta-analysis. This allows the use of network meta-analysis methodology to separate the randomized and nonrandomized components and to assess their consistency. Whenever nonconcurrent controls are used in platform trials, the separate randomized and indirect contributions to the treatment effect should be presented.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202300334","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141894972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MTML: An Efficient Multitrait Multilocus GWAS Method Based on the Cauchy Combination Test MTML:基于考奇组合检验的高效多特征多焦点 GWAS 方法
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-30 DOI: 10.1002/bimj.202300130
Hongping Guo, Tong Li, Yao Shi, Xiao Wang

Genome-wide association study (GWAS) by measuring the joint effect of multiple loci on multiple traits, has recently attracted interest, due to the decreased costs of high-throughput genotyping and phenotyping technologies. Previous studies mainly focused on either multilocus models that identify associations with a single trait or multitrait models that scan a single marker at a time. Since these types of models cannot fully utilize the association information, the powers of the tests are usually low. To potentially address this problem, we present here a multitrait multilocus (MTML) modeling framework that implements in three steps: (1) simplify the complex calculation; (2) reduce the model dimension; (3) integrate the joint contribution of single markers to multiple traits by Cauchy combination. The performances of MTML are evaluated and compared with other three published methods by Monte Carlo simulations. Simulation results show that MTML is more powerful for quantitative trait nucleotide detection and robust for various numbers of traits. In the meanwhile, MTML can effectively control type I error rate at a reasonable level. Real data analysis of Arabidopsis thaliana shows that MTML identifies more pleiotropic genetic associations. Therefore, we conclude that MTML is an efficient GWAS method for joint analysis of multiple quantitative traits. The R package MTML, which facilitates the implementation of the proposed method, is publicly available on GitHub https://github.com/Guohongping/MTML.

全基因组关联研究(GWAS)通过测量多个基因位点对多个性状的联合效应,最近引起了人们的兴趣,原因是高通量基因分型和表型技术的成本降低了。以往的研究主要集中在确定与单个性状关联的多位点模型或一次扫描单个标记的多性状模型。由于这些类型的模型不能充分利用关联信息,因此检验的功率通常较低。为了有可能解决这个问题,我们在此提出了一个多性状多焦点(MTML)建模框架,该框架分三步实现:(1) 简化复杂的计算;(2) 减少模型维度;(3) 通过考奇组合整合单个标记对多个性状的联合贡献。通过蒙特卡罗模拟,对 MTML 的性能进行了评估,并与其他三种已发布的方法进行了比较。模拟结果表明,MTML 在定量性状核苷酸检测方面更强大,而且对不同数量的性状具有鲁棒性。同时,MTML 能有效地将 I 型错误率控制在合理水平。拟南芥的真实数据分析显示,MTML 能识别更多的多向遗传关联。因此,我们认为 MTML 是一种高效的 GWAS 方法,可用于多个数量性状的联合分析。MTML 的 R 软件包可在 GitHub https://github.com/Guohongping/MTML 上公开获取,该软件包有助于实现所提出的方法。
{"title":"MTML: An Efficient Multitrait Multilocus GWAS Method Based on the Cauchy Combination Test","authors":"Hongping Guo,&nbsp;Tong Li,&nbsp;Yao Shi,&nbsp;Xiao Wang","doi":"10.1002/bimj.202300130","DOIUrl":"10.1002/bimj.202300130","url":null,"abstract":"<div>\u0000 \u0000 <p>Genome-wide association study (GWAS) by measuring the joint effect of multiple loci on multiple traits, has recently attracted interest, due to the decreased costs of high-throughput genotyping and phenotyping technologies. Previous studies mainly focused on either multilocus models that identify associations with a single trait or multitrait models that scan a single marker at a time. Since these types of models cannot fully utilize the association information, the powers of the tests are usually low. To potentially address this problem, we present here a multitrait multilocus (MTML) modeling framework that implements in three steps: (1) simplify the complex calculation; (2) reduce the model dimension; (3) integrate the joint contribution of single markers to multiple traits by Cauchy combination. The performances of MTML are evaluated and compared with other three published methods by Monte Carlo simulations. Simulation results show that MTML is more powerful for quantitative trait nucleotide detection and robust for various numbers of traits. In the meanwhile, MTML can effectively control type I error rate at a reasonable level. Real data analysis of <i>Arabidopsis thaliana</i> shows that MTML identifies more pleiotropic genetic associations. Therefore, we conclude that MTML is an efficient GWAS method for joint analysis of multiple quantitative traits. The R package MTML, which facilitates the implementation of the proposed method, is publicly available on GitHub https://github.com/Guohongping/MTML.</p>\u0000 </div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141794125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Factor-Analytic Variance–Covariance Structures for Prediction Into a Target Population of Environments 用于预测目标环境人群的因子分析方差-协方差结构。
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-25 DOI: 10.1002/bimj.202400008
Hans-Peter Piepho, Emlyn Williams

Finlay–Wilkinson regression is a popular method for modeling genotype–environment interaction in plant breeding and crop variety testing. When environment is a random factor, this model may be cast as a factor-analytic variance–covariance structure, implying a regression on random latent environmental variables. This paper reviews such models with a focus on their use in the analysis of multi-environment trials for the purpose of making predictions in a target population of environments. We investigate the implication of random versus fixed effects assumptions, starting from basic analysis-of-variance models, then moving on to factor-analytic models and considering the transition to models involving observable environmental covariates, which promise to provide more accurate and targeted predictions than models with latent environmental variables.

芬莱-威尔金森回归法是植物育种和作物品种测试中模拟基因型-环境交互作用的常用方法。当环境是一个随机因素时,该模型可被视为一个因素分析方差-协方差结构,意味着对随机潜在环境变量的回归。本文回顾了此类模型,重点介绍了它们在多环境试验分析中的应用,目的是对目标环境群体进行预测。我们从基本的方差分析模型入手,研究了随机效应假设与固定效应假设的影响,然后转向因子分析模型,并考虑向涉及可观测环境协变量的模型过渡,这些模型有望提供比潜在环境变量模型更准确、更有针对性的预测。
{"title":"Factor-Analytic Variance–Covariance Structures for Prediction Into a Target Population of Environments","authors":"Hans-Peter Piepho,&nbsp;Emlyn Williams","doi":"10.1002/bimj.202400008","DOIUrl":"10.1002/bimj.202400008","url":null,"abstract":"<p>Finlay–Wilkinson regression is a popular method for modeling genotype–environment interaction in plant breeding and crop variety testing. When environment is a random factor, this model may be cast as a factor-analytic variance–covariance structure, implying a regression on random latent environmental variables. This paper reviews such models with a focus on their use in the analysis of multi-environment trials for the purpose of making predictions in a target population of environments. We investigate the implication of random versus fixed effects assumptions, starting from basic analysis-of-variance models, then moving on to factor-analytic models and considering the transition to models involving observable environmental covariates, which promise to provide more accurate and targeted predictions than models with latent environmental variables.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202400008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141762842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Information: Biometrical Journal 6'24 发行信息:生物计量学杂志 6'24
IF 1.3 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-07-22 DOI: 10.1002/bimj.202470006
{"title":"Issue Information: Biometrical Journal 6'24","authors":"","doi":"10.1002/bimj.202470006","DOIUrl":"10.1002/bimj.202470006","url":null,"abstract":"","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"66 6","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.202470006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141778012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrical Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1