Biometrical Journal最新文献

A Bias-Corrected Bayesian Nonparametric Model for Combining Studies With Varying Quality in Meta-Analysis

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-02-07 DOI: 10.1002/bimj.70034

Pablo Emilio Verde, Gary L. Rosner

Bayesian nonparametric (BNP) approaches for meta-analysis have been developed to relax distributional assumptions and handle the heterogeneity of random effects distributions. These models account for possible clustering and multimodality of the random effects distribution. However, when we combine studies of varying quality, the resulting posterior is not only a combination of the results of interest but also factors threatening the integrity of the studies' results. We refer to these factors as the studies' internal validity biases (e.g., reporting bias, data quality, and patient selection bias). In this paper, we introduce a new meta-analysis model called the bias-corrected Bayesian nonparametric (BC-BNP) model, which aims to automatically correct for internal validity bias in meta-analysis by only using the reported effects and their standard errors. The BC-BNP model is based on a mixture of a parametric random effects distribution, which represents the model of interest, and a BNP model for the bias component. This model relaxes the parametric assumptions of the bias distribution of the model introduced by Verde. Using simulated data sets, we evaluate the BC-BNP model and illustrate its applications with two real case studies. Our results show several potential advantages of the BC-BNP model: (1) It can detect bias when present while producing results similar to a simple normal–normal random effects model when bias is absent. (2) Relaxing the parametric assumptions of the bias component does not affect the model of interest and yields consistent results with the model of Verde. (3) In some applications, a BNP model of bias offers a better understanding of the studies' biases by clustering studies with similar biases. We implemented the BC-BNP model in the R package jarbes, facilitating its practical application.

{"title":"A Bias-Corrected Bayesian Nonparametric Model for Combining Studies With Varying Quality in Meta-Analysis","authors":"Pablo Emilio Verde, Gary L. Rosner","doi":"10.1002/bimj.70034","DOIUrl":"https://doi.org/10.1002/bimj.70034","url":null,"abstract":"Bayesian nonparametric (BNP) approaches for meta-analysis have been developed to relax distributional assumptions and handle the heterogeneity of random effects distributions. These models account for possible clustering and multimodality of the random effects distribution. However, when we combine studies of varying quality, the resulting posterior is not only a combination of the results of interest but also factors threatening the integrity of the studies' results. We refer to these factors as the studies' internal validity biases (e.g., reporting bias, data quality, and patient selection bias). In this paper, we introduce a new meta-analysis model called the bias-corrected Bayesian nonparametric (BC-BNP) model, which aims to automatically correct for internal validity bias in meta-analysis by only using the reported effects and their standard errors. The BC-BNP model is based on a mixture of a parametric random effects distribution, which represents the model of interest, and a BNP model for the bias component. This model relaxes the parametric assumptions of the bias distribution of the model introduced by Verde. Using simulated data sets, we evaluate the BC-BNP model and illustrate its applications with two real case studies. Our results show several potential advantages of the BC-BNP model: (1) It can detect bias when present while producing results similar to a simple normal–normal random effects model when bias is absent. (2) Relaxing the parametric assumptions of the bias component does not affect the model of interest and yields consistent results with the model of Verde. (3) In some applications, a BNP model of bias offers a better understanding of the studies' biases by clustering studies with similar biases. We implemented the BC-BNP model in the R package jarbes, facilitating its practical application.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70034","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143362461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Mediation Analysis With Exposure–Mediator Interaction and Covariate Measurement Error Under the Additive Hazards Model

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-02-07 DOI: 10.1002/bimj.70035

Ying Yan, Lingzhu Shen

Causal mediation analysis is a useful tool to examine how an exposure variable causally affects an outcome variable through an intermediate variable. In recent years, there is increasing research interest in mediation analysis with survival data. The existing literature usually requires accurate measurements of the mediator and the confounders, which is infeasible in many biomedical and social science studies. Ignoring measurement errors may lead to misleading inference results. Furthermore, the current identification results of causal effects under the additive hazards model are limited to the scenario with no exposure–mediator interaction, which can be unappealing in mediation analysis. In this paper, we derive the identification results of direct and indirect effects under the additive hazards model in the presence of exposure–mediator interaction. Furthermore, we propose a corrected approach to adjust for the impact of measurement error in the mediator and the confounders and obtain consistent estimations of the direct and indirect effects. The performance of the proposed method is studied in simulation studies and a real data study.

{"title":"Mediation Analysis With Exposure–Mediator Interaction and Covariate Measurement Error Under the Additive Hazards Model","authors":"Ying Yan, Lingzhu Shen","doi":"10.1002/bimj.70035","DOIUrl":"https://doi.org/10.1002/bimj.70035","url":null,"abstract":"<div>\u0000 \u0000 Causal mediation analysis is a useful tool to examine how an exposure variable causally affects an outcome variable through an intermediate variable. In recent years, there is increasing research interest in mediation analysis with survival data. The existing literature usually requires accurate measurements of the mediator and the confounders, which is infeasible in many biomedical and social science studies. Ignoring measurement errors may lead to misleading inference results. Furthermore, the current identification results of causal effects under the additive hazards model are limited to the scenario with no exposure–mediator interaction, which can be unappealing in mediation analysis. In this paper, we derive the identification results of direct and indirect effects under the additive hazards model in the presence of exposure–mediator interaction. Furthermore, we propose a corrected approach to adjust for the impact of measurement error in the mediator and the confounders and obtain consistent estimations of the direct and indirect effects. The performance of the proposed method is studied in simulation studies and a real data study.</div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143362514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multiple Contrast Tests in the Presence of Partial Heteroskedasticity 部分异方差存在下的多重对比检验。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-01-13 DOI: 10.1002/bimj.70019

Mario Hasler, Tim Birr, Ludwig A. Hothorn

This paper proposes a general approach for handling multiple contrast tests for normally distributed data in the presence of partial heteroskedasticity. In contrast to the usual case of complete heteroskedasticity, the treatments belong to subgroups according to their variances. Treatments within these subgroups are homoskedastic, whereas treatments of different subgroups are heteroskedastic. New candidate as well as already existing approaches are described and compared by $� � α � �$ -simulations. Power simulations show that a gain in power is achieved when the partial heteroskedasticity is taken into account compared to procedures which wrongly assume complete heteroskedasticity. The new approaches will be applied to a phytopathological experiment.

本文提出了一种处理存在部分异方差的正态分布数据的多重对比检验的一般方法。与通常情况下的完全异方差相反，治疗根据其方差属于亚组。这些亚组内的处理是同方差的，而不同亚组的处理是异方差的。通过α $ α $模拟对新的候选方法和现有方法进行了描述和比较。功率仿真表明，与错误地假设完全异方差的方法相比，考虑部分异方差的方法可以获得功率增益。新方法将应用于植物病理学实验。

引用次数: 0

Quantification of Difference in Nonselectivity Between In Vitro Diagnostic Medical Devices 体外诊断医疗器械非选择性差异的定量分析。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-01-02 DOI: 10.1002/bimj.70032

Pernille Kjeilen Fauskanger, Sverre Sandberg, Jesper Johansen, Thomas Keller, Jeffrey Budd, W. Greg Miller, Anne Stavelin, Vincent Delatour, Mauro Panteghini, Bård Støve

Correct measurement results from in vitro diagnostic (IVD) medical devices (MD) are crucial for optimal patient care. The performance of IVD-MDs is often assessed through method comparison studies. Such studies can be compromised by the influence of various factors. The effect of these factors must be examined in every method comparison study, for example, nonselectivity differences between compared IVD-MDs are examined. Historically, selectivity or nonselectivity has been defined as a qualitative term. However, a quantification of nonselectivity differences between IVD-MDs is needed. This paper fills this need by introducing a novel measure for quantifying differences in nonselectivity (DINS) between a pair of IVD-MDs. Assuming one of the IVD-MDs involved in the comparison exhibits high selectivity for the analyte, it becomes feasible to quantify nonselectivity in the other IVD-MD by employing this DINS measure. Our approach leverages elements from univariate ordinary least squares regression and incorporates repeatability IVD-MD variances, resulting in a normalized measure. We also introduce a plug-in estimator for this measure, which is notably linked to the average relative increase in prediction interval widths attributable to DINS. This connection is exploited to establish a criterion for identifying excessive DINS utilizing a proof-of-hazard approach. Utilizing Monte Carlo simulations, we investigate how the estimator relates to population characteristics like DINS and heteroskedasticity. We find that DINS impacts the mean, variance, and 99th percentile of the estimator, while heteroskedasticity affects only the latter two, and to a considerably smaller extent compared to DINS. Importantly, the size of the study design modulates these effects. We also confirm, when using clinical data, that DINS between pairs of IVD-MDs influence the estimator correspondingly to those of simulated data. Thus, the proposed estimator serves as an effective metric for quantifying DINS between IVD-MDs and helping to determine the quality of a method comparison study.

体外诊断（IVD）医疗设备（MD）的正确测量结果对于最佳患者护理至关重要。IVD-MDs的性能通常通过方法比较研究来评估。这些研究可能受到各种因素的影响。在每一种方法的比较研究中都必须检查这些因素的影响，例如，检查比较的IVD-MDs之间的非选择性差异。历史上，选择性或非选择性一直被定义为一个定性术语。然而，需要对IVD-MDs之间的非选择性差异进行量化。本文通过引入一种新的方法来量化一对ivd - md之间的非选择性差异（DINS），填补了这一需求。假设参与比较的其中一个IVD-MD对被分析物表现出高选择性，那么通过采用该DINS测量来量化另一个IVD-MD的非选择性是可行的。我们的方法利用了单变量普通最小二乘回归的元素，并结合了可重复性IVD-MD方差，从而得到了标准化的测量结果。我们还为该度量引入了一个插件估计器，它与归因于DINS的预测区间宽度的平均相对增加明显相关。这种联系被用来建立一个标准，以识别过度的DINS利用证明危害的方法。利用蒙特卡罗模拟，我们研究了估计量如何与DINS和异方差等种群特征相关。我们发现DINS影响估计量的均值、方差和第99百分位，而异方差性仅影响后两者，而且与DINS相比，其影响程度要小得多。重要的是，研究设计的规模调节了这些影响。我们还证实，当使用临床数据时，ivd - md对之间的DINS对估计量的影响与模拟数据的估计量相对应。因此，所提出的估计量可作为量化ivd - md之间DINS的有效度量，并有助于确定方法比较研究的质量。

{"title":"Quantification of Difference in Nonselectivity Between In Vitro Diagnostic Medical Devices","authors":"Pernille Kjeilen Fauskanger, Sverre Sandberg, Jesper Johansen, Thomas Keller, Jeffrey Budd, W. Greg Miller, Anne Stavelin, Vincent Delatour, Mauro Panteghini, Bård Støve","doi":"10.1002/bimj.70032","DOIUrl":"10.1002/bimj.70032","url":null,"abstract":"Correct measurement results from in vitro diagnostic (IVD) medical devices (MD) are crucial for optimal patient care. The performance of IVD-MDs is often assessed through method comparison studies. Such studies can be compromised by the influence of various factors. The effect of these factors must be examined in every method comparison study, for example, nonselectivity differences between compared IVD-MDs are examined. Historically, selectivity or nonselectivity has been defined as a qualitative term. However, a quantification of nonselectivity differences between IVD-MDs is needed. This paper fills this need by introducing a novel measure for quantifying differences in nonselectivity (DINS) between a pair of IVD-MDs. Assuming one of the IVD-MDs involved in the comparison exhibits high selectivity for the analyte, it becomes feasible to quantify nonselectivity in the other IVD-MD by employing this DINS measure. Our approach leverages elements from univariate ordinary least squares regression and incorporates repeatability IVD-MD variances, resulting in a normalized measure. We also introduce a plug-in estimator for this measure, which is notably linked to the average relative increase in prediction interval widths attributable to DINS. This connection is exploited to establish a criterion for identifying excessive DINS utilizing a proof-of-hazard approach. Utilizing Monte Carlo simulations, we investigate how the estimator relates to population characteristics like DINS and heteroskedasticity. We find that DINS impacts the mean, variance, and 99th percentile of the estimator, while heteroskedasticity affects only the latter two, and to a considerably smaller extent compared to DINS. Importantly, the size of the study design modulates these effects. We also confirm, when using clinical data, that DINS between pairs of IVD-MDs influence the estimator correspondingly to those of simulated data. Thus, the proposed estimator serves as an effective metric for quantifying DINS between IVD-MDs and helping to determine the quality of a method comparison study.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11695778/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142923893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sensitivity Analysis for Effects of Multiple Exposures in the Presence of Unmeasured Confounding 存在未测量的混杂因素时多重暴露影响的敏感性分析。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2024-12-30 DOI: 10.1002/bimj.70033

Boram Jeong, Seungjae Lee, Shinhee Ye, Donghwan Lee, Woojoo Lee

Epidemiological research aims to investigate how multiple exposures affect health outcomes of interest, but observational studies often suffer from biases caused by unmeasured confounders. In this study, we develop a novel sensitivity model to investigate the effect of correlated multiple exposures on the continuous health outcomes of interest. The proposed sensitivity analysis is model-agnostic and can be applied to any machine learning algorithm. The interval of single- or joint-exposure effects is efficiently obtained by solving a linear programming problem with a quadratic constraint. Some strategies for reducing the input burden in the sensitivity analysis are discussed. We demonstrate the usefulness of sensitivity analysis via numerical studies and real data application.

流行病学研究的目的是调查多重暴露对健康结果的影响，但观察性研究经常受到未测量混杂因素造成的偏差的影响。在这项研究中，我们建立了一个新的敏感性模型来研究相关多重暴露对持续健康结果的影响。提出的灵敏度分析是模型不可知的，可以应用于任何机器学习算法。通过求解具有二次约束的线性规划问题，有效地得到了单暴露或联合暴露效应的区间。讨论了降低灵敏度分析中输入负担的一些策略。我们通过数值研究和实际数据应用证明了灵敏度分析的有效性。

引用次数: 0

Developing and Comparing Four Families of Bayesian Network Autocorrelation Models for Binary Outcomes: Estimating Peer Effects Involving Adoption of Medical Technologies 发展和比较四类贝叶斯网络自相关模型的二元结果：估计涉及医疗技术采用的同伴效应。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2024-12-30 DOI: 10.1002/bimj.70030

Guanqing Chen, A. James O'Malley

Despite the extensive use of network autocorrelation models in social network analysis, network autocorrelation models for binary dependent variables have received surprisingly scant attention. In this paper, we develop four network autocorrelation models for a binary random variable defined by whether the peer effect (also termed social influence or contagion) acts on latent continuous outcomes leading to an indirect effect under a normal or a logistic distribution or on the probability of the observed outcome itself under a probit or a logit link function defining a direct effect to account for interdependence between outcomes. For all models, we use a Bayesian approach for model estimation under a uniform prior on a transformed peer effect parameter ( $� � ρ � �$ ) designed to enhance model computation and compare results to those under the uniform prior for $� � ρ � �$ . We use simulation to assess the performance of Bayesian point and interval estimators for each of the four models when the model that generated the data is used for estimation (precision assessment) and when each of the other three models instead generated the data (robustness assessment). We construct a United States New England region patient-sharing hospital network and apply the four network autocorrelation models to study the adoption of robotic surgery, a new medical technology, among hospitals using a cohort of United States Medicare beneficiaries in 2016 and 2017. Finally, we develop a deviance information criterion for each of the four models to compare their fit to the observed data and use posterior predictive p-values to assess the models' ability to recover specified features of the data. The results find that although the indirect peer effect of the propensity of peer hospital adoption on that of the focal hospital is positive under both latent response autocorrelation models, the direct peer effect of the peer hospital's probability of adopting robotic surgery on the probability of the focal hospital adopting robotic surgery decreases under both mean autocorrelation data models. However, neither of these associations is statistically significant.

尽管网络自相关模型在社会网络分析中得到了广泛的应用，但二元因变量的网络自相关模型却很少受到关注。在本文中，我们为一个二元随机变量开发了四个网络自相关模型，其定义是同伴效应（也称为社会影响或传染）是否作用于潜在的连续结果，导致正态分布或逻辑分布下的间接效应，或作用于probit或logit链接函数下观察到的结果本身的概率，定义了直接效应，以解释结果之间的相互依赖性。对于所有模型，我们使用贝叶斯方法对转换的对等效应参数（ρ $rho$）进行均匀先验下的模型估计，旨在增强模型计算并将结果与ρ $rho$均匀先验下的结果进行比较。当生成数据的模型用于估计（精度评估）以及其他三个模型中的每一个模型生成数据（鲁棒性评估）时，我们使用模拟来评估贝叶斯点和区间估计器对四个模型中的每一个模型的性能。我们构建了一个美国新英格兰地区的患者共享医院网络，并应用四个网络自相关模型研究了2016年和2017年美国医疗保险受益人队列中医院对机器人手术这一新型医疗技术的采用情况。最后，我们为每个模型开发了一个偏差信息标准，以比较它们与观测数据的拟合，并使用后验预测p值来评估模型恢复数据特定特征的能力。结果发现，在两种潜在反应自相关模型下，同行医院采用机器人手术的概率对焦点医院采用机器人手术的概率的间接对等效应均为正，而在两种平均自相关数据模型下，同行医院采用机器人手术的概率对焦点医院采用机器人手术的概率的直接对等效应均减小。然而，这两种关联在统计上都不显著。

{"title":"Developing and Comparing Four Families of Bayesian Network Autocorrelation Models for Binary Outcomes: Estimating Peer Effects Involving Adoption of Medical Technologies","authors":"Guanqing Chen, A. James O'Malley","doi":"10.1002/bimj.70030","DOIUrl":"10.1002/bimj.70030","url":null,"abstract":"<div>\u0000 \u0000 Despite the extensive use of network autocorrelation models in social network analysis, network autocorrelation models for binary dependent variables have received surprisingly scant attention. In this paper, we develop four network autocorrelation models for a binary random variable defined by whether the peer effect (also termed social influence or contagion) acts on latent continuous outcomes leading to an indirect effect under a normal or a logistic distribution or on the probability of the observed outcome itself under a probit or a logit link function defining a direct effect to account for interdependence between outcomes. For all models, we use a Bayesian approach for model estimation under a uniform prior on a transformed peer effect parameter (<math>\u0000 <semantics>\u0000 <mi>ρ</mi>\u0000 <annotation>$rho$</annotation>\u0000 </semantics></math>) designed to enhance model computation and compare results to those under the uniform prior for <math>\u0000 <semantics>\u0000 <mi>ρ</mi>\u0000 <annotation>$rho$</annotation>\u0000 </semantics></math>. We use simulation to assess the performance of Bayesian point and interval estimators for each of the four models when the model that generated the data is used for estimation (precision assessment) and when each of the other three models instead generated the data (robustness assessment). We construct a United States New England region patient-sharing hospital network and apply the four network autocorrelation models to study the adoption of robotic surgery, a new medical technology, among hospitals using a cohort of United States Medicare beneficiaries in 2016 and 2017. Finally, we develop a deviance information criterion for each of the four models to compare their fit to the observed data and use posterior predictive p-values to assess the models' ability to recover specified features of the data. The results find that although the indirect peer effect of the propensity of peer hospital adoption on that of the focal hospital is positive under both latent response autocorrelation models, the direct peer effect of the peer hospital's probability of adopting robotic surgery on the probability of the focal hospital adopting robotic surgery decreases under both mean autocorrelation data models. However, neither of these associations is statistically significant.</div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142911256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Preplanned Multi-Stage Platform Trial for Discovering Multiple Superior Treatments With Control of FWER and Power 一个预先计划的多阶段平台试验，以发现具有控制功率和功率的多种优越治疗方法。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2024-12-22 DOI: 10.1002/bimj.70025

Peter Greenstreet, Thomas Jaki, Alun Bedding, Pavel Mozgunov

There is a growing interest in the implementation of platform trials, which provide the flexibility to incorporate new treatment arms during the trial and the ability to halt treatments early based on lack of benefit or observed superiority. In such trials, it can be important to ensure that error rates are controlled. This paper introduces a multi-stage design that enables the addition of new treatment arms, at any point, in a preplanned manner within a platform trial, while still maintaining control over the family-wise error rate. This paper focuses on finding the required sample size to achieve a desired level of statistical power when treatments are continued to be tested even after a superior treatment has already been found. This may be of interest if there are treatments from different sponsors which are also superior to the current control or multiple doses being tested. The calculations to determine the expected sample size is given. A motivating trial is presented in which the sample size of different configurations is studied. In addition, the approach is compared to running multiple separate trials and it is shown that in many scenarios if family-wise error rate control is needed there may not be benefit in using a platform trial when comparing the sample size of the trial.

人们对平台试验的实施越来越感兴趣，平台试验提供了在试验期间纳入新治疗臂的灵活性，并且能够在缺乏益处或观察到的优势的情况下早期停止治疗。在这样的试验中，确保错误率得到控制是很重要的。本文介绍了一种多阶段设计，可以在平台试验的任何时候以预先计划的方式添加新的治疗臂，同时仍然保持对家庭错误率的控制。本文的重点是找到所需的样本量，以达到理想的统计能力水平，当治疗继续进行测试，即使在一个更好的治疗已经发现。如果来自不同赞助方的治疗方法也优于目前的对照或正在测试的多剂量，这可能会引起人们的兴趣。给出了确定预期样本量的计算方法。提出了一个激励试验，研究了不同构型的样本量。此外，将该方法与运行多个单独的试验进行比较，结果表明，在许多情况下，如果需要家庭错误率控制，那么在比较试验的样本量时，使用平台试验可能没有好处。

{"title":"A Preplanned Multi-Stage Platform Trial for Discovering Multiple Superior Treatments With Control of FWER and Power","authors":"Peter Greenstreet, Thomas Jaki, Alun Bedding, Pavel Mozgunov","doi":"10.1002/bimj.70025","DOIUrl":"10.1002/bimj.70025","url":null,"abstract":"There is a growing interest in the implementation of platform trials, which provide the flexibility to incorporate new treatment arms during the trial and the ability to halt treatments early based on lack of benefit or observed superiority. In such trials, it can be important to ensure that error rates are controlled. This paper introduces a multi-stage design that enables the addition of new treatment arms, at any point, in a preplanned manner within a platform trial, while still maintaining control over the family-wise error rate. This paper focuses on finding the required sample size to achieve a desired level of statistical power when treatments are continued to be tested even after a superior treatment has already been found. This may be of interest if there are treatments from different sponsors which are also superior to the current control or multiple doses being tested. The calculations to determine the expected sample size is given. A motivating trial is presented in which the sample size of different configurations is studied. In addition, the approach is compared to running multiple separate trials and it is shown that in many scenarios if family-wise error rate control is needed there may not be benefit in using a platform trial when comparing the sample size of the trial.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11664203/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142878648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Investigating a Domain Adaptation Approach for Integrating Different Measurement Instruments in a Longitudinal Clinical Registry 纵向临床登记中整合不同测量仪器的域适应方法研究。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2024-12-19 DOI: 10.1002/bimj.70023

Maren Hackenberg, Michelle Pfaffenlehner, Max Behrens, Astrid Pechmann, Janbernd Kirschner, Harald Binder

In a longitudinal clinical registry, different measurement instruments might have been used for assessing individuals at different time points. To combine them, we investigate deep learning techniques for obtaining a joint latent representation, to which the items of different measurement instruments are mapped. This corresponds to domain adaptation, an established concept in computer science for image data. Using the proposed approach as an example, we evaluate the potential of domain adaptation in a longitudinal cohort setting with a rather small number of time points, motivated by an application with different motor function measurement instruments in a registry of spinal muscular atrophy (SMA) patients. There, we model trajectories in the latent representation by ordinary differential equations (ODEs), where person-specific ODE parameters are inferred from baseline characteristics. The goodness of fit and complexity of the ODE solutions then allow to judge the measurement instrument mappings. We subsequently explore how alignment can be improved by incorporating corresponding penalty terms into model fitting. To systematically investigate the effect of differences between measurement instruments, we consider several scenarios based on modified SMA data, including scenarios where a mapping should be feasible in principle and scenarios where no perfect mapping is available. While misalignment increases in more complex scenarios, some structure is still recovered, even if the availability of measurement instruments depends on patient state. A reasonable mapping is feasible also in the more complex real SMA data set. These results indicate that domain adaptation might be more generally useful in statistical modeling for longitudinal registry data.

在纵向临床登记，不同的测量仪器可能已被用于评估个体在不同的时间点。为了将它们结合起来，我们研究了深度学习技术来获得联合潜在表示，不同测量仪器的项目被映射到该联合潜在表示。这对应于领域自适应，这是计算机科学中关于图像数据的一个既定概念。以所提出的方法为例，我们通过在脊髓性肌萎缩症（SMA）患者登记册中应用不同的运动功能测量仪器，在纵向队列设置中评估区域适应的潜力。在那里，我们通过常微分方程（ODE）对潜在表征中的轨迹进行建模，其中个人特定的ODE参数是从基线特征推断出来的。然后，ODE解决方案的拟合优度和复杂性允许判断测量仪器映射。我们随后探讨了如何通过将相应的惩罚项合并到模型拟合中来改进对齐。为了系统地研究测量仪器之间差异的影响，我们基于修改后的SMA数据考虑了几种场景，包括原则上应该可行的映射场景和没有完美映射的场景。虽然在更复杂的情况下，不对准会增加，但即使测量仪器的可用性取决于患者的状态，一些结构仍然可以恢复。在更复杂的实际SMA数据集中，合理的映射也是可行的。这些结果表明，领域自适应在纵向注册数据的统计建模中可能更为普遍。

{"title":"Investigating a Domain Adaptation Approach for Integrating Different Measurement Instruments in a Longitudinal Clinical Registry","authors":"Maren Hackenberg, Michelle Pfaffenlehner, Max Behrens, Astrid Pechmann, Janbernd Kirschner, Harald Binder","doi":"10.1002/bimj.70023","DOIUrl":"10.1002/bimj.70023","url":null,"abstract":"In a longitudinal clinical registry, different measurement instruments might have been used for assessing individuals at different time points. To combine them, we investigate deep learning techniques for obtaining a joint latent representation, to which the items of different measurement instruments are mapped. This corresponds to domain adaptation, an established concept in computer science for image data. Using the proposed approach as an example, we evaluate the potential of domain adaptation in a longitudinal cohort setting with a rather small number of time points, motivated by an application with different motor function measurement instruments in a registry of spinal muscular atrophy (SMA) patients. There, we model trajectories in the latent representation by ordinary differential equations (ODEs), where person-specific ODE parameters are inferred from baseline characteristics. The goodness of fit and complexity of the ODE solutions then allow to judge the measurement instrument mappings. We subsequently explore how alignment can be improved by incorporating corresponding penalty terms into model fitting. To systematically investigate the effect of differences between measurement instruments, we consider several scenarios based on modified SMA data, including scenarios where a mapping should be feasible in principle and scenarios where no perfect mapping is available. While misalignment increases in more complex scenarios, some structure is still recovered, even if the availability of measurement instruments depends on patient state. A reasonable mapping is feasible also in the more complex real SMA data set. These results indicate that domain adaptation might be more generally useful in statistical modeling for longitudinal registry data.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70023","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142857076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

To Tweak or Not to Tweak. How Exploiting Flexibilities in Gene Set Analysis Leads to Overoptimism 调整还是不调整。如何利用基因集分析的灵活性导致过度乐观。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2024-12-19 DOI: 10.1002/bimj.70016

Milena Wünsch, Christina Sauer, Moritz Herrmann, Ludwig Christian Hinske, Anne-Laure Boulesteix

Gene set analysis, a popular approach for analyzing high-throughput gene expression data, aims to identify sets of genes that show enriched expression patterns between two conditions. In addition to the multitude of methods available for this task, users are typically left with many options when creating the required input and specifying the internal parameters of the chosen method. This flexibility can lead to uncertainty about the “right” choice, further reinforced by a lack of evidence-based guidance. Especially when their statistical experience is scarce, this uncertainty might entice users to produce preferable results using a “trial-and-error” approach. While it may seem unproblematic at first glance, this practice can be viewed as a form of “cherry-picking” and cause an optimistic bias, rendering the results nonreplicable on independent data. After this problem has attracted a lot of attention in the context of classical hypothesis testing, we now aim to raise awareness of such overoptimism in the different and more complex context of gene set analyses. We mimic a hypothetical researcher who systematically selects the analysis variants yielding their preferred results, thereby considering three distinct goals they might pursue. Using a selection of popular gene set analysis methods, we tweak the results in this way for two frequently used benchmark gene expression data sets. Our study indicates that the potential for overoptimism is particularly high for a group of methods frequently used despite being commonly criticized. We conclude by providing practical recommendations to counter overoptimism in research findings in gene set analysis and beyond.

基因集分析是分析高通量基因表达数据的一种流行方法，旨在识别在两种情况下表现出丰富表达模式的基因集。除了可用于此任务的众多方法之外，在创建所需的输入并指定所选方法的内部参数时，用户通常还有许多选项。这种灵活性可能导致“正确”选择的不确定性，而缺乏基于证据的指导则进一步加剧了这种不确定性。特别是当他们缺乏统计经验时，这种不确定性可能会诱使用户使用“试错”方法产生更可取的结果。虽然乍一看似乎没有问题，但这种做法可以被视为一种“挑选樱桃”的形式，并导致乐观的偏见，使结果无法在独立数据上复制。在这个问题在经典假设检验的背景下引起了很多关注之后，我们现在的目标是在基因集分析的不同和更复杂的背景下提高对这种过度乐观的认识。我们模拟一个假设的研究人员系统地选择分析变量产生他们喜欢的结果，从而考虑他们可能追求的三个不同的目标。通过选择流行的基因集分析方法，我们以这种方式调整了两个常用的基准基因表达数据集的结果。我们的研究表明，尽管经常受到批评，但对于一组经常使用的方法来说，过度乐观的可能性尤其高。最后，我们提供了一些实用的建议，以防止在基因集分析和其他领域的研究结果过于乐观。

{"title":"To Tweak or Not to Tweak. How Exploiting Flexibilities in Gene Set Analysis Leads to Overoptimism","authors":"Milena Wünsch, Christina Sauer, Moritz Herrmann, Ludwig Christian Hinske, Anne-Laure Boulesteix","doi":"10.1002/bimj.70016","DOIUrl":"10.1002/bimj.70016","url":null,"abstract":"Gene set analysis, a popular approach for analyzing high-throughput gene expression data, aims to identify sets of genes that show enriched expression patterns between two conditions. In addition to the multitude of methods available for this task, users are typically left with many options when creating the required input and specifying the internal parameters of the chosen method. This flexibility can lead to uncertainty about the “right” choice, further reinforced by a lack of evidence-based guidance. Especially when their statistical experience is scarce, this uncertainty might entice users to produce preferable results using a “trial-and-error” approach. While it may seem unproblematic at first glance, this practice can be viewed as a form of “cherry-picking” and cause an optimistic bias, rendering the results nonreplicable on independent data. After this problem has attracted a lot of attention in the context of classical hypothesis testing, we now aim to raise awareness of such overoptimism in the different and more complex context of gene set analyses. We mimic a hypothetical researcher who systematically selects the analysis variants yielding their preferred results, thereby considering three distinct goals they might pursue. Using a selection of popular gene set analysis methods, we tweak the results in this way for two frequently used benchmark gene expression data sets. Our study indicates that the potential for overoptimism is particularly high for a group of methods frequently used despite being commonly criticized. We conclude by providing practical recommendations to counter overoptimism in research findings in gene set analysis and beyond.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70016","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142857080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Progression-Free-Survival Ratio in Molecularly Aided Tumor Trials: A Critical Examination of Current Practice and Suggestions for Alternative Methods 分子辅助肿瘤试验中的无进展生存率：对当前实践和替代方法建议的关键检查。

IF 1.3 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2024-12-18 DOI: 10.1002/bimj.70028

Dominic Edelmann, Tobias Terzer, Peter Horak, Richard Schlenk, Axel Benner

The progression-free-survival ratio is a popular endpoint in oncology trials, which is frequently applied to evaluate the efficacy of molecularly targeted treatments in late-stage patients. Using elementary calculations and simulations, numerous shortcomings of the current methodology are pointed out. As a remedy to these shortcomings, an alternative methodology is proposed, using a marginal Cox model or a marginal accelerated failure time model for clustered time-to-event data. Using comprehensive simulations, it is shown that this methodology outperforms existing methods in settings where the intrapatient correlation is low to moderate. The performance of the model is further demonstrated in a real data example from a molecularly aided tumor trial. Sample size considerations are discussed.

无进展生存率是肿瘤学试验中一个流行的终点，经常用于评估晚期患者分子靶向治疗的疗效。通过初步的计算和仿真，指出了当前方法的许多不足。为了弥补这些缺点，提出了一种替代方法，即使用边际Cox模型或边际加速失效时间模型来处理聚类时间到事件数据。综合模拟表明，该方法优于现有的方法，在设置中，患者内部的相关性是低到中等。该模型的性能在一个来自分子辅助肿瘤试验的真实数据示例中得到进一步证明。讨论了样本量的考虑。

引用次数: 0