Biometrical Journal最新文献_第4页

Sharp Bounds for Continuous-Valued Treatment Effects with Unobserved Confounders 具有未观察混杂因素的连续值治疗效果的锐界。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-10-14 DOI: 10.1002/bimj.70084

Jean-Baptiste Baitairian, Bernard Sebastien, Rana Jreich, Sandrine Katsahian, Agathe Guilloux

In causal inference, treatment effects are typically estimated under the ignorability, or unconfoundedness, assumption, which is often unrealistic in observational data. By relaxing this assumption and conducting a sensitivity analysis, we introduce novel bounds and derive confidence intervals for the Average Potential Outcome (APO)—a standard metric for evaluating continuous-valued treatment or exposure effects. We demonstrate that these bounds are sharp under a continuous sensitivity model, in the sense that they give the smallest possible interval under this model, and propose a doubly robust version of our estimators. In a comparative analysis with another method from the literature, using both simulated and real data sets, we show that our approach not only yields sharper bounds but also achieves good coverage of the true APO, with significantly reduced computation times.

在因果推理中，治疗效果通常是在可忽略性或非混淆性假设下估计的，这在观察数据中往往是不现实的。通过放宽这一假设并进行敏感性分析，我们引入了新的界限并推导了平均潜在结果（APO）的置信区间——APO是评估连续值治疗或暴露效应的标准度量。我们证明了这些边界在连续灵敏度模型下是尖锐的，在某种意义上，它们给出了该模型下最小的可能区间，并提出了我们估计的双鲁棒版本。在与文献中的另一种方法（使用模拟和真实数据集）的比较分析中，我们表明，我们的方法不仅产生更清晰的边界，而且还实现了对真实APO的良好覆盖，大大减少了计算时间。

引用次数: 0

Multivariate Bayesian Dynamic Borrowing for Repeated Measures Data With Application to External Control Arms in Open-Label Extension Studies 重复测量数据的多元贝叶斯动态借用及其在开放标签扩展研究中的应用。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-10-07 DOI: 10.1002/bimj.70079

Benjamin F. Hartley, Matthew A. Psioda, Adrian P. Mander

Borrowing analyses are increasingly important in clinical trials. We develop a method for using robust mixture priors in multivariate dynamic borrowing. The method was motivated by a desire to produce causally valid, long-term treatment effect estimates of a continuous endpoint from a single active-arm open-label extension study following a randomized clinical trial by dynamically incorporating prior beliefs from a long-term external control arm. The proposed method is a generally applicable Bayesian dynamic borrowing analysis for estimates of multivariate summary metrics based on a multivariate normal likelihood function for various parameter models, some of which we describe. There are important connections to estimation incorporating a prior belief for a hypothetical estimand strategy, that is, had the event not occurred, for intercurrent events which lead to missing data.

借用分析在临床试验中越来越重要。提出了一种在多元动态借贷中使用鲁棒混合先验的方法。该方法的动机是希望在随机临床试验之后，通过动态地结合长期外部对照组的先验信念，对单个主动臂开放标签扩展研究的连续终点进行因果有效的长期治疗效果估计。本文提出的方法是一种基于多元正态似然函数的多元汇总指标估计的贝叶斯动态借用分析方法，适用于各种参数模型，我们描述了其中的一些。对于一个假设的估计策略，也就是说，如果事件没有发生，对于导致丢失数据的交互事件，与纳入先验信念的估计有重要的联系。

引用次数: 0

Weibull Regression With Both Measurement Error and Misclassification in Covariates 协变量存在测量误差和误分类的威布尔回归。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-10-07 DOI: 10.1002/bimj.70083

Zhiqiang Cao, Man Yu Wong

The problem of measurement error and misclassification in covariates is ubiquitous in nutritional epidemiology and some other research areas, which often leads to biased estimate and loss of power. However, addressing both measurement error and misclassification simultaneously in a single analysis is challenged and less actively studied, especially in regression model for survival data with censoring. The approximate maximum likelihood estimation (AMLE) has been proved to be an effective method to correct both measurement error and misclassification simultaneously in a logistic regression model. However, its impact on survival analysis models has not been studied. In this paper, we study biases caused by both measurement error and misclassification in covariates from a Weibull accelerated failure time model, and explore the use of AMLE and its asymptotic properties to correct these biases. Extensive simulation studies are conducted to evaluate the finite-sample performance of the resulting estimator. The proposed method is then applied to deal with measurement error and misclassification in some nutrients of interest from the EPIC-InterAct Study.

在营养流行病学和其他一些研究领域中，协变量的测量误差和误分类问题普遍存在，这往往导致估计的偏倚和功率损失。然而，在单一分析中同时解决测量误差和错误分类是一个挑战，而且研究较少，特别是在带有审查的生存数据的回归模型中。近似最大似然估计（AMLE）已被证明是一种同时校正逻辑回归模型测量误差和误分类的有效方法。然而，它对生存分析模型的影响尚未得到研究。在本文中，我们研究了威布尔加速失效时间模型中由测量误差和误分类引起的偏差，并探索了利用AMLE及其渐近性质来纠正这些偏差。进行了广泛的仿真研究，以评估所得估计器的有限样本性能。提出的方法随后被应用于处理EPIC-InterAct研究中一些感兴趣的营养物质的测量误差和错误分类。

{"title":"Weibull Regression With Both Measurement Error and Misclassification in Covariates","authors":"Zhiqiang Cao, Man Yu Wong","doi":"10.1002/bimj.70083","DOIUrl":"10.1002/bimj.70083","url":null,"abstract":"<div>\u0000 \u0000 The problem of measurement error and misclassification in covariates is ubiquitous in nutritional epidemiology and some other research areas, which often leads to biased estimate and loss of power. However, addressing both measurement error and misclassification simultaneously in a single analysis is challenged and less actively studied, especially in regression model for survival data with censoring. The approximate maximum likelihood estimation (AMLE) has been proved to be an effective method to correct both measurement error and misclassification simultaneously in a logistic regression model. However, its impact on survival analysis models has not been studied. In this paper, we study biases caused by both measurement error and misclassification in covariates from a Weibull accelerated failure time model, and explore the use of AMLE and its asymptotic properties to correct these biases. Extensive simulation studies are conducted to evaluate the finite-sample performance of the resulting estimator. The proposed method is then applied to deal with measurement error and misclassification in some nutrients of interest from the EPIC-InterAct Study.</div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 5","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145245843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Improving Genomic Prediction Using High-Dimensional Secondary Phenotypes: The Genetic Latent Factor Approach 利用高维次级表型改进基因组预测：遗传潜在因子方法。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-10-07 DOI: 10.1002/bimj.70081

Killian A. C. Melsen, Jonathan F. Kunst, José Crossa, Margaret R. Krause, Fred A. van Eeuwijk, Willem Kruijer, Carel F. W. Peeters

Decreasing costs and new technologies have led to an increase in the amount of data available to plant breeding programs. High-throughput phenotyping (HTP) platforms routinely generate high-dimensional datasets of secondary features that may be used to improve genomic prediction accuracy. However, integration of these data comes with challenges such as multicollinearity, parameter estimation in $� � � p � > � n � � �$ settings, and the computational complexity of many standard approaches. Several methods have emerged to analyze such data, but interpretation of model parameters often remains challenging. We propose genetic latent factor best linear unbiased prediction (glfBLUP), a prediction pipeline that reduces the dimensionality of the original secondary HTP data using generative factor analysis. In short, glfBLUP uses redundancy filtered and regularized genetic and residual correlation matrices to fit a maximum likelihood factor model and estimate genetic latent factor scores. These latent factors are subsequently used in multitrait genomic prediction. Our approach performs better than alternatives in extensive simulations and a real-world application, while producing easily interpretable and biologically relevant parameters. We discuss several possible extensions and highlight glfBLUP as the basis for a flexible and modular multitrait genomic prediction framework.

成本的降低和新技术的发展使得植物育种项目的数据量有所增加。高通量表型（HTP）平台通常生成次要特征的高维数据集，可用于提高基因组预测的准确性。然而，这些数据的集成带来了多重共线性、p > n$设置中的参数估计以及许多标准方法的计算复杂性等挑战。已经出现了几种方法来分析这些数据，但模型参数的解释往往仍然具有挑战性。我们提出了遗传潜在因子最佳线性无偏预测（glfBLUP），这是一种利用生成因子分析降低原始次要HTP数据维数的预测管道。简而言之，glfBLUP使用冗余过滤和正则化的遗传和残差相关矩阵来拟合最大似然因子模型并估计遗传潜在因子得分。这些潜在因素随后被用于多性状基因组预测。我们的方法在广泛的模拟和实际应用中比其他方法表现得更好，同时产生易于解释和生物相关的参数。我们讨论了几种可能的扩展，并强调glfBLUP作为灵活和模块化多性状基因组预测框架的基础。

{"title":"Improving Genomic Prediction Using High-Dimensional Secondary Phenotypes: The Genetic Latent Factor Approach","authors":"Killian A. C. Melsen, Jonathan F. Kunst, José Crossa, Margaret R. Krause, Fred A. van Eeuwijk, Willem Kruijer, Carel F. W. Peeters","doi":"10.1002/bimj.70081","DOIUrl":"10.1002/bimj.70081","url":null,"abstract":"Decreasing costs and new technologies have led to an increase in the amount of data available to plant breeding programs. High-throughput phenotyping (HTP) platforms routinely generate high-dimensional datasets of secondary features that may be used to improve genomic prediction accuracy. However, integration of these data comes with challenges such as multicollinearity, parameter estimation in <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>p</mi>\u0000 <mo>></mo>\u0000 <mi>n</mi>\u0000 </mrow>\u0000 <annotation>$p > n$</annotation>\u0000 </semantics></math> settings, and the computational complexity of many standard approaches. Several methods have emerged to analyze such data, but interpretation of model parameters often remains challenging. We propose genetic latent factor best linear unbiased prediction (glfBLUP), a prediction pipeline that reduces the dimensionality of the original secondary HTP data using generative factor analysis. In short, glfBLUP uses redundancy filtered and regularized genetic and residual correlation matrices to fit a maximum likelihood factor model and estimate genetic latent factor scores. These latent factors are subsequently used in multitrait genomic prediction. Our approach performs better than alternatives in extensive simulations and a real-world application, while producing easily interpretable and biologically relevant parameters. We discuss several possible extensions and highlight glfBLUP as the basis for a flexible and modular multitrait genomic prediction framework.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 5","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12504928/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145245751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Federated Mixed Effects Logistic Regression Based on One-Time Shared Summary Statistics 基于一次性共享汇总统计的联邦混合效应逻辑回归。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-09-29 DOI: 10.1002/bimj.70080

Marie Analiz April Limpoco, Christel Faes, Niel Hens

Upholding data privacy, especially in medical research, has become tantamount to facing difficulties in accessing individual-level patient data. Estimating mixed effects binary logistic regression models involving data from multiple data providers, like hospitals, thus becomes more challenging. Federated learning has emerged as an option to preserve the privacy of individual observations while still estimating a global model that can be interpreted on the individual level, but it usually involves iterative communication between the data providers and the data analyst. In this paper, we present a strategy to estimate a mixed effects binary logistic regression model that requires data providers to share summary statistics only once. It involves generating pseudo-data whose summary statistics match those of the actual data and using these in the model estimation process instead of the actual unavailable data. Our strategy is able to include multiple predictors, which can be a combination of continuous and categorical variables. Through simulation, we show that our approach estimates the true model at least as good as the one that requires the pooled individual observations. An illustrative example using real data is provided. Unlike typical federated learning algorithms, our approach eliminates infrastructure requirements and security issues while being communication efficient and while accounting for heterogeneity.

维护数据隐私，特别是在医学研究方面，已经等同于在获取个人层面的患者数据方面面临困难。因此，估计涉及多个数据提供者（如医院）数据的混合效应二元逻辑回归模型变得更具挑战性。联邦学习作为一种保护个人观察的隐私的选择而出现，同时仍然估计可以在个人级别上解释的全局模型，但它通常涉及数据提供者和数据分析师之间的迭代通信。在本文中，我们提出了一种估计混合效应二元逻辑回归模型的策略，该模型要求数据提供者只共享一次汇总统计数据。它涉及生成与实际数据的汇总统计相匹配的伪数据，并在模型估计过程中使用这些伪数据，而不是实际的不可用数据。我们的策略能够包含多个预测因子，这些预测因子可以是连续变量和分类变量的组合。通过模拟，我们表明我们的方法估计真实模型至少与需要汇集个人观测的模型一样好。给出了一个使用实际数据的示例。与典型的联邦学习算法不同，我们的方法消除了基础设施需求和安全问题，同时保证了通信效率和异构性。

{"title":"Federated Mixed Effects Logistic Regression Based on One-Time Shared Summary Statistics","authors":"Marie Analiz April Limpoco, Christel Faes, Niel Hens","doi":"10.1002/bimj.70080","DOIUrl":"10.1002/bimj.70080","url":null,"abstract":"<div>\u0000 \u0000 Upholding data privacy, especially in medical research, has become tantamount to facing difficulties in accessing individual-level patient data. Estimating mixed effects binary logistic regression models involving data from multiple data providers, like hospitals, thus becomes more challenging. Federated learning has emerged as an option to preserve the privacy of individual observations while still estimating a global model that can be interpreted on the individual level, but it usually involves iterative communication between the data providers and the data analyst. In this paper, we present a strategy to estimate a mixed effects binary logistic regression model that requires data providers to share summary statistics only once. It involves generating pseudo-data whose summary statistics match those of the actual data and using these in the model estimation process instead of the actual unavailable data. Our strategy is able to include multiple predictors, which can be a combination of continuous and categorical variables. Through simulation, we show that our approach estimates the true model at least as good as the one that requires the pooled individual observations. An illustrative example using real data is provided. Unlike typical federated learning algorithms, our approach eliminates infrastructure requirements and security issues while being communication efficient and while accounting for heterogeneity.</div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 5","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A New Logistic Model With Subject-Specific and Serially Correlated Time-Specific Distribution-Free Random Effects on the Unit Interval for Longitudinal Binary Data 纵向二值数据单位区间上具有主体特异性和序列相关时间特异性无分布随机效应的Logistic模型。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-09-28 DOI: 10.1002/bimj.70078

Lulu Zhang, Renjun Ma, Guohua Yan, Xifen Huang

Various beta-binomial mixed effects models have been developed in recent years for longitudinal binary data; however, these approaches rely heavily on the parametric specification of beta and normal random effects. Furthermore, their incorporation of normal random effects into beta-binomial models has been done at the sacrifice of certain computational convenience and clear interpretation with beta-binomial models. In this paper, we introduce a new model that incorporates subject-specific and serially correlated time-specific distribution-free random effects on the unit interval into logistic regression multiplicatively with fixed effects. This new multiplicative model facilitates the interpretation of random effects on the unit interval as risk modifiers. This multiplicative model setup also eases the model derivation and random effects prediction. A quasi-likelihood approach has been developed in the estimation of our model. Our results are robust against random effects distributions. Our method is illustrated through the analysis of multiple sclerosis trial data.

近年来，针对纵向二元数据建立了多种β -二项混合效应模型；然而，这些方法严重依赖于beta和正态随机效应的参数规范。此外，它们将正态随机效应纳入β -二项模型是以牺牲某些计算便利性和β -二项模型的清晰解释为代价的。在本文中，我们引入了一个新的模型，该模型将单位区间上的特定主题和序列相关的特定时间的无分布随机效应纳入到具有固定效应的乘法逻辑回归中。这种新的乘法模型有助于解释单位区间上的随机效应作为风险修正因子。这种乘法模型的建立也简化了模型的推导和随机效应的预测。在我们的模型的估计中发展了一种准似然方法。我们的结果对于随机效应分布是稳健的。我们的方法通过对多发性硬化症试验数据的分析来说明。

{"title":"A New Logistic Model With Subject-Specific and Serially Correlated Time-Specific Distribution-Free Random Effects on the Unit Interval for Longitudinal Binary Data","authors":"Lulu Zhang, Renjun Ma, Guohua Yan, Xifen Huang","doi":"10.1002/bimj.70078","DOIUrl":"10.1002/bimj.70078","url":null,"abstract":"<div>\u0000 \u0000 Various beta-binomial mixed effects models have been developed in recent years for longitudinal binary data; however, these approaches rely heavily on the parametric specification of beta and normal random effects. Furthermore, their incorporation of normal random effects into beta-binomial models has been done at the sacrifice of certain computational convenience and clear interpretation with beta-binomial models. In this paper, we introduce a new model that incorporates subject-specific and serially correlated time-specific distribution-free random effects on the unit interval into logistic regression multiplicatively with fixed effects. This new multiplicative model facilitates the interpretation of random effects on the unit interval as risk modifiers. This multiplicative model setup also eases the model derivation and random effects prediction. A quasi-likelihood approach has been developed in the estimation of our model. Our results are robust against random effects distributions. Our method is illustrated through the analysis of multiple sclerosis trial data.</div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 5","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ADDIS-Graphs for Online Error Control With Application to Platform Trials 在线错误控制的adis -图及其在平台试验中的应用。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-09-28 DOI: 10.1002/bimj.70075

Lasse Fischer, Marta Bofill Roig, Werner Brannath

In contemporary research, online error control is often required, where an error criterion, such as familywise error rate (FWER) or false discovery rate (FDR), shall remain under control while testing an a priori unbounded sequence of hypotheses. The existing online literature mainly considered large-scale studies and constructed powerful but rigid algorithms for these. However, smaller studies, such as platform trials, require high flexibility and easy interpretability to take study objectives into account and facilitate the communication. Another challenge in platform trials is that due to the shared control arm some of the $� � p � �$ -values are dependent and significance levels need to be prespecified before the decisions for all the past treatments are available. We propose adaptive-discarding-Graphs (ADDIS-Graphs) with FWER control that due to their graphical structure perfectly adapt to such settings and provably uniformly improve the state-of-the-art method. We introduce several extensions of these ADDIS-Graphs, including the incorporation of information about the joint distribution of the $� � p � �$ -values and a version for FDR control.

在当代研究中，经常需要在线错误控制，在测试先验无界假设序列时，需要控制错误标准，如家庭错误率（FWER）或错误发现率（FDR）。现有的网络文献主要考虑大规模的研究，并为此构建了强大但严格的算法。然而，较小的研究，如平台试验，需要高度的灵活性和易于解释，以考虑研究目标并促进交流。平台试验的另一个挑战是，由于共享控制臂，一些p$ p$值是依赖的，需要在所有过去治疗的决策可用之前预先指定显著性水平。我们提出了具有FWER控制的自适应丢弃图（adis - graphs），由于其图形结构完美地适应了这种设置，并且可以证明其均匀地改进了最先进的方法。我们介绍了这些adis - graph的几个扩展，包括p$ p$值的联合分布信息的合并和FDR控制的一个版本。

{"title":"ADDIS-Graphs for Online Error Control With Application to Platform Trials","authors":"Lasse Fischer, Marta Bofill Roig, Werner Brannath","doi":"10.1002/bimj.70075","DOIUrl":"10.1002/bimj.70075","url":null,"abstract":"In contemporary research, online error control is often required, where an error criterion, such as familywise error rate (FWER) or false discovery rate (FDR), shall remain under control while testing an a priori unbounded sequence of hypotheses. The existing online literature mainly considered large-scale studies and constructed powerful but rigid algorithms for these. However, smaller studies, such as platform trials, require high flexibility and easy interpretability to take study objectives into account and facilitate the communication. Another challenge in platform trials is that due to the shared control arm some of the <math>\u0000 <semantics>\u0000 <mi>p</mi>\u0000 <annotation>$p$</annotation>\u0000 </semantics></math>-values are dependent and significance levels need to be prespecified before the decisions for all the past treatments are available. We propose adaptive-discarding-Graphs (ADDIS-Graphs) with FWER control that due to their graphical structure perfectly adapt to such settings and provably uniformly improve the state-of-the-art method. We introduce several extensions of these ADDIS-Graphs, including the incorporation of information about the joint distribution of the <math>\u0000 <semantics>\u0000 <mi>p</mi>\u0000 <annotation>$p$</annotation>\u0000 </semantics></math>-values and a version for FDR control.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 5","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12476829/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145187788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Inference Under Covariate-Adaptive Randomization Using Random Center-Effect 基于随机中心效应的协变量自适应随机化推理

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-09-22 DOI: 10.1002/bimj.70076

Anjali Pandey, Harsha Shree BS, Andrea Callegaro

The minimization method is a popular choice for covariate-adaptive randomization in multicenter trials. Existing literature suggests that the type-I error is controlled if minimization variables are included in the statistical analysis. However, in practice, minimization variables with many categories, such as the recruitment center, are often not included in the model. In this paper, we propose including the minimization variable “center” as a random effect and assess its performance using simulations for Gaussian, binary, and Poisson endpoint variables. Our simulation study suggests that the random-effect model controls type-I error and preserves maximum power for all three endpoints under varied clinical trial settings. This approach offers an alternative to the re-randomization test, which regulatory authorities often suggest for sensitivity analysis.

最小化方法是多中心试验中协变量自适应随机化的常用方法。现有文献表明，如果在统计分析中加入最小化变量，则可以控制i型误差。然而，在实践中，具有许多类别的最小化变量，例如招聘中心，通常不包括在模型中。在本文中，我们建议将最小化变量“中心”作为随机效应，并通过模拟高斯、二进制和泊松端点变量来评估其性能。我们的模拟研究表明，随机效应模型控制了i型误差，并在不同的临床试验设置下保留了所有三个终点的最大功率。这种方法提供了一种替代的再随机化试验，这是监管机构经常建议敏感性分析。

引用次数: 0

From Data to Knowledge. Advancing Life Sciences: Editorial for the CEN2023 Special Issue 从数据到知识。推进生命科学：CEN2023特刊社论。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-09-17 DOI: 10.1002/bimj.70077

Werner Brannath, Frank Bretz, Hans Ulrich Burger, Malgorzata Graczyk, Annette Kopp-Schneider

This Special Issue—From Data to Knowledge. Advancing Life Sciences—arose from the Fifth Conference of the Central European Network (CEN2023) of the International Biometric Society, which took place on September 3–7, 2023, in Basel, Switzerland (https://cen2023.github.io/home/). More than 500 colleagues registered for in-person attendance and a further 100 participated virtually, representing more than 30 countries. The scientific program began on Sunday with seven short courses. From Monday through Thursday, the main conference featured seven parallel tracks and nearly 400 oral and poster contributions, including keynote presentations by Ruth Keogh, Alicja Szabelska-Beręsewicz, and Peter Bühlmann.This special issue consists of 14 peer-reviewed articles generated from research work presented at the symposium. The collection reflects the vibrancy and breadth of current research in biometrics, spanning areas such as clinical trials, epidemiology, genomics, and ecology. Von Felten et al. performed a simulation study comparing multiple approaches to estimating the survivor average causal effect in randomized trials with outcomes truncated by death. Carrozzo et al. compared the statistical efficiency of a two-arm crossover randomized controlled trial with that of a meta-analysis of N-of-1 studies, highlighting the potential of sequential aggregation. Burk et al. proposed a cooperative penalized regression approach for high-dimensional variable selection with competing risks, improving feature selection over traditional methods. Erdmann et al. demonstrated how multistate modeling of progression-free and overall survival endpoints can enhance oncology clinical trial design, especially in the presence of nonproportional hazards. Wünsch et al. investigated how the flexibility in gene set analysis can lead to overoptimistic findings, raising awareness of methodological uncertainty and offering practical guidance. Nassiri et al. proposed a Bayesian posterior probability adjustment method to mitigate class imbalance in classification tasks, improving predictive accuracy. Kim et al. introduced an inverse-weighted quantile regression approach tailored for partially interval-censored data, applicable to complex biomedical endpoints. Teschke et al. developed a method using cross-leverage scores to efficiently detect interaction effects in high-dimensional genetic data. Uno et al. proposed Firth-type penalized regression methods to improve the performance of modified Poisson and least-squares regression models in small or sparse binary outcome settings. Langthaler et al. developed a nonparametric inference method for assessing ecological niche overlap among multiple species, supporting biodiversity research. Kipruto and Sauerbrei revisited postestimation shrinkage in linear models, introducing a modified parameter-wise shrinkage method and assessing its performance in various settings. Röver and Friede explored the concept of “study twins”

本期特刊——从数据到知识。推进生命科学——起源于国际生物识别学会中欧网络第五次会议（CEN2023），该会议于2023年9月3日至7日在瑞士巴塞尔举行（https://cen2023.github.io/home/）。500多名同事注册亲自出席，另有100人参加了虚拟会议，代表了30多个国家。科学项目从周日开始，有七个短期课程。从周一到周四，主要会议有七个平行的轨道和近400个口头和海报贡献，包括Ruth Keogh， Alicja Szabelska-Beręsewicz和Peter b hlmann的主题演讲。本期特刊收录了14篇经同行评议的论文，这些论文来自于在研讨会上发表的研究工作。这些收集反映了当前生物测定学研究的活力和广度，涵盖了临床试验、流行病学、基因组学和生态学等领域。Von Felten等人进行了一项模拟研究，比较了在随机试验中估计幸存者平均因果效应的多种方法，结果被死亡截断。Carrozzo等人比较了两组交叉随机对照试验与N-of-1项研究的荟萃分析的统计效率，强调了顺序聚合的潜力。Burk等人提出了一种合作惩罚回归方法，用于具有竞争风险的高维变量选择，改进了传统方法的特征选择。Erdmann等人证明了无进展和总生存终点的多状态建模如何增强肿瘤临床试验设计，特别是在存在非比例风险的情况下。w nsch等人研究了基因集分析的灵活性如何导致过度乐观的结果，提高了对方法不确定性的认识，并提供了实际指导。Nassiri等人提出了一种贝叶斯后验概率调整方法来缓解分类任务中的类不平衡，提高预测准确率。Kim等人介绍了一种针对部分区间截尾数据量身定制的逆加权分位数回归方法，适用于复杂的生物医学终点。Teschke等人开发了一种使用交叉杠杆分数来有效检测高维遗传数据中的相互作用效应的方法。Uno等人提出了firth型惩罚回归方法，以提高修正泊松和最小二乘回归模型在小或稀疏二进制结果设置中的性能。Langthaler等人开发了一种非参数推理方法来评估多物种之间的生态位重叠，支持生物多样性研究。Kipruto和Sauerbrei重新研究了线性模型中的后估计收缩，引入了一种改进的参数收缩方法，并评估了其在各种设置中的性能。Röver和Friede在荟萃分析中探索了“研究双胞胎”的概念，显示了来自两个试验的有限信息如何使关于异质性的决策复杂化。Behning等人通过结合基于子分布的imputation策略，将随机生存森林扩展到相互竞争的风险设置中，证明了累积关联函数预测的改进。最后，Rousson和Locatelli根据生命损失年数制定了死亡率指标，并应用这些指标量化了COVID-19在30个国家的影响。我们对许多担任审稿人的同事表示感谢，他们为提交的文章提供了周到、高质量的评估。如果没有他们慷慨和专业的承诺，这个问题是不可能解决的。按照《生物计量学杂志》的惯例，所有审稿人的名字都将在一份年终名单中公布，并在未来的一期中发表。我们还要感谢Matthias Schmid、Monika Kortenjann和整个《生物计量学杂志》的编辑团队为确保顺利及时的制作过程所做的不懈努力。最后，我们感谢CEN2023会议的赞助商和资助机构的慷慨支持：安进、巴塞尔城市、百济神州、勃林格殷格翰、百时美施贵宝、CRC Press、Cytel、Datamap、Denali、杨森、Karger、诺华、PHRT Network、Posit、罗氏、赛诺菲、施普林格和瑞士国家科学基金会。我们期待着2026年在华沙举行的下一届CEN会议。到时见！

{"title":"From Data to Knowledge. Advancing Life Sciences: Editorial for the CEN2023 Special Issue","authors":"Werner Brannath, Frank Bretz, Hans Ulrich Burger, Malgorzata Graczyk, Annette Kopp-Schneider","doi":"10.1002/bimj.70077","DOIUrl":"10.1002/bimj.70077","url":null,"abstract":"This Special Issue—From Data to Knowledge. Advancing Life Sciences—arose from the Fifth Conference of the Central European Network (CEN2023) of the International Biometric Society, which took place on September 3–7, 2023, in Basel, Switzerland (https://cen2023.github.io/home/). More than 500 colleagues registered for in-person attendance and a further 100 participated virtually, representing more than 30 countries. The scientific program began on Sunday with seven short courses. From Monday through Thursday, the main conference featured seven parallel tracks and nearly 400 oral and poster contributions, including keynote presentations by Ruth Keogh, Alicja Szabelska-Beręsewicz, and Peter Bühlmann.This special issue consists of 14 peer-reviewed articles generated from research work presented at the symposium. The collection reflects the vibrancy and breadth of current research in biometrics, spanning areas such as clinical trials, epidemiology, genomics, and ecology. Von Felten et al. performed a simulation study comparing multiple approaches to estimating the survivor average causal effect in randomized trials with outcomes truncated by death. Carrozzo et al. compared the statistical efficiency of a two-arm crossover randomized controlled trial with that of a meta-analysis of N-of-1 studies, highlighting the potential of sequential aggregation. Burk et al. proposed a cooperative penalized regression approach for high-dimensional variable selection with competing risks, improving feature selection over traditional methods. Erdmann et al. demonstrated how multistate modeling of progression-free and overall survival endpoints can enhance oncology clinical trial design, especially in the presence of nonproportional hazards. Wünsch et al. investigated how the flexibility in gene set analysis can lead to overoptimistic findings, raising awareness of methodological uncertainty and offering practical guidance. Nassiri et al. proposed a Bayesian posterior probability adjustment method to mitigate class imbalance in classification tasks, improving predictive accuracy. Kim et al. introduced an inverse-weighted quantile regression approach tailored for partially interval-censored data, applicable to complex biomedical endpoints. Teschke et al. developed a method using cross-leverage scores to efficiently detect interaction effects in high-dimensional genetic data. Uno et al. proposed Firth-type penalized regression methods to improve the performance of modified Poisson and least-squares regression models in small or sparse binary outcome settings. Langthaler et al. developed a nonparametric inference method for assessing ecological niche overlap among multiple species, supporting biodiversity research. Kipruto and Sauerbrei revisited postestimation shrinkage in linear models, introducing a modified parameter-wise shrinkage method and assessing its performance in various settings. Röver and Friede explored the concept of “study twins” ","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 5","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70077","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145076681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Estimands for Early-Phase Dose Optimization Trials in Oncology 肿瘤早期剂量优化试验的估计。

IF 1.8 3区生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Biometrical Journal

Pub Date : 2025-09-10 DOI: 10.1002/bimj.70072

Ayon Mukherjee, Jonathan L. Moscovici, Zheng Liu

Phase I dose escalation trials in oncology generally aim to find the maximum tolerated dose. However, with the advent of molecular-targeted therapies and antibody drug conjugates, dose-limiting toxicities are less frequently observed, giving rise to the concept of optimal biological dose (OBD), which considers both efficacy and toxicity. The estimand framework presented in the addendum of the ICH E9(R1) guidelines strengthens the dialogue between different stakeholders by bringing in greater clarity in the clinical trial objectives and by providing alignment between the targeted estimand under consideration and the statistical analysis methods. However, there is a lack of clarity in implementing this framework in early-phase dose optimization studies. This paper aims to discuss the estimand framework for dose optimization trials in oncology, considering efficacy and toxicity through utility functions. Such trials should include pharmacokinetics data, toxicity data, and efficacy data. Based on these data, the analysis methods used to identify the optimized dose/s are also described. Focusing on optimizing the utility function to estimate the OBD, the population-level summary measure should reflect only the properties used for estimating this utility function. A detailed strategy recommendation for intercurrent events has been provided using a real-life oncology case study. Key recommendations regarding the estimand attributes include that in a seamless phase I/II dose optimization trial, the treatment attribute should start when the subject receives the first dose. We argue that such a framework brings in additional clarity to dose optimization trial objectives and strengthens the understanding of the drug under consideration, which would enable the correct dose to move to phase II of clinical development.

肿瘤学I期剂量递增试验通常旨在找到最大耐受剂量。然而，随着分子靶向治疗和抗体药物偶联物的出现，剂量限制性毒性较少被观察到，从而产生了最佳生物剂量（OBD）的概念，该概念同时考虑了疗效和毒性。ICH E9（R1）指南附录中提出的评估框架通过使临床试验目标更加清晰，并通过在考虑的目标评估与统计分析方法之间提供一致性，加强了不同利益相关者之间的对话。然而，在早期剂量优化研究中实施这一框架缺乏明确性。本文旨在探讨肿瘤剂量优化试验的估计框架，通过效用函数考虑疗效和毒性。此类试验应包括药代动力学数据、毒性数据和疗效数据。在此基础上，介绍了确定最佳剂量/s的分析方法。关注于优化效用函数来估计OBD，总体水平的汇总度量应该只反映用于估计该效用函数的属性。通过一个真实的肿瘤学案例研究，对并发事件提供了详细的策略建议。关于估计属性的关键建议包括，在无缝I/II期剂量优化试验中，治疗属性应在受试者接受第一次剂量时开始。我们认为，这样的框架为剂量优化试验目标带来了额外的清晰度，并加强了对正在考虑的药物的理解，这将使正确的剂量进入临床开发的第二阶段。

{"title":"Estimands for Early-Phase Dose Optimization Trials in Oncology","authors":"Ayon Mukherjee, Jonathan L. Moscovici, Zheng Liu","doi":"10.1002/bimj.70072","DOIUrl":"10.1002/bimj.70072","url":null,"abstract":"Phase I dose escalation trials in oncology generally aim to find the maximum tolerated dose. However, with the advent of molecular-targeted therapies and antibody drug conjugates, dose-limiting toxicities are less frequently observed, giving rise to the concept of optimal biological dose (OBD), which considers both efficacy and toxicity. The estimand framework presented in the addendum of the ICH E9(R1) guidelines strengthens the dialogue between different stakeholders by bringing in greater clarity in the clinical trial objectives and by providing alignment between the targeted estimand under consideration and the statistical analysis methods. However, there is a lack of clarity in implementing this framework in early-phase dose optimization studies. This paper aims to discuss the estimand framework for dose optimization trials in oncology, considering efficacy and toxicity through utility functions. Such trials should include pharmacokinetics data, toxicity data, and efficacy data. Based on these data, the analysis methods used to identify the optimized dose/s are also described. Focusing on optimizing the utility function to estimate the OBD, the population-level summary measure should reflect only the properties used for estimating this utility function. A detailed strategy recommendation for intercurrent events has been provided using a real-life oncology case study. Key recommendations regarding the estimand attributes include that in a seamless phase I/II dose optimization trial, the treatment attribute should start when the subject receives the first dose. We argue that such a framework brings in additional clarity to dose optimization trial objectives and strengthens the understanding of the drug under consideration, which would enable the correct dose to move to phase II of clinical development.","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 5","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70072","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145034873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0