首页 > 最新文献

Biometrical Journal最新文献

英文 中文
Issue Information: Biometrical Journal 1'26 期刊信息:bioometic Journal 1'26
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-09 DOI: 10.1002/bimj.70109
{"title":"Issue Information: Biometrical Journal 1'26","authors":"","doi":"10.1002/bimj.70109","DOIUrl":"https://doi.org/10.1002/bimj.70109","url":null,"abstract":"","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70109","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Dependent Predictive Accuracy Metrics in the Context of Interval Censoring and Competing Risks 区间筛选和竞争风险下的时变预测精度度量。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-05 DOI: 10.1002/bimj.70108
Zhenwei Yang, Dimitris Rizopoulos, Lisa F. Newcomb, Nicole S. Erler

Evaluating the performance of a prediction model is a common task in medical statistics. Standard accuracy metrics require the observation of the true outcomes. This is typically not possible in the setting with time-to-event outcomes due to censoring. Interval censoring, the presence of time-varying covariates, and competing risks present additional challenges in obtaining those accuracy metrics. In this study, we propose two methods to deal with interval censoring in a time-varying competing risk setting: a model-based approach and the inverse probability of censoring weighting (IPCW) approach, focusing on three key time-dependent metrics: area under the receiver-operating characteristic curve, Brier score, and expected predictive cross-entropy. The evaluation is conducted over a medically relevant time interval of interest, [t,Δt)$[t, Delta t)$. The model-based approach includes all subjects in the risk set, using their predicted risks to contribute to the accuracy metrics. In contrast, the IPCW approach only considers the subset of subjects who are known to be event-free or experience the event within the interval of interest. We performed a simulation study to compare the performance of the two approaches with regard to the three metrics. Furthermore, we demonstrated the three metrics using the two approaches on an example prostate cancer surveillance cohort. Risk predictions were generated from a joint model handling the interval-censored cancer progression and the competing event, early treatment, and repeatedly measured biomarkers.

评估预测模型的性能是医学统计中常见的任务。标准精度度量要求观察真实结果。由于审查,这在具有时间到事件结果的设置中通常是不可能的。区间审查、时变协变量的存在以及竞争风险为获得这些精度指标带来了额外的挑战。在这项研究中,我们提出了两种方法来处理时变竞争风险设置中的区间审查:基于模型的方法和审查加权逆概率(IPCW)方法,重点关注三个关键的时间相关指标:接收者操作特征曲线下的面积,Brier评分和预期预测交叉熵。评估是在一个医学相关的时间间隔内进行的,[t, Δ t)$ [t, Δ t)$。基于模型的方法包括风险集中的所有主题,使用他们预测的风险来贡献准确性度量。相比之下,IPCW方法只考虑已知没有事件或在感兴趣的时间间隔内经历事件的受试者子集。我们进行了一项模拟研究,以比较两种方法在三个指标方面的性能。此外,我们在一个前列腺癌监测队列中使用这两种方法证明了这三个指标。风险预测是由一个联合模型生成的,该模型处理间隔审查的癌症进展和竞争事件、早期治疗和反复测量的生物标志物。
{"title":"Time-Dependent Predictive Accuracy Metrics in the Context of Interval Censoring and Competing Risks","authors":"Zhenwei Yang,&nbsp;Dimitris Rizopoulos,&nbsp;Lisa F. Newcomb,&nbsp;Nicole S. Erler","doi":"10.1002/bimj.70108","DOIUrl":"10.1002/bimj.70108","url":null,"abstract":"<p>Evaluating the performance of a prediction model is a common task in medical statistics. Standard accuracy metrics require the observation of the true outcomes. This is typically not possible in the setting with time-to-event outcomes due to censoring. Interval censoring, the presence of time-varying covariates, and competing risks present additional challenges in obtaining those accuracy metrics. In this study, we propose two methods to deal with interval censoring in a time-varying competing risk setting: a model-based approach and the inverse probability of censoring weighting (IPCW) approach, focusing on three key time-dependent metrics: area under the receiver-operating characteristic curve, Brier score, and expected predictive cross-entropy. The evaluation is conducted over a medically relevant time interval of interest, <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mo>[</mo>\u0000 <mi>t</mi>\u0000 <mo>,</mo>\u0000 <mi>Δ</mi>\u0000 <mi>t</mi>\u0000 <mo>)</mo>\u0000 </mrow>\u0000 <annotation>$[t, Delta t)$</annotation>\u0000 </semantics></math>. The model-based approach includes all subjects in the risk set, using their predicted risks to contribute to the accuracy metrics. In contrast, the IPCW approach only considers the subset of subjects who are known to be event-free or experience the event within the interval of interest. We performed a simulation study to compare the performance of the two approaches with regard to the three metrics. Furthermore, we demonstrated the three metrics using the two approaches on an example prostate cancer surveillance cohort. Risk predictions were generated from a joint model handling the interval-censored cancer progression and the competing event, early treatment, and repeatedly measured biomarkers.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12766878/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145901368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Informative Co-Data Learning for High-Dimensional Horseshoe Regression 高维马蹄形回归的信息协同数据学习。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-30 DOI: 10.1002/bimj.70105
Claudio Busatto, Mark A. van de Wiel

High-dimensional data often arise from clinical genomics research to infer relevant predictors of a particular trait. A way to improve the predictive performance is by incorporating information about the predictors obtained from existing from prior knowledge or previous studies. Such information is also referred to as “co-data.” To this aim, we develop a novel Bayesian model for including co-data in a high-dimensional regression framework, termed informative Horseshoe regression (infHS). The proposed approach regresses the prior variances of the regression parameters on the co-data variables, improving variable selection and prediction. We implement both a Gibbs sampler and a Variational approximation algorithm. The former is suited for applications of moderate dimensions which, besides prediction, target posterior inference, whereas the latter's computational efficiency allows handling a very large number of variables. We show the benefits of including co-data through a simulation study. Lastly, we demonstrate that infHS outperforms competing approaches in two genomics applications.

高维数据通常来自临床基因组学研究,用于推断特定性状的相关预测因子。提高预测性能的一种方法是结合从现有的先验知识或以前的研究中获得的预测因子的信息。这样的信息也被称为“共同数据”。为此,我们开发了一种新的贝叶斯模型,用于将协数据包含在高维回归框架中,称为信息马蹄回归(infHS)。该方法对回归参数在协数据变量上的先验方差进行回归,提高了变量的选择和预测能力。我们实现了吉布斯采样器和变分近似算法。前者适用于中等维度的应用,除了预测之外,目标是后验推理,而后者的计算效率允许处理非常大量的变量。我们通过模拟研究展示了包含共同数据的好处。最后,我们证明了infHS在两个基因组学应用中优于竞争方法。
{"title":"Informative Co-Data Learning for High-Dimensional Horseshoe Regression","authors":"Claudio Busatto,&nbsp;Mark A. van de Wiel","doi":"10.1002/bimj.70105","DOIUrl":"10.1002/bimj.70105","url":null,"abstract":"<div>\u0000 \u0000 <p>High-dimensional data often arise from clinical genomics research to infer relevant predictors of a particular trait. A way to improve the predictive performance is by incorporating information about the predictors obtained from existing from prior knowledge or previous studies. Such information is also referred to as “co-data.” To this aim, we develop a novel Bayesian model for including co-data in a high-dimensional regression framework, termed informative Horseshoe regression (infHS). The proposed approach regresses the prior variances of the regression parameters on the co-data variables, improving variable selection and prediction. We implement both a Gibbs sampler and a Variational approximation algorithm. The former is suited for applications of moderate dimensions which, besides prediction, target posterior inference, whereas the latter's computational efficiency allows handling a very large number of variables. We show the benefits of including co-data through a simulation study. Lastly, we demonstrate that infHS outperforms competing approaches in two genomics applications.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145859358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modified Skew Discrete Laplace Regression Models for Integer-Valued Data With Applications to Paired Samples 整数数据的修正偏态离散拉普拉斯回归模型及其在成对样本中的应用。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-29 DOI: 10.1002/bimj.70106
Rodrigo M. R. de Medeiros, Marcelo Bourguignon

Modeling events associated with discrete-valued observations arises in several practical situations. Until now, research on statistical methods for discrete data has primarily focused on modeling count data. Nevertheless, discrete observations that may assume any value in the set of integers Z={,2,1,0,1,2,}$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$ are also found in various contexts. This paper introduces a general parametric modeling framework for the analysis of integer-valued data, with applications to paired discrete observations. The proposed model is based on the modified skew discrete Laplace distribution. Our approach enables a straightforward interpretation of regression coefficients in terms of mean and dispersion, while properly accounting for the discrete nature of the data. We adopt a frequentist approach to perform inference and define diagnostic tools to assess goodness-of-fit. Additionally, we conduct several simulation studies to examine the asymptotic properties of the estimators and test statistics, as well as the distribution of the residuals. We illustrate the usefulness of the proposed model with two real datasets: one from an experimental study conducted in a French penitentiary, and another involving diagnostic imaging to assess kidney function through dynamic and static scintigraphy. Estimation and inference procedures for the new regression model are implemented in the R package sdlrm.

与离散值观测相关的建模事件出现在几种实际情况中。到目前为止,离散数据的统计方法研究主要集中在计数数据的建模上。然而,可以在整数集合Z ={…,-2,-1,0,1,2,…}$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$中假设任意值的离散观测值也可以在各种上下文中找到。本文介绍了一种用于整数值数据分析的通用参数化建模框架,并将其应用于成对离散观测。该模型基于修正的偏态离散拉普拉斯分布。我们的方法可以根据平均值和离散度直接解释回归系数,同时适当地考虑数据的离散性。我们采用频率论的方法来进行推理,并定义诊断工具来评估拟合优度。此外,我们进行了一些模拟研究,以检查估计量和检验统计量的渐近性质,以及残差的分布。我们用两个真实的数据集来说明所提出的模型的实用性:一个来自法国监狱进行的实验研究,另一个涉及通过动态和静态闪烁成像来评估肾功能的诊断成像。新回归模型的估计和推理程序在R包sdlrm中实现。
{"title":"Modified Skew Discrete Laplace Regression Models for Integer-Valued Data With Applications to Paired Samples","authors":"Rodrigo M. R. de Medeiros,&nbsp;Marcelo Bourguignon","doi":"10.1002/bimj.70106","DOIUrl":"10.1002/bimj.70106","url":null,"abstract":"<div>\u0000 \u0000 <p>Modeling events associated with discrete-valued observations arises in several practical situations. Until now, research on statistical methods for discrete data has primarily focused on modeling count data. Nevertheless, discrete observations that may assume any value in the set of integers <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>Z</mi>\u0000 <mo>=</mo>\u0000 <mo>{</mo>\u0000 <mtext>…</mtext>\u0000 <mo>,</mo>\u0000 <mo>−</mo>\u0000 <mn>2</mn>\u0000 <mo>,</mo>\u0000 <mo>−</mo>\u0000 <mn>1</mn>\u0000 <mo>,</mo>\u0000 <mn>0</mn>\u0000 <mo>,</mo>\u0000 <mn>1</mn>\u0000 <mo>,</mo>\u0000 <mn>2</mn>\u0000 <mo>,</mo>\u0000 <mtext>…</mtext>\u0000 <mo>}</mo>\u0000 </mrow>\u0000 <annotation>$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$</annotation>\u0000 </semantics></math> are also found in various contexts. This paper introduces a general parametric modeling framework for the analysis of integer-valued data, with applications to paired discrete observations. The proposed model is based on the modified skew discrete Laplace distribution. Our approach enables a straightforward interpretation of regression coefficients in terms of mean and dispersion, while properly accounting for the discrete nature of the data. We adopt a frequentist approach to perform inference and define diagnostic tools to assess goodness-of-fit. Additionally, we conduct several simulation studies to examine the asymptotic properties of the estimators and test statistics, as well as the distribution of the residuals. We illustrate the usefulness of the proposed model with two real datasets: one from an experimental study conducted in a French penitentiary, and another involving diagnostic imaging to assess kidney function through dynamic and static scintigraphy. Estimation and inference procedures for the new regression model are implemented in the R package <span>sdlrm</span>.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Covariance-Based Penalty Estimator for Model Assessment With Censored Data 基于协方差的删节数据模型评估惩罚估计。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-29 DOI: 10.1002/bimj.70103
Zhuoran Zhang, Daniel L. Gillen

Prediction model selection and assessment are primary objectives of many statistical analyses. Covariance-based penalty estimators provide analytic estimates of the optimism associated with naive training error estimates for multiple classes of prediction models and error assessment rules. While the majority of work on covariance-based penalties has focused on prediction for uncensored data, little attention has been given to time-to-event data. In this article, we consider estimating the optimism for survival prediction models assessed via the Brier score. We first analytically derive an expression of the optimism in a single group scenario with uncensored data based on a reformulation of the optimism. With the same reformulation, we propose an algorithm to estimate the optimism for Cox's proportional hazards regression under a general prediction setting involving covariates and right censoring. We verify the derived theory and demonstrate the applicability of the proposed algorithm via simulation studies. Finally, we illustrate the utility of our new covariance-based penalty estimator through an application predicting time to hemodialysis access failure among patients with end-stage renal disease using data from the United States Renal Data System.

预测模型的选择和评估是许多统计分析的主要目标。基于协方差的惩罚估计提供了与多类预测模型和误差评估规则的朴素训练误差估计相关的乐观度的分析估计。虽然基于协方差的惩罚的大部分工作都集中在对未经审查的数据的预测上,但对事件时间数据的关注很少。在本文中,我们考虑通过Brier评分评估生存预测模型的乐观度。我们首先基于乐观主义的重新表述,解析地推导出在单个群体场景中使用未经审查的数据的乐观主义表达式。通过同样的重新表述,我们提出了一种算法来估计Cox比例风险回归在涉及协变量和右审查的一般预测设置下的乐观度。我们通过仿真研究验证了所推导的理论,并证明了所提出算法的适用性。最后,我们通过使用美国肾脏数据系统的数据预测终末期肾病患者血液透析获得失败的时间,说明了我们新的基于协方差的惩罚估计器的实用性。
{"title":"A Covariance-Based Penalty Estimator for Model Assessment With Censored Data","authors":"Zhuoran Zhang,&nbsp;Daniel L. Gillen","doi":"10.1002/bimj.70103","DOIUrl":"10.1002/bimj.70103","url":null,"abstract":"<div>\u0000 \u0000 <p>Prediction model selection and assessment are primary objectives of many statistical analyses. Covariance-based penalty estimators provide analytic estimates of the optimism associated with naive training error estimates for multiple classes of prediction models and error assessment rules. While the majority of work on covariance-based penalties has focused on prediction for uncensored data, little attention has been given to time-to-event data. In this article, we consider estimating the optimism for survival prediction models assessed via the Brier score. We first analytically derive an expression of the optimism in a single group scenario with uncensored data based on a reformulation of the optimism. With the same reformulation, we propose an algorithm to estimate the optimism for Cox's proportional hazards regression under a general prediction setting involving covariates and right censoring. We verify the derived theory and demonstrate the applicability of the proposed algorithm via simulation studies. Finally, we illustrate the utility of our new covariance-based penalty estimator through an application predicting time to hemodialysis access failure among patients with end-stage renal disease using data from the United States Renal Data System.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dimension Reduction for the Conditional Quantiles of Functional Data With Categorical Predictors 具有分类预测因子的功能数据条件分位数的降维。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-18 DOI: 10.1002/bimj.70102
Shanshan Wang, Eliana Christou, Eftychia Solea, Jun Song

Functional data analysis has received significant attention due to its frequent occurrence in modern applications, such as in the medical field, where electrocardiograms or electroencephalograms can be used for a better understanding of various medical conditions. Due to the infinite-dimensional nature of functional elements, the current work focuses on dimension reduction techniques. This study shifts its focus to modeling the conditional quantiles of functional data, noting that existing works are limited to quantitative predictors. Consequently, we introduce the first approach to partial dimension reduction for the conditional quantiles under the presence of both functional and categorical predictors. We present the proposed algorithm and derive the convergence rates of the estimators. Moreover, we demonstrate the finite sample performance of the method using simulation examples and a real dataset based on functional magnetic resonance imaging.

功能数据分析由于其在现代应用中的频繁出现而受到了极大的关注,例如在医疗领域,心电图或脑电图可以用于更好地了解各种医疗状况。由于功能元素的无限维性质,目前的工作重点是降维技术。本研究将重点转移到对功能数据的条件分位数进行建模,注意到现有的工作仅限于定量预测因子。因此,我们引入了第一种方法来部分降维的条件分位数在功能和分类预测的存在。我们给出了该算法,并推导了估计量的收敛速率。此外,我们使用仿真示例和基于功能磁共振成像的真实数据集来证明该方法的有限样本性能。
{"title":"Dimension Reduction for the Conditional Quantiles of Functional Data With Categorical Predictors","authors":"Shanshan Wang,&nbsp;Eliana Christou,&nbsp;Eftychia Solea,&nbsp;Jun Song","doi":"10.1002/bimj.70102","DOIUrl":"10.1002/bimj.70102","url":null,"abstract":"<div>\u0000 \u0000 <p>Functional data analysis has received significant attention due to its frequent occurrence in modern applications, such as in the medical field, where electrocardiograms or electroencephalograms can be used for a better understanding of various medical conditions. Due to the infinite-dimensional nature of functional elements, the current work focuses on dimension reduction techniques. This study shifts its focus to modeling the conditional quantiles of functional data, noting that existing works are limited to quantitative predictors. Consequently, we introduce the first approach to partial dimension reduction for the conditional quantiles under the presence of both functional and categorical predictors. We present the proposed algorithm and derive the convergence rates of the estimators. Moreover, we demonstrate the finite sample performance of the method using simulation examples and a real dataset based on functional magnetic resonance imaging.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 6","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empirical Likelihood Comparison of Absolute Risks 绝对风险的经验似然比较。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-18 DOI: 10.1002/bimj.70104
Paul Blanche, Frank Eriksson

In the competing risks setting, the t$t$-year absolute risk for a specific time t$t$ (e.g., 2 years), also called the cumulative incidence function at time t$t$, is often interesting to estimate. It is routinely estimated using the nonparametric Aalen–Johansen estimator. This estimator handles right-censored data and has desirable large sample properties, as it is the nonparametric maximum likelihood estimator (NPMLE). Inference for comparing absolute risks, via either a risk difference or a risk ratio, can therefore be done via usual asymptotic normal approximations and the delta method. However, the small sample performances of this approach are not fully satisfactory. Especially, (i) coverage of confidence intervals may be inaccurate and (ii) comparisons made using a risk ratio and a risk difference can lead to inconsistent conclusions, in terms of statistical significance. We, therefore, introduce an alternative empirical likelihood approach. One advantage of this approach is that it always leads to consistent conclusions when comparing absolute risks via a risk ratio and a risk difference, in terms of significance. Simulation results also suggest that small sample inference using this approach can be more accurate. We present the computation of confidence intervals and p-values using this approach and the asymptotic properties that justify them. We provide formulas and algorithms to compute constrained NPMLE, from which empirical likelihood ratios and inference procedures are derived. The novel approach has been implemented in the timeEL package for R, and some of its advantages are demonstrated via reproducible analyses of bone marrow transplant data.

在竞争风险设置中,特定时间t$ t$(例如,2年)的t$ t$年绝对风险,也称为时间t$ t$的累积关联函数,通常是有趣的估计。通常使用非参数aallen - johansen估计量进行估计。该估计器处理右截尾数据,并具有理想的大样本特性,因为它是非参数最大似然估计器(NPMLE)。因此,通过风险差或风险比来比较绝对风险的推理可以通过通常的渐近正态近似和delta方法来完成。然而,这种方法的小样本性能并不完全令人满意。特别是,(i)置信区间的覆盖范围可能不准确,(ii)使用风险比和风险差异进行比较可能导致统计显著性方面的结论不一致。因此,我们引入了另一种经验似然方法。这种方法的一个优点是,当通过风险比和风险差比较绝对风险时,在显著性方面,它总是得出一致的结论。仿真结果也表明,使用该方法进行小样本推理可以获得更准确的结果。我们给出了用这种方法计算置信区间和p值以及证明它们的渐近性质。我们提供了计算约束NPMLE的公式和算法,并由此导出了经验似然比和推理程序。这种新方法已经在R的timeEL软件包中实现,并且通过对骨髓移植数据的可重复分析证明了它的一些优点。
{"title":"Empirical Likelihood Comparison of Absolute Risks","authors":"Paul Blanche,&nbsp;Frank Eriksson","doi":"10.1002/bimj.70104","DOIUrl":"10.1002/bimj.70104","url":null,"abstract":"<div>\u0000 \u0000 <p>In the competing risks setting, the <span></span><math>\u0000 <semantics>\u0000 <mi>t</mi>\u0000 <annotation>$t$</annotation>\u0000 </semantics></math>-year absolute risk for a specific time <span></span><math>\u0000 <semantics>\u0000 <mi>t</mi>\u0000 <annotation>$t$</annotation>\u0000 </semantics></math> (e.g., 2 years), also called the cumulative incidence function at time <span></span><math>\u0000 <semantics>\u0000 <mi>t</mi>\u0000 <annotation>$t$</annotation>\u0000 </semantics></math>, is often interesting to estimate. It is routinely estimated using the nonparametric Aalen–Johansen estimator. This estimator handles right-censored data and has desirable large sample properties, as it is the nonparametric maximum likelihood estimator (NPMLE). Inference for comparing absolute risks, via either a risk difference or a risk ratio, can therefore be done via usual asymptotic normal approximations and the delta method. However, the small sample performances of this approach are not fully satisfactory. Especially, (i) coverage of confidence intervals may be inaccurate and (ii) comparisons made using a risk ratio and a risk difference can lead to inconsistent conclusions, in terms of statistical significance. We, therefore, introduce an alternative empirical likelihood approach. One advantage of this approach is that it always leads to consistent conclusions when comparing absolute risks via a risk ratio and a risk difference, in terms of significance. Simulation results also suggest that small sample inference using this approach can be more accurate. We present the computation of confidence intervals and <i>p</i>-values using this approach and the asymptotic properties that justify them. We provide formulas and algorithms to compute constrained NPMLE, from which empirical likelihood ratios and inference procedures are derived. The novel approach has been implemented in the <span>timeEL</span> package for <span>R</span>, and some of its advantages are demonstrated via reproducible analyses of bone marrow transplant data.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 6","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Nonparametric Sensitivity Analysis of Multiple Test Procedures Under Dependence 依赖条件下多个测试程序的贝叶斯非参数敏感性分析。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-14 DOI: 10.1002/bimj.70101
George Karabatsos

This paper introduces a sensitivity analysis method for multiple testing procedures (MTPs) using marginal p$p$-values. The method is based on the Dirichlet process (DP) prior distribution, specified to support the entire space of MTPs, where each MTP controls either the family-wise error rate (FWER) or the false discovery rate (FDR) under arbitrary dependence between p$p$-values. This DP-MTP sensitivity analysis method provides uncertainty quantification for MTPs, by accounting for uncertainty in the selection of such MTPs and their respective threshold-based decisions regarding which number of smallest p$p$-values are significant discoveries, from a given set of null hypothesis tested, while measuring each p$p$-value's probability of significance over the DP prior predictive distribution of this space of all MTPs, and reducing the possible conservativeness of using only one such MTP for multiple testing. The DP-MTP sensitivity analysis method is illustrated through the analysis of over 28,000 p$p$-values arising from hypothesis tests performed on a 2022 dataset of a representative sample of three million U.S. high school students observed on 239 variables. They include tests which, respectively, relate variables about the disruption caused by school closures during the COVID-19 pandemic, with various mathematical cognition, academic achievement, and student background variables. R software code for the DP-MTP sensitivity analysis method is provided in the Code and Data Supplement (CDS) of this paper.

本文介绍了一种利用边际p$ p$值对多重测试程序(MTPs)进行灵敏度分析的方法。该方法基于Dirichlet过程(DP)先验分布,指定支持MTP的整个空间,其中每个MTP控制在p$ p$ -值之间任意依赖的家庭错误率(FWER)或错误发现率(FDR)。这种DP- mtp敏感性分析方法为MTPs提供了不确定性量化,通过考虑这些MTPs选择的不确定性,以及它们各自基于阈值的决策,即从给定的一组检验的零假设中,哪些最小的p$ p$值是重要的发现,同时测量每个p$ p$值在所有MTPs空间的DP先验预测分布上的显著性概率。并减少仅使用一种这样的MTP进行多次测试的可能的保守性。DP-MTP敏感性分析方法是通过分析超过28,000个p$ p$值来说明的,这些p$ p$值是在2022年的数据集上进行的假设检验中产生的,该数据集包含300万美国高中生的代表性样本,观察到239个变量。其中包括测试,这些测试分别将COVID-19大流行期间学校关闭造成的中断的变量与各种数学认知、学习成绩和学生背景变量联系起来。本文的code and Data Supplement (CDS)提供了DP-MTP灵敏度分析法的R软件代码。
{"title":"Bayesian Nonparametric Sensitivity Analysis of Multiple Test Procedures Under Dependence","authors":"George Karabatsos","doi":"10.1002/bimj.70101","DOIUrl":"10.1002/bimj.70101","url":null,"abstract":"<p>This paper introduces a sensitivity analysis method for multiple testing procedures (MTPs) using marginal <span></span><math>\u0000 <semantics>\u0000 <mi>p</mi>\u0000 <annotation>$p$</annotation>\u0000 </semantics></math>-values. The method is based on the Dirichlet process (DP) prior distribution, specified to support the entire space of MTPs, where each MTP controls either the family-wise error rate (FWER) or the false discovery rate (FDR) under arbitrary dependence between <span></span><math>\u0000 <semantics>\u0000 <mi>p</mi>\u0000 <annotation>$p$</annotation>\u0000 </semantics></math>-values. This DP-MTP sensitivity analysis method provides uncertainty quantification for MTPs, by accounting for uncertainty in the selection of such MTPs and their respective threshold-based decisions regarding which number of smallest <span></span><math>\u0000 <semantics>\u0000 <mi>p</mi>\u0000 <annotation>$p$</annotation>\u0000 </semantics></math>-values are significant discoveries, from a given set of null hypothesis tested, while measuring each <span></span><math>\u0000 <semantics>\u0000 <mi>p</mi>\u0000 <annotation>$p$</annotation>\u0000 </semantics></math>-value's probability of significance over the DP prior predictive distribution of this space of all MTPs, and reducing the possible conservativeness of using only one such MTP for multiple testing. The DP-MTP sensitivity analysis method is illustrated through the analysis of over 28,000 <span></span><math>\u0000 <semantics>\u0000 <mi>p</mi>\u0000 <annotation>$p$</annotation>\u0000 </semantics></math>-values arising from hypothesis tests performed on a 2022 dataset of a representative sample of three million U.S. high school students observed on 239 variables. They include tests which, respectively, relate variables about the disruption caused by school closures during the COVID-19 pandemic, with various mathematical cognition, academic achievement, and student background variables. <span>R</span> software code for the DP-MTP sensitivity analysis method is provided in the Code and Data Supplement (CDS) of this paper.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 6","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12703229/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145758395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Contrast Tests for Count Data: Small Sample Approximations and Their Limitations 计数数据的多重对比检验:小样本近似及其局限性。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-07 DOI: 10.1002/bimj.70098
Mareen Pigorsch, Ludwig A. Hothorn, Frank Konietschke

Although count data are collected in many experiments, their analysis remains challenging, especially in small sample sizes. Until now, linear or generalized linear models in Poisson or Negative Binomial distributional families have often been used. However, these data frequently show signs of over-, underdispersion, or even zero-inflation, casting doubt on these distributional assumptions and leading to inaccurate test results. Since their distributions are usually skewed, data transformations (e.g., log-transformation) are not unusual. This underscores the need for statistical methods not to hinge on specific distributional assumptions. We delve into multiple contrast tests that allow general contrasts (e.g., many-to-one or all-pairs comparisons) to analyze count data in multi-arm trials. The methods vary in their effect and variance estimation, as well as in approximating the joint distribution of multiple test statistics, including frequently used methods such as linear and generalized linear models, and data transformations. An extensive simulation study demonstrates that a resampling version effectively controls the Type I error rate in various situations, while also highlighting the method's limitations, including overly liberal Type I error rates. Some standard methods, which have inflated Type I error rates, further underscore the need for alternative approaches. Real data applications further emphasize the applicability of these methods.

虽然在许多实验中收集了计数数据,但它们的分析仍然具有挑战性,特别是在小样本量下。迄今为止,常使用泊松分布族或负二项分布族中的线性或广义线性模型。然而,这些数据经常显示出过度、分散不足,甚至零膨胀的迹象,这使人们对这些分布假设产生了怀疑,并导致了不准确的测试结果。由于它们的分布通常是倾斜的,所以数据转换(例如,对数转换)并不罕见。这强调了统计方法不依赖于特定的分布假设的必要性。我们深入研究了多重对比检验,允许一般对比(例如,多对一或全对比较)来分析多臂试验中的计数数据。这些方法在其效果和方差估计以及近似多个检验统计量的联合分布方面各不相同,包括常用的方法,如线性和广义线性模型以及数据转换。一项广泛的仿真研究表明,重采样版本在各种情况下有效地控制了I型错误率,同时也突出了该方法的局限性,包括过于自由的I型错误率。一些标准方法使第一类错误率过高,进一步强调需要其他方法。实际数据应用进一步强调了这些方法的适用性。
{"title":"Multiple Contrast Tests for Count Data: Small Sample Approximations and Their Limitations","authors":"Mareen Pigorsch,&nbsp;Ludwig A. Hothorn,&nbsp;Frank Konietschke","doi":"10.1002/bimj.70098","DOIUrl":"10.1002/bimj.70098","url":null,"abstract":"<p>Although count data are collected in many experiments, their analysis remains challenging, especially in small sample sizes. Until now, linear or generalized linear models in Poisson or Negative Binomial distributional families have often been used. However, these data frequently show signs of over-, underdispersion, or even zero-inflation, casting doubt on these distributional assumptions and leading to inaccurate test results. Since their distributions are usually skewed, data transformations (e.g., log-transformation) are not unusual. This underscores the need for statistical methods not to hinge on specific distributional assumptions. We delve into multiple contrast tests that allow general contrasts (e.g., many-to-one or all-pairs comparisons) to analyze count data in multi-arm trials. The methods vary in their effect and variance estimation, as well as in approximating the joint distribution of multiple test statistics, including frequently used methods such as linear and generalized linear models, and data transformations. An extensive simulation study demonstrates that a resampling version effectively controls the Type I error rate in various situations, while also highlighting the method's limitations, including overly liberal Type I error rates. Some standard methods, which have inflated Type I error rates, further underscore the need for alternative approaches. Real data applications further emphasize the applicability of these methods.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 6","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12683215/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145702878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Censoring and Competing Risks: Avoidable and Non-Avoidable Events. Comment to the Article “Hazards constitute key quantities for analysing, interpreting and understanding time-to-event data” by Beyersmann, Schmoor, and Schumacher 审查和竞争风险:可避免和不可避免的事件。对Beyersmann、Schmoor和Schumacher的文章“危险构成分析、解释和理解事件时间数据的关键数量”的评论。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-11-29 DOI: 10.1002/bimj.70099
Per Kragh Andersen

It is argued that even though censoring and competing events, technically, play similar roles when estimating hazard functions, they are conceptually different and should be treated as such when interpreting time-to-event data.

有人认为,尽管从技术上讲,审查事件和竞争事件在估计风险函数时起着相似的作用,但它们在概念上是不同的,在解释事件时间数据时应这样对待。
{"title":"Censoring and Competing Risks: Avoidable and Non-Avoidable Events. Comment to the Article “Hazards constitute key quantities for analysing, interpreting and understanding time-to-event data” by Beyersmann, Schmoor, and Schumacher","authors":"Per Kragh Andersen","doi":"10.1002/bimj.70099","DOIUrl":"10.1002/bimj.70099","url":null,"abstract":"<div>\u0000 \u0000 <p>It is argued that even though censoring and competing events, technically, play similar roles when estimating hazard functions, they are conceptually different and should be treated as such when interpreting time-to-event data.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 6","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145643078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrical Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1