首页 > 最新文献

Biometrical Journal最新文献

英文 中文
Deriving Duration Time From Occupancy Data—A Case Study in the Length of Stay in Intensive Care Units for COVID-19 Patients 从占用数据推导持续时间——以COVID-19患者重症监护病房住院时间为例
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-13 DOI: 10.1002/bimj.70113
Martje Rave, Göran Kauermann

This paper focuses on drawing information on underlying processes, which are not directly observed in the data. In particular, we work with data in which only the total count of units in a system at a given time point is observed, but the underlying process of inflows, length of stay, and outflows is not. The particular data example looked at in this paper is the occupancy of intensive care units (ICUs) during the COVID-19 pandemic, where the aggregated numbers of occupied beds in ICUs on the district level (‘Landkreis') are recorded, but not the number of incoming and outgoing patients. The Skellam distribution allows us to infer the number of incoming and outgoing patients from the occupancy in the ICUs. This paper goes a step beyond and approaches the question of whether we can also estimate the average length of stay of ICU patients. Hence, the task is to derive not only the number of incoming and outgoing units from a total net count but also to gain information on the duration time of patients on ICUs. We make use of a stochastic expectation–maximization algorithm and additionally include exogenous information which are assumed to explain the intensity of inflow.

本文的重点是绘制底层过程的信息,这些信息不是直接在数据中观察到的。特别是,我们处理的数据中,只有在给定时间点观察到系统中单位的总数,而没有观察到流入、停留时间和流出的潜在过程。本文研究的特定数据示例是COVID-19大流行期间重症监护病房(icu)的占用情况,其中记录了地区一级icu(“Landkreis”)占用床位的总数,但没有记录入院和出院患者的数量。Skellam分布允许我们从icu的占用情况推断出传入和传出的患者数量。本文在此基础上进一步探讨了我们是否也可以估计ICU患者的平均住院时间的问题。因此,任务不仅是从总净计数中导出传入和传出单位的数量,而且还要获得icu患者持续时间的信息。我们使用随机期望最大化算法,并额外包含外生信息,这些信息被认为可以解释流入的强度。
{"title":"Deriving Duration Time From Occupancy Data—A Case Study in the Length of Stay in Intensive Care Units for COVID-19 Patients","authors":"Martje Rave,&nbsp;Göran Kauermann","doi":"10.1002/bimj.70113","DOIUrl":"10.1002/bimj.70113","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper focuses on drawing information on underlying processes, which are not directly observed in the data. In particular, we work with data in which only the total count of units in a system at a given time point is observed, but the underlying process of inflows, length of stay, and outflows is not. The particular data example looked at in this paper is the occupancy of intensive care units (ICUs) during the COVID-19 pandemic, where the aggregated numbers of occupied beds in ICUs on the district level (‘Landkreis') are recorded, but not the number of incoming and outgoing patients. The Skellam distribution allows us to infer the number of incoming and outgoing patients from the occupancy in the ICUs. This paper goes a step beyond and approaches the question of whether we can also estimate the average length of stay of ICU patients. Hence, the task is to derive not only the number of incoming and outgoing units from a total net count but also to gain information on the duration time of patients on ICUs. We make use of a stochastic expectation–maximization algorithm and additionally include exogenous information which are assumed to explain the intensity of inflow.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146196055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Signpost Testing to Navigate the Parameter Space of the Gaussian Graphical Model With High-Dimensional Data 高维数据高斯图形模型参数空间导航的路标测试。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-12 DOI: 10.1002/bimj.70115
Kai Ruan, Mark A. van de Wiel, Wessel N. van Wieringen

We evaluate the relevance of external quantitative information on the parameter of a Gaussian graphical model from high-dimensional data. This information comes in the form of a parameter value available from a related knowledge domain or population. We contrast the external information to ‘null’ information, i.e., an internally accepted parameter value. The direction from a null to this externally provided parameter value is dubbed the signpost. The signpost test evaluates whether to follow the signpost in the search of the true parameter value. We present various test statistics to measure the informativeness of the signpost and ways to obtain their distribution under the null hypothesis of non-informativeness. By simulation, we investigate the power and other properties of the various signpost tests, and compare them to the likelihood ratio test. Finally, we employ the signpost test to illustrate how the learning of the Gaussian graphical model of a low-prevalence breast cancer subtype benefits from external knowledge obtained from data of a more prevalent and related but fundamentally different subtype.

我们从高维数据中评估高斯图形模型参数的外部定量信息的相关性。该信息以相关知识领域或群体中可用的参数值的形式出现。我们将外部信息与“空”信息(即内部接受的参数值)进行对比。从null到这个外部提供的参数值的方向称为路标。signpost test用于判断在搜索true参数值时是否跟随该路标。我们给出了各种检验统计量来衡量路标的信息性,以及在非信息性零假设下获得其分布的方法。通过模拟,我们研究了各种路标测试的功率和其他特性,并将它们与似然比测试进行了比较。最后,我们采用路标测试来说明低患病率乳腺癌亚型的高斯图形模型的学习如何受益于从更普遍和相关但根本不同的亚型数据中获得的外部知识。
{"title":"Signpost Testing to Navigate the Parameter Space of the Gaussian Graphical Model With High-Dimensional Data","authors":"Kai Ruan,&nbsp;Mark A. van de Wiel,&nbsp;Wessel N. van Wieringen","doi":"10.1002/bimj.70115","DOIUrl":"10.1002/bimj.70115","url":null,"abstract":"<p>We evaluate the relevance of external quantitative information on the parameter of a Gaussian graphical model from high-dimensional data. This information comes in the form of a parameter value available from a related knowledge domain or population. We contrast the external information to ‘null’ information, i.e., an internally accepted parameter value. The direction from a null to this externally provided parameter value is dubbed the <i>signpost</i>. The signpost test evaluates whether to follow the signpost in the search of the true parameter value. We present various test statistics to measure the informativeness of the signpost and ways to obtain their distribution under the null hypothesis of non-informativeness. By simulation, we investigate the power and other properties of the various signpost tests, and compare them to the likelihood ratio test. Finally, we employ the signpost test to illustrate how the learning of the Gaussian graphical model of a low-prevalence breast cancer subtype benefits from external knowledge obtained from data of a more prevalent and related but fundamentally different subtype.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12895234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146168081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Dependent Mediators in Survival Analysis: Graphical Representation of Causal Assumptions 生存分析中的时间依赖中介:因果假设的图形表示。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-29 DOI: 10.1002/bimj.70110
Søren Wengel Mogensen, Odd O. Aalen, Susanne Strohmaier

We study time-dependent mediators in survival analysis using a treatment separation approach due to Didelez [Lifetime Data Analysis 25, no. 4: 593–610] and based on earlier work by Robins and Richardson [Causality and Psychopathology: Finding the Determinants of Disorders and Their Cures, 103–158. Oxford University Press]. This approach avoids nested counterfactuals and cross-world assumptions which are otherwise common in mediation analysis. The causal model of treatment, mediators, covariates, confounders, and outcome is represented by directed acyclic graphs (DAGs). However, the DAGs tend to be very complex when we have measurements at many time points. We therefore suggest using so-called rolled graphs in which a node represents an entire coordinate process instead of a single random variable, leading us to far simpler graphical representations. The rolled graphs are not necessarily acyclic; they can be analyzed by δ$delta$-separation which is the appropriate graphical separation criterion in this class of graphs and analogous to d$d$-separation. In particular, δ$delta$-separation is a graphical tool for evaluating if the conditions of the mediation analysis are met, or if unmeasured confounders influence the estimated effects. We also state a mediational g-formula. This is similar to the approach in Vansteelandt et al. [Statistics in Medicine 38, no. 24: 4828–4840], although that paper has a different conceptual basis. Finally, we apply this framework to a statistical model based on a Cox model with an added treatment effect.

由于Didelez [Lifetime Data analysis 25, no. 5],我们使用治疗分离方法研究了生存分析中的时间依赖性介质。[4] 593-610]并基于罗宾斯和理查森的早期工作[因果关系和精神病理学:发现疾病的决定因素及其治疗,103-158]。牛津大学出版社]。这种方法避免了嵌套的反事实和跨世界假设,这些在中介分析中很常见。治疗、中介、协变量、混杂因素和结果的因果模型由有向无环图(dag)表示。然而,当我们在许多时间点进行测量时,dag往往非常复杂。因此,我们建议使用所谓的滚动图,其中一个节点代表整个坐标过程,而不是单个随机变量,从而使我们的图形表示更简单。卷起的图不一定是无循环的;它们可以用δ $ δ $分离来分析,这是这类图中适当的图形分离准则,类似于d$ d$分离。特别地,δ $delta$ -分离是一种图形工具,用于评估中介分析的条件是否满足,或者未测量的混杂因素是否影响估计的效果。我们也给出了一个中介的g公式。这与Vansteelandt等人的方法相似。[医学统计38,no. 6][24: 4828-4840],尽管那篇论文有不同的概念基础。最后,我们将这个框架应用到一个基于Cox模型的统计模型中,并增加了治疗效果。
{"title":"Time-Dependent Mediators in Survival Analysis: Graphical Representation of Causal Assumptions","authors":"Søren Wengel Mogensen,&nbsp;Odd O. Aalen,&nbsp;Susanne Strohmaier","doi":"10.1002/bimj.70110","DOIUrl":"10.1002/bimj.70110","url":null,"abstract":"<div>\u0000 \u0000 <p>We study time-dependent mediators in survival analysis using a treatment separation approach due to Didelez [<i>Lifetime Data Analysis</i> 25, no. 4: 593–610] and based on earlier work by Robins and Richardson [<i>Causality and Psychopathology: Finding the Determinants of Disorders and Their Cures</i>, 103–158. Oxford University Press]. This approach avoids nested counterfactuals and cross-world assumptions which are otherwise common in mediation analysis. The causal model of treatment, mediators, covariates, confounders, and outcome is represented by directed acyclic graphs (DAGs). However, the DAGs tend to be very complex when we have measurements at many time points. We therefore suggest using so-called <i>rolled graphs</i> in which a node represents an entire coordinate process instead of a single random variable, leading us to far simpler graphical representations. The rolled graphs are not necessarily acyclic; they can be analyzed by <span></span><math>\u0000 <semantics>\u0000 <mi>δ</mi>\u0000 <annotation>$delta$</annotation>\u0000 </semantics></math>-separation which is the appropriate graphical separation criterion in this class of graphs and analogous to <span></span><math>\u0000 <semantics>\u0000 <mi>d</mi>\u0000 <annotation>$d$</annotation>\u0000 </semantics></math>-separation. In particular, <span></span><math>\u0000 <semantics>\u0000 <mi>δ</mi>\u0000 <annotation>$delta$</annotation>\u0000 </semantics></math>-separation is a graphical tool for evaluating if the conditions of the mediation analysis are met, or if unmeasured confounders influence the estimated effects. We also state a mediational g-formula. This is similar to the approach in Vansteelandt et al. [<i>Statistics in Medicine</i> 38, no. 24: 4828–4840], although that paper has a different conceptual basis. Finally, we apply this framework to a statistical model based on a Cox model with an added treatment effect.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Challenge of Time-to-Event Analysis for Multiple Events: A Guided Tour From Time-to-First-Event to Recurrent Time-to-Event Analysis 多事件时间到事件分析的挑战:从时间到第一事件到重复时间到事件分析的导读。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-28 DOI: 10.1002/bimj.70107
Sandra Schmeller, Alexandra Erdmann, Jan Beyersmann, Christiane Angermann, Ann-Kathrin Ozga

Clinical trials often compare a treatment to a control group concerning multiple possible combined time-to-event endpoints like hospital-free survival. Thereby, the first endpoint may occur more than once (“recurrent”), whereas the second endpoint is absorbing. Inclusion of all observed events in the analysis can increase the power and provide a more complete picture of the disease but it needs more sophisticated methodology. We give a stepwise guidance on how to extend the simple time-to-first event model to complex multistate methodology, where multiple events are incorporated. We thereby consider non- and semiparametric methods and show how they are related. Special attention is given to the prerequisites of the models, for example, the Markov property, and their interpretation. Due to novel results in non-Markov models, the summary measurements: state occupation probability, mean number of hospitalizations, and average length of stay allow an easy interpretation of a treatment effect in non-Markov models if the censoring is random. Partly conditional transition rates can be estimated instead of hazards. We investigate the difference between partly conditional transition rates and hazards and the impact of the random censoring condition in a simulation study. Furthermore, the simulation study considers the sensitivity of a Markov test. Different estimators are introduced, and their use is explained based on data from the randomized controlled Interdisciplinary Network Heart Failure trial, which investigated the effects of a nurse-coordinated disease management program. The aim is to give an overview of existing methods, present the assumptions, and elaborate on the differences in interpretation.

临床试验经常将治疗与对照组进行比较,涉及多个可能的联合时间到事件终点,如无医院生存期。因此,第一个终点可能出现不止一次(“反复”),而第二个终点是吸收性的。在分析中纳入所有观察到的事件可以增加效力并提供更完整的疾病图像,但需要更复杂的方法。我们给出了如何将简单的时间到第一个事件模型扩展到复杂的多状态方法的逐步指导,其中包含多个事件。因此,我们考虑非参数和半参数方法,并说明它们是如何相关的。特别注意模型的先决条件,例如,马尔可夫性质及其解释。由于非马尔可夫模型中的新结果,如果审查是随机的,则总结测量:状态职业概率,平均住院次数和平均住院时间允许在非马尔可夫模型中轻松解释治疗效果。可以估计部分有条件的转移率,而不是危害。在模拟研究中,我们研究了部分条件转移率和危险之间的差异以及随机审查条件的影响。此外,仿真研究还考虑了马尔可夫检验的敏感性。介绍了不同的估计器,并根据随机对照跨学科网络心力衰竭试验的数据解释了它们的使用,该试验调查了护士协调疾病管理计划的效果。目的是概述现有方法,提出假设,并详细说明解释的差异。
{"title":"The Challenge of Time-to-Event Analysis for Multiple Events: A Guided Tour From Time-to-First-Event to Recurrent Time-to-Event Analysis","authors":"Sandra Schmeller,&nbsp;Alexandra Erdmann,&nbsp;Jan Beyersmann,&nbsp;Christiane Angermann,&nbsp;Ann-Kathrin Ozga","doi":"10.1002/bimj.70107","DOIUrl":"10.1002/bimj.70107","url":null,"abstract":"<p>Clinical trials often compare a treatment to a control group concerning multiple possible combined time-to-event endpoints like hospital-free survival. Thereby, the first endpoint may occur more than once (“recurrent”), whereas the second endpoint is absorbing. Inclusion of all observed events in the analysis can increase the power and provide a more complete picture of the disease but it needs more sophisticated methodology. We give a stepwise guidance on how to extend the simple time-to-first event model to complex multistate methodology, where multiple events are incorporated. We thereby consider non- and semiparametric methods and show how they are related. Special attention is given to the prerequisites of the models, for example, the Markov property, and their interpretation. Due to novel results in non-Markov models, the summary measurements: state occupation probability, mean number of hospitalizations, and average length of stay allow an easy interpretation of a treatment effect in non-Markov models if the censoring is random. Partly conditional transition rates can be estimated instead of hazards. We investigate the difference between partly conditional transition rates and hazards and the impact of the random censoring condition in a simulation study. Furthermore, the simulation study considers the sensitivity of a Markov test. Different estimators are introduced, and their use is explained based on data from the randomized controlled Interdisciplinary Network Heart Failure trial, which investigated the effects of a nurse-coordinated disease management program. The aim is to give an overview of existing methods, present the assumptions, and elaborate on the differences in interpretation.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12848661/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146068669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Multiple Outcomes in Contaminated Trials Reinforced With Validation Data 用验证数据加强污染试验的多结果分析。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-28 DOI: 10.1002/bimj.70111
Solomon W. Harrar, Zi Ye

This paper is concerned with estimation and testing for treatment effects with multivariate outcomes. It primarily focuses on the situation where imperfect diagnostic tools are used to classify subjects into different groups. Oftentimes, there are more expensive and/or invasive diagnostic tools to accurately determine the subjects' status or conditions, yielding partially validated data on a smaller number of subjects. We propose moment-based approaches for estimating and testing treatment effects. We compare our methods with maximum likelihood approach using the EM algorithm, which requires strong assumptions and bears computational burden, and with traditional methods, which ignore the diagnostic tool's imperfection. The proposed methods show advantages in terms of coverage probability, computations efficiency, and robustness. The application of the methods is illustrated with gene-expression data from the Genes-environments & Admixture in Latino Americans (GALA) II study of asthma in Hispanic/Latino children.

本文关注的是多变量结果治疗效果的估计和检验。它主要关注的是使用不完善的诊断工具将受试者分为不同组的情况。通常,有更昂贵和/或侵入性的诊断工具来准确地确定受试者的状态或条件,在少数受试者中产生部分有效的数据。我们提出了基于矩的方法来估计和测试治疗效果。我们将我们的方法与使用EM算法的最大似然方法进行了比较,该方法需要很强的假设和计算负担,而传统方法忽略了诊断工具的不完善性。所提出的方法在覆盖概率、计算效率和鲁棒性方面具有优势。这些方法的应用通过拉丁美洲儿童哮喘的基因-环境和混合(GALA) II研究的基因表达数据来说明。
{"title":"Analysis of Multiple Outcomes in Contaminated Trials Reinforced With Validation Data","authors":"Solomon W. Harrar,&nbsp;Zi Ye","doi":"10.1002/bimj.70111","DOIUrl":"10.1002/bimj.70111","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper is concerned with estimation and testing for treatment effects with multivariate outcomes. It primarily focuses on the situation where imperfect diagnostic tools are used to classify subjects into different groups. Oftentimes, there are more expensive and/or invasive diagnostic tools to accurately determine the subjects' status or conditions, yielding partially validated data on a smaller number of subjects. We propose moment-based approaches for estimating and testing treatment effects. We compare our methods with maximum likelihood approach using the EM algorithm, which requires strong assumptions and bears computational burden, and with traditional methods, which ignore the diagnostic tool's imperfection. The proposed methods show advantages in terms of coverage probability, computations efficiency, and robustness. The application of the methods is illustrated with gene-expression data from the Genes-environments &amp; Admixture in Latino Americans (GALA) II study of asthma in Hispanic/Latino children.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146068672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Information: Biometrical Journal 1'26 期刊信息:bioometic Journal 1'26
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-09 DOI: 10.1002/bimj.70109
{"title":"Issue Information: Biometrical Journal 1'26","authors":"","doi":"10.1002/bimj.70109","DOIUrl":"https://doi.org/10.1002/bimj.70109","url":null,"abstract":"","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70109","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Dependent Predictive Accuracy Metrics in the Context of Interval Censoring and Competing Risks 区间筛选和竞争风险下的时变预测精度度量。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-05 DOI: 10.1002/bimj.70108
Zhenwei Yang, Dimitris Rizopoulos, Lisa F. Newcomb, Nicole S. Erler

Evaluating the performance of a prediction model is a common task in medical statistics. Standard accuracy metrics require the observation of the true outcomes. This is typically not possible in the setting with time-to-event outcomes due to censoring. Interval censoring, the presence of time-varying covariates, and competing risks present additional challenges in obtaining those accuracy metrics. In this study, we propose two methods to deal with interval censoring in a time-varying competing risk setting: a model-based approach and the inverse probability of censoring weighting (IPCW) approach, focusing on three key time-dependent metrics: area under the receiver-operating characteristic curve, Brier score, and expected predictive cross-entropy. The evaluation is conducted over a medically relevant time interval of interest, [t,Δt)$[t, Delta t)$. The model-based approach includes all subjects in the risk set, using their predicted risks to contribute to the accuracy metrics. In contrast, the IPCW approach only considers the subset of subjects who are known to be event-free or experience the event within the interval of interest. We performed a simulation study to compare the performance of the two approaches with regard to the three metrics. Furthermore, we demonstrated the three metrics using the two approaches on an example prostate cancer surveillance cohort. Risk predictions were generated from a joint model handling the interval-censored cancer progression and the competing event, early treatment, and repeatedly measured biomarkers.

评估预测模型的性能是医学统计中常见的任务。标准精度度量要求观察真实结果。由于审查,这在具有时间到事件结果的设置中通常是不可能的。区间审查、时变协变量的存在以及竞争风险为获得这些精度指标带来了额外的挑战。在这项研究中,我们提出了两种方法来处理时变竞争风险设置中的区间审查:基于模型的方法和审查加权逆概率(IPCW)方法,重点关注三个关键的时间相关指标:接收者操作特征曲线下的面积,Brier评分和预期预测交叉熵。评估是在一个医学相关的时间间隔内进行的,[t, Δ t)$ [t, Δ t)$。基于模型的方法包括风险集中的所有主题,使用他们预测的风险来贡献准确性度量。相比之下,IPCW方法只考虑已知没有事件或在感兴趣的时间间隔内经历事件的受试者子集。我们进行了一项模拟研究,以比较两种方法在三个指标方面的性能。此外,我们在一个前列腺癌监测队列中使用这两种方法证明了这三个指标。风险预测是由一个联合模型生成的,该模型处理间隔审查的癌症进展和竞争事件、早期治疗和反复测量的生物标志物。
{"title":"Time-Dependent Predictive Accuracy Metrics in the Context of Interval Censoring and Competing Risks","authors":"Zhenwei Yang,&nbsp;Dimitris Rizopoulos,&nbsp;Lisa F. Newcomb,&nbsp;Nicole S. Erler","doi":"10.1002/bimj.70108","DOIUrl":"10.1002/bimj.70108","url":null,"abstract":"<p>Evaluating the performance of a prediction model is a common task in medical statistics. Standard accuracy metrics require the observation of the true outcomes. This is typically not possible in the setting with time-to-event outcomes due to censoring. Interval censoring, the presence of time-varying covariates, and competing risks present additional challenges in obtaining those accuracy metrics. In this study, we propose two methods to deal with interval censoring in a time-varying competing risk setting: a model-based approach and the inverse probability of censoring weighting (IPCW) approach, focusing on three key time-dependent metrics: area under the receiver-operating characteristic curve, Brier score, and expected predictive cross-entropy. The evaluation is conducted over a medically relevant time interval of interest, <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mo>[</mo>\u0000 <mi>t</mi>\u0000 <mo>,</mo>\u0000 <mi>Δ</mi>\u0000 <mi>t</mi>\u0000 <mo>)</mo>\u0000 </mrow>\u0000 <annotation>$[t, Delta t)$</annotation>\u0000 </semantics></math>. The model-based approach includes all subjects in the risk set, using their predicted risks to contribute to the accuracy metrics. In contrast, the IPCW approach only considers the subset of subjects who are known to be event-free or experience the event within the interval of interest. We performed a simulation study to compare the performance of the two approaches with regard to the three metrics. Furthermore, we demonstrated the three metrics using the two approaches on an example prostate cancer surveillance cohort. Risk predictions were generated from a joint model handling the interval-censored cancer progression and the competing event, early treatment, and repeatedly measured biomarkers.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12766878/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145901368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Informative Co-Data Learning for High-Dimensional Horseshoe Regression 高维马蹄形回归的信息协同数据学习。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-30 DOI: 10.1002/bimj.70105
Claudio Busatto, Mark A. van de Wiel

High-dimensional data often arise from clinical genomics research to infer relevant predictors of a particular trait. A way to improve the predictive performance is by incorporating information about the predictors obtained from existing from prior knowledge or previous studies. Such information is also referred to as “co-data.” To this aim, we develop a novel Bayesian model for including co-data in a high-dimensional regression framework, termed informative Horseshoe regression (infHS). The proposed approach regresses the prior variances of the regression parameters on the co-data variables, improving variable selection and prediction. We implement both a Gibbs sampler and a Variational approximation algorithm. The former is suited for applications of moderate dimensions which, besides prediction, target posterior inference, whereas the latter's computational efficiency allows handling a very large number of variables. We show the benefits of including co-data through a simulation study. Lastly, we demonstrate that infHS outperforms competing approaches in two genomics applications.

高维数据通常来自临床基因组学研究,用于推断特定性状的相关预测因子。提高预测性能的一种方法是结合从现有的先验知识或以前的研究中获得的预测因子的信息。这样的信息也被称为“共同数据”。为此,我们开发了一种新的贝叶斯模型,用于将协数据包含在高维回归框架中,称为信息马蹄回归(infHS)。该方法对回归参数在协数据变量上的先验方差进行回归,提高了变量的选择和预测能力。我们实现了吉布斯采样器和变分近似算法。前者适用于中等维度的应用,除了预测之外,目标是后验推理,而后者的计算效率允许处理非常大量的变量。我们通过模拟研究展示了包含共同数据的好处。最后,我们证明了infHS在两个基因组学应用中优于竞争方法。
{"title":"Informative Co-Data Learning for High-Dimensional Horseshoe Regression","authors":"Claudio Busatto,&nbsp;Mark A. van de Wiel","doi":"10.1002/bimj.70105","DOIUrl":"10.1002/bimj.70105","url":null,"abstract":"<div>\u0000 \u0000 <p>High-dimensional data often arise from clinical genomics research to infer relevant predictors of a particular trait. A way to improve the predictive performance is by incorporating information about the predictors obtained from existing from prior knowledge or previous studies. Such information is also referred to as “co-data.” To this aim, we develop a novel Bayesian model for including co-data in a high-dimensional regression framework, termed informative Horseshoe regression (infHS). The proposed approach regresses the prior variances of the regression parameters on the co-data variables, improving variable selection and prediction. We implement both a Gibbs sampler and a Variational approximation algorithm. The former is suited for applications of moderate dimensions which, besides prediction, target posterior inference, whereas the latter's computational efficiency allows handling a very large number of variables. We show the benefits of including co-data through a simulation study. Lastly, we demonstrate that infHS outperforms competing approaches in two genomics applications.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145859358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modified Skew Discrete Laplace Regression Models for Integer-Valued Data With Applications to Paired Samples 整数数据的修正偏态离散拉普拉斯回归模型及其在成对样本中的应用。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-29 DOI: 10.1002/bimj.70106
Rodrigo M. R. de Medeiros, Marcelo Bourguignon

Modeling events associated with discrete-valued observations arises in several practical situations. Until now, research on statistical methods for discrete data has primarily focused on modeling count data. Nevertheless, discrete observations that may assume any value in the set of integers Z={,2,1,0,1,2,}$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$ are also found in various contexts. This paper introduces a general parametric modeling framework for the analysis of integer-valued data, with applications to paired discrete observations. The proposed model is based on the modified skew discrete Laplace distribution. Our approach enables a straightforward interpretation of regression coefficients in terms of mean and dispersion, while properly accounting for the discrete nature of the data. We adopt a frequentist approach to perform inference and define diagnostic tools to assess goodness-of-fit. Additionally, we conduct several simulation studies to examine the asymptotic properties of the estimators and test statistics, as well as the distribution of the residuals. We illustrate the usefulness of the proposed model with two real datasets: one from an experimental study conducted in a French penitentiary, and another involving diagnostic imaging to assess kidney function through dynamic and static scintigraphy. Estimation and inference procedures for the new regression model are implemented in the R package sdlrm.

与离散值观测相关的建模事件出现在几种实际情况中。到目前为止,离散数据的统计方法研究主要集中在计数数据的建模上。然而,可以在整数集合Z ={…,-2,-1,0,1,2,…}$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$中假设任意值的离散观测值也可以在各种上下文中找到。本文介绍了一种用于整数值数据分析的通用参数化建模框架,并将其应用于成对离散观测。该模型基于修正的偏态离散拉普拉斯分布。我们的方法可以根据平均值和离散度直接解释回归系数,同时适当地考虑数据的离散性。我们采用频率论的方法来进行推理,并定义诊断工具来评估拟合优度。此外,我们进行了一些模拟研究,以检查估计量和检验统计量的渐近性质,以及残差的分布。我们用两个真实的数据集来说明所提出的模型的实用性:一个来自法国监狱进行的实验研究,另一个涉及通过动态和静态闪烁成像来评估肾功能的诊断成像。新回归模型的估计和推理程序在R包sdlrm中实现。
{"title":"Modified Skew Discrete Laplace Regression Models for Integer-Valued Data With Applications to Paired Samples","authors":"Rodrigo M. R. de Medeiros,&nbsp;Marcelo Bourguignon","doi":"10.1002/bimj.70106","DOIUrl":"10.1002/bimj.70106","url":null,"abstract":"<div>\u0000 \u0000 <p>Modeling events associated with discrete-valued observations arises in several practical situations. Until now, research on statistical methods for discrete data has primarily focused on modeling count data. Nevertheless, discrete observations that may assume any value in the set of integers <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>Z</mi>\u0000 <mo>=</mo>\u0000 <mo>{</mo>\u0000 <mtext>…</mtext>\u0000 <mo>,</mo>\u0000 <mo>−</mo>\u0000 <mn>2</mn>\u0000 <mo>,</mo>\u0000 <mo>−</mo>\u0000 <mn>1</mn>\u0000 <mo>,</mo>\u0000 <mn>0</mn>\u0000 <mo>,</mo>\u0000 <mn>1</mn>\u0000 <mo>,</mo>\u0000 <mn>2</mn>\u0000 <mo>,</mo>\u0000 <mtext>…</mtext>\u0000 <mo>}</mo>\u0000 </mrow>\u0000 <annotation>$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$</annotation>\u0000 </semantics></math> are also found in various contexts. This paper introduces a general parametric modeling framework for the analysis of integer-valued data, with applications to paired discrete observations. The proposed model is based on the modified skew discrete Laplace distribution. Our approach enables a straightforward interpretation of regression coefficients in terms of mean and dispersion, while properly accounting for the discrete nature of the data. We adopt a frequentist approach to perform inference and define diagnostic tools to assess goodness-of-fit. Additionally, we conduct several simulation studies to examine the asymptotic properties of the estimators and test statistics, as well as the distribution of the residuals. We illustrate the usefulness of the proposed model with two real datasets: one from an experimental study conducted in a French penitentiary, and another involving diagnostic imaging to assess kidney function through dynamic and static scintigraphy. Estimation and inference procedures for the new regression model are implemented in the R package <span>sdlrm</span>.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Covariance-Based Penalty Estimator for Model Assessment With Censored Data 基于协方差的删节数据模型评估惩罚估计。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-29 DOI: 10.1002/bimj.70103
Zhuoran Zhang, Daniel L. Gillen

Prediction model selection and assessment are primary objectives of many statistical analyses. Covariance-based penalty estimators provide analytic estimates of the optimism associated with naive training error estimates for multiple classes of prediction models and error assessment rules. While the majority of work on covariance-based penalties has focused on prediction for uncensored data, little attention has been given to time-to-event data. In this article, we consider estimating the optimism for survival prediction models assessed via the Brier score. We first analytically derive an expression of the optimism in a single group scenario with uncensored data based on a reformulation of the optimism. With the same reformulation, we propose an algorithm to estimate the optimism for Cox's proportional hazards regression under a general prediction setting involving covariates and right censoring. We verify the derived theory and demonstrate the applicability of the proposed algorithm via simulation studies. Finally, we illustrate the utility of our new covariance-based penalty estimator through an application predicting time to hemodialysis access failure among patients with end-stage renal disease using data from the United States Renal Data System.

预测模型的选择和评估是许多统计分析的主要目标。基于协方差的惩罚估计提供了与多类预测模型和误差评估规则的朴素训练误差估计相关的乐观度的分析估计。虽然基于协方差的惩罚的大部分工作都集中在对未经审查的数据的预测上,但对事件时间数据的关注很少。在本文中,我们考虑通过Brier评分评估生存预测模型的乐观度。我们首先基于乐观主义的重新表述,解析地推导出在单个群体场景中使用未经审查的数据的乐观主义表达式。通过同样的重新表述,我们提出了一种算法来估计Cox比例风险回归在涉及协变量和右审查的一般预测设置下的乐观度。我们通过仿真研究验证了所推导的理论,并证明了所提出算法的适用性。最后,我们通过使用美国肾脏数据系统的数据预测终末期肾病患者血液透析获得失败的时间,说明了我们新的基于协方差的惩罚估计器的实用性。
{"title":"A Covariance-Based Penalty Estimator for Model Assessment With Censored Data","authors":"Zhuoran Zhang,&nbsp;Daniel L. Gillen","doi":"10.1002/bimj.70103","DOIUrl":"10.1002/bimj.70103","url":null,"abstract":"<div>\u0000 \u0000 <p>Prediction model selection and assessment are primary objectives of many statistical analyses. Covariance-based penalty estimators provide analytic estimates of the optimism associated with naive training error estimates for multiple classes of prediction models and error assessment rules. While the majority of work on covariance-based penalties has focused on prediction for uncensored data, little attention has been given to time-to-event data. In this article, we consider estimating the optimism for survival prediction models assessed via the Brier score. We first analytically derive an expression of the optimism in a single group scenario with uncensored data based on a reformulation of the optimism. With the same reformulation, we propose an algorithm to estimate the optimism for Cox's proportional hazards regression under a general prediction setting involving covariates and right censoring. We verify the derived theory and demonstrate the applicability of the proposed algorithm via simulation studies. Finally, we illustrate the utility of our new covariance-based penalty estimator through an application predicting time to hemodialysis access failure among patients with end-stage renal disease using data from the United States Renal Data System.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrical Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1