首页 > 最新文献

Biometrical Journal最新文献

英文 中文
Time-Dependent Mediators in Survival Analysis: Graphical Representation of Causal Assumptions 生存分析中的时间依赖中介:因果假设的图形表示。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-29 DOI: 10.1002/bimj.70110
Søren Wengel Mogensen, Odd O. Aalen, Susanne Strohmaier

We study time-dependent mediators in survival analysis using a treatment separation approach due to Didelez [Lifetime Data Analysis 25, no. 4: 593–610] and based on earlier work by Robins and Richardson [Causality and Psychopathology: Finding the Determinants of Disorders and Their Cures, 103–158. Oxford University Press]. This approach avoids nested counterfactuals and cross-world assumptions which are otherwise common in mediation analysis. The causal model of treatment, mediators, covariates, confounders, and outcome is represented by directed acyclic graphs (DAGs). However, the DAGs tend to be very complex when we have measurements at many time points. We therefore suggest using so-called rolled graphs in which a node represents an entire coordinate process instead of a single random variable, leading us to far simpler graphical representations. The rolled graphs are not necessarily acyclic; they can be analyzed by δ$delta$-separation which is the appropriate graphical separation criterion in this class of graphs and analogous to d$d$-separation. In particular, δ$delta$-separation is a graphical tool for evaluating if the conditions of the mediation analysis are met, or if unmeasured confounders influence the estimated effects. We also state a mediational g-formula. This is similar to the approach in Vansteelandt et al. [Statistics in Medicine 38, no. 24: 4828–4840], although that paper has a different conceptual basis. Finally, we apply this framework to a statistical model based on a Cox model with an added treatment effect.

由于Didelez [Lifetime Data analysis 25, no. 5],我们使用治疗分离方法研究了生存分析中的时间依赖性介质。[4] 593-610]并基于罗宾斯和理查森的早期工作[因果关系和精神病理学:发现疾病的决定因素及其治疗,103-158]。牛津大学出版社]。这种方法避免了嵌套的反事实和跨世界假设,这些在中介分析中很常见。治疗、中介、协变量、混杂因素和结果的因果模型由有向无环图(dag)表示。然而,当我们在许多时间点进行测量时,dag往往非常复杂。因此,我们建议使用所谓的滚动图,其中一个节点代表整个坐标过程,而不是单个随机变量,从而使我们的图形表示更简单。卷起的图不一定是无循环的;它们可以用δ $ δ $分离来分析,这是这类图中适当的图形分离准则,类似于d$ d$分离。特别地,δ $delta$ -分离是一种图形工具,用于评估中介分析的条件是否满足,或者未测量的混杂因素是否影响估计的效果。我们也给出了一个中介的g公式。这与Vansteelandt等人的方法相似。[医学统计38,no. 6][24: 4828-4840],尽管那篇论文有不同的概念基础。最后,我们将这个框架应用到一个基于Cox模型的统计模型中,并增加了治疗效果。
{"title":"Time-Dependent Mediators in Survival Analysis: Graphical Representation of Causal Assumptions","authors":"Søren Wengel Mogensen,&nbsp;Odd O. Aalen,&nbsp;Susanne Strohmaier","doi":"10.1002/bimj.70110","DOIUrl":"10.1002/bimj.70110","url":null,"abstract":"<div>\u0000 \u0000 <p>We study time-dependent mediators in survival analysis using a treatment separation approach due to Didelez [<i>Lifetime Data Analysis</i> 25, no. 4: 593–610] and based on earlier work by Robins and Richardson [<i>Causality and Psychopathology: Finding the Determinants of Disorders and Their Cures</i>, 103–158. Oxford University Press]. This approach avoids nested counterfactuals and cross-world assumptions which are otherwise common in mediation analysis. The causal model of treatment, mediators, covariates, confounders, and outcome is represented by directed acyclic graphs (DAGs). However, the DAGs tend to be very complex when we have measurements at many time points. We therefore suggest using so-called <i>rolled graphs</i> in which a node represents an entire coordinate process instead of a single random variable, leading us to far simpler graphical representations. The rolled graphs are not necessarily acyclic; they can be analyzed by <span></span><math>\u0000 <semantics>\u0000 <mi>δ</mi>\u0000 <annotation>$delta$</annotation>\u0000 </semantics></math>-separation which is the appropriate graphical separation criterion in this class of graphs and analogous to <span></span><math>\u0000 <semantics>\u0000 <mi>d</mi>\u0000 <annotation>$d$</annotation>\u0000 </semantics></math>-separation. In particular, <span></span><math>\u0000 <semantics>\u0000 <mi>δ</mi>\u0000 <annotation>$delta$</annotation>\u0000 </semantics></math>-separation is a graphical tool for evaluating if the conditions of the mediation analysis are met, or if unmeasured confounders influence the estimated effects. We also state a mediational g-formula. This is similar to the approach in Vansteelandt et al. [<i>Statistics in Medicine</i> 38, no. 24: 4828–4840], although that paper has a different conceptual basis. Finally, we apply this framework to a statistical model based on a Cox model with an added treatment effect.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Challenge of Time-to-Event Analysis for Multiple Events: A Guided Tour From Time-to-First-Event to Recurrent Time-to-Event Analysis 多事件时间到事件分析的挑战:从时间到第一事件到重复时间到事件分析的导读。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-28 DOI: 10.1002/bimj.70107
Sandra Schmeller, Alexandra Erdmann, Jan Beyersmann, Christiane Angermann, Ann-Kathrin Ozga

Clinical trials often compare a treatment to a control group concerning multiple possible combined time-to-event endpoints like hospital-free survival. Thereby, the first endpoint may occur more than once (“recurrent”), whereas the second endpoint is absorbing. Inclusion of all observed events in the analysis can increase the power and provide a more complete picture of the disease but it needs more sophisticated methodology. We give a stepwise guidance on how to extend the simple time-to-first event model to complex multistate methodology, where multiple events are incorporated. We thereby consider non- and semiparametric methods and show how they are related. Special attention is given to the prerequisites of the models, for example, the Markov property, and their interpretation. Due to novel results in non-Markov models, the summary measurements: state occupation probability, mean number of hospitalizations, and average length of stay allow an easy interpretation of a treatment effect in non-Markov models if the censoring is random. Partly conditional transition rates can be estimated instead of hazards. We investigate the difference between partly conditional transition rates and hazards and the impact of the random censoring condition in a simulation study. Furthermore, the simulation study considers the sensitivity of a Markov test. Different estimators are introduced, and their use is explained based on data from the randomized controlled Interdisciplinary Network Heart Failure trial, which investigated the effects of a nurse-coordinated disease management program. The aim is to give an overview of existing methods, present the assumptions, and elaborate on the differences in interpretation.

临床试验经常将治疗与对照组进行比较,涉及多个可能的联合时间到事件终点,如无医院生存期。因此,第一个终点可能出现不止一次(“反复”),而第二个终点是吸收性的。在分析中纳入所有观察到的事件可以增加效力并提供更完整的疾病图像,但需要更复杂的方法。我们给出了如何将简单的时间到第一个事件模型扩展到复杂的多状态方法的逐步指导,其中包含多个事件。因此,我们考虑非参数和半参数方法,并说明它们是如何相关的。特别注意模型的先决条件,例如,马尔可夫性质及其解释。由于非马尔可夫模型中的新结果,如果审查是随机的,则总结测量:状态职业概率,平均住院次数和平均住院时间允许在非马尔可夫模型中轻松解释治疗效果。可以估计部分有条件的转移率,而不是危害。在模拟研究中,我们研究了部分条件转移率和危险之间的差异以及随机审查条件的影响。此外,仿真研究还考虑了马尔可夫检验的敏感性。介绍了不同的估计器,并根据随机对照跨学科网络心力衰竭试验的数据解释了它们的使用,该试验调查了护士协调疾病管理计划的效果。目的是概述现有方法,提出假设,并详细说明解释的差异。
{"title":"The Challenge of Time-to-Event Analysis for Multiple Events: A Guided Tour From Time-to-First-Event to Recurrent Time-to-Event Analysis","authors":"Sandra Schmeller,&nbsp;Alexandra Erdmann,&nbsp;Jan Beyersmann,&nbsp;Christiane Angermann,&nbsp;Ann-Kathrin Ozga","doi":"10.1002/bimj.70107","DOIUrl":"10.1002/bimj.70107","url":null,"abstract":"<p>Clinical trials often compare a treatment to a control group concerning multiple possible combined time-to-event endpoints like hospital-free survival. Thereby, the first endpoint may occur more than once (“recurrent”), whereas the second endpoint is absorbing. Inclusion of all observed events in the analysis can increase the power and provide a more complete picture of the disease but it needs more sophisticated methodology. We give a stepwise guidance on how to extend the simple time-to-first event model to complex multistate methodology, where multiple events are incorporated. We thereby consider non- and semiparametric methods and show how they are related. Special attention is given to the prerequisites of the models, for example, the Markov property, and their interpretation. Due to novel results in non-Markov models, the summary measurements: state occupation probability, mean number of hospitalizations, and average length of stay allow an easy interpretation of a treatment effect in non-Markov models if the censoring is random. Partly conditional transition rates can be estimated instead of hazards. We investigate the difference between partly conditional transition rates and hazards and the impact of the random censoring condition in a simulation study. Furthermore, the simulation study considers the sensitivity of a Markov test. Different estimators are introduced, and their use is explained based on data from the randomized controlled Interdisciplinary Network Heart Failure trial, which investigated the effects of a nurse-coordinated disease management program. The aim is to give an overview of existing methods, present the assumptions, and elaborate on the differences in interpretation.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12848661/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146068669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Multiple Outcomes in Contaminated Trials Reinforced With Validation Data 用验证数据加强污染试验的多结果分析。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-28 DOI: 10.1002/bimj.70111
Solomon W. Harrar, Zi Ye

This paper is concerned with estimation and testing for treatment effects with multivariate outcomes. It primarily focuses on the situation where imperfect diagnostic tools are used to classify subjects into different groups. Oftentimes, there are more expensive and/or invasive diagnostic tools to accurately determine the subjects' status or conditions, yielding partially validated data on a smaller number of subjects. We propose moment-based approaches for estimating and testing treatment effects. We compare our methods with maximum likelihood approach using the EM algorithm, which requires strong assumptions and bears computational burden, and with traditional methods, which ignore the diagnostic tool's imperfection. The proposed methods show advantages in terms of coverage probability, computations efficiency, and robustness. The application of the methods is illustrated with gene-expression data from the Genes-environments & Admixture in Latino Americans (GALA) II study of asthma in Hispanic/Latino children.

本文关注的是多变量结果治疗效果的估计和检验。它主要关注的是使用不完善的诊断工具将受试者分为不同组的情况。通常,有更昂贵和/或侵入性的诊断工具来准确地确定受试者的状态或条件,在少数受试者中产生部分有效的数据。我们提出了基于矩的方法来估计和测试治疗效果。我们将我们的方法与使用EM算法的最大似然方法进行了比较,该方法需要很强的假设和计算负担,而传统方法忽略了诊断工具的不完善性。所提出的方法在覆盖概率、计算效率和鲁棒性方面具有优势。这些方法的应用通过拉丁美洲儿童哮喘的基因-环境和混合(GALA) II研究的基因表达数据来说明。
{"title":"Analysis of Multiple Outcomes in Contaminated Trials Reinforced With Validation Data","authors":"Solomon W. Harrar,&nbsp;Zi Ye","doi":"10.1002/bimj.70111","DOIUrl":"10.1002/bimj.70111","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper is concerned with estimation and testing for treatment effects with multivariate outcomes. It primarily focuses on the situation where imperfect diagnostic tools are used to classify subjects into different groups. Oftentimes, there are more expensive and/or invasive diagnostic tools to accurately determine the subjects' status or conditions, yielding partially validated data on a smaller number of subjects. We propose moment-based approaches for estimating and testing treatment effects. We compare our methods with maximum likelihood approach using the EM algorithm, which requires strong assumptions and bears computational burden, and with traditional methods, which ignore the diagnostic tool's imperfection. The proposed methods show advantages in terms of coverage probability, computations efficiency, and robustness. The application of the methods is illustrated with gene-expression data from the Genes-environments &amp; Admixture in Latino Americans (GALA) II study of asthma in Hispanic/Latino children.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146068672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Issue Information: Biometrical Journal 1'26 期刊信息:bioometic Journal 1'26
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-09 DOI: 10.1002/bimj.70109
{"title":"Issue Information: Biometrical Journal 1'26","authors":"","doi":"10.1002/bimj.70109","DOIUrl":"https://doi.org/10.1002/bimj.70109","url":null,"abstract":"","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bimj.70109","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145983580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-Dependent Predictive Accuracy Metrics in the Context of Interval Censoring and Competing Risks 区间筛选和竞争风险下的时变预测精度度量。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-05 DOI: 10.1002/bimj.70108
Zhenwei Yang, Dimitris Rizopoulos, Lisa F. Newcomb, Nicole S. Erler

Evaluating the performance of a prediction model is a common task in medical statistics. Standard accuracy metrics require the observation of the true outcomes. This is typically not possible in the setting with time-to-event outcomes due to censoring. Interval censoring, the presence of time-varying covariates, and competing risks present additional challenges in obtaining those accuracy metrics. In this study, we propose two methods to deal with interval censoring in a time-varying competing risk setting: a model-based approach and the inverse probability of censoring weighting (IPCW) approach, focusing on three key time-dependent metrics: area under the receiver-operating characteristic curve, Brier score, and expected predictive cross-entropy. The evaluation is conducted over a medically relevant time interval of interest, [t,Δt)$[t, Delta t)$. The model-based approach includes all subjects in the risk set, using their predicted risks to contribute to the accuracy metrics. In contrast, the IPCW approach only considers the subset of subjects who are known to be event-free or experience the event within the interval of interest. We performed a simulation study to compare the performance of the two approaches with regard to the three metrics. Furthermore, we demonstrated the three metrics using the two approaches on an example prostate cancer surveillance cohort. Risk predictions were generated from a joint model handling the interval-censored cancer progression and the competing event, early treatment, and repeatedly measured biomarkers.

评估预测模型的性能是医学统计中常见的任务。标准精度度量要求观察真实结果。由于审查,这在具有时间到事件结果的设置中通常是不可能的。区间审查、时变协变量的存在以及竞争风险为获得这些精度指标带来了额外的挑战。在这项研究中,我们提出了两种方法来处理时变竞争风险设置中的区间审查:基于模型的方法和审查加权逆概率(IPCW)方法,重点关注三个关键的时间相关指标:接收者操作特征曲线下的面积,Brier评分和预期预测交叉熵。评估是在一个医学相关的时间间隔内进行的,[t, Δ t)$ [t, Δ t)$。基于模型的方法包括风险集中的所有主题,使用他们预测的风险来贡献准确性度量。相比之下,IPCW方法只考虑已知没有事件或在感兴趣的时间间隔内经历事件的受试者子集。我们进行了一项模拟研究,以比较两种方法在三个指标方面的性能。此外,我们在一个前列腺癌监测队列中使用这两种方法证明了这三个指标。风险预测是由一个联合模型生成的,该模型处理间隔审查的癌症进展和竞争事件、早期治疗和反复测量的生物标志物。
{"title":"Time-Dependent Predictive Accuracy Metrics in the Context of Interval Censoring and Competing Risks","authors":"Zhenwei Yang,&nbsp;Dimitris Rizopoulos,&nbsp;Lisa F. Newcomb,&nbsp;Nicole S. Erler","doi":"10.1002/bimj.70108","DOIUrl":"10.1002/bimj.70108","url":null,"abstract":"<p>Evaluating the performance of a prediction model is a common task in medical statistics. Standard accuracy metrics require the observation of the true outcomes. This is typically not possible in the setting with time-to-event outcomes due to censoring. Interval censoring, the presence of time-varying covariates, and competing risks present additional challenges in obtaining those accuracy metrics. In this study, we propose two methods to deal with interval censoring in a time-varying competing risk setting: a model-based approach and the inverse probability of censoring weighting (IPCW) approach, focusing on three key time-dependent metrics: area under the receiver-operating characteristic curve, Brier score, and expected predictive cross-entropy. The evaluation is conducted over a medically relevant time interval of interest, <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mo>[</mo>\u0000 <mi>t</mi>\u0000 <mo>,</mo>\u0000 <mi>Δ</mi>\u0000 <mi>t</mi>\u0000 <mo>)</mo>\u0000 </mrow>\u0000 <annotation>$[t, Delta t)$</annotation>\u0000 </semantics></math>. The model-based approach includes all subjects in the risk set, using their predicted risks to contribute to the accuracy metrics. In contrast, the IPCW approach only considers the subset of subjects who are known to be event-free or experience the event within the interval of interest. We performed a simulation study to compare the performance of the two approaches with regard to the three metrics. Furthermore, we demonstrated the three metrics using the two approaches on an example prostate cancer surveillance cohort. Risk predictions were generated from a joint model handling the interval-censored cancer progression and the competing event, early treatment, and repeatedly measured biomarkers.</p>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12766878/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145901368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Informative Co-Data Learning for High-Dimensional Horseshoe Regression 高维马蹄形回归的信息协同数据学习。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-30 DOI: 10.1002/bimj.70105
Claudio Busatto, Mark A. van de Wiel

High-dimensional data often arise from clinical genomics research to infer relevant predictors of a particular trait. A way to improve the predictive performance is by incorporating information about the predictors obtained from existing from prior knowledge or previous studies. Such information is also referred to as “co-data.” To this aim, we develop a novel Bayesian model for including co-data in a high-dimensional regression framework, termed informative Horseshoe regression (infHS). The proposed approach regresses the prior variances of the regression parameters on the co-data variables, improving variable selection and prediction. We implement both a Gibbs sampler and a Variational approximation algorithm. The former is suited for applications of moderate dimensions which, besides prediction, target posterior inference, whereas the latter's computational efficiency allows handling a very large number of variables. We show the benefits of including co-data through a simulation study. Lastly, we demonstrate that infHS outperforms competing approaches in two genomics applications.

高维数据通常来自临床基因组学研究,用于推断特定性状的相关预测因子。提高预测性能的一种方法是结合从现有的先验知识或以前的研究中获得的预测因子的信息。这样的信息也被称为“共同数据”。为此,我们开发了一种新的贝叶斯模型,用于将协数据包含在高维回归框架中,称为信息马蹄回归(infHS)。该方法对回归参数在协数据变量上的先验方差进行回归,提高了变量的选择和预测能力。我们实现了吉布斯采样器和变分近似算法。前者适用于中等维度的应用,除了预测之外,目标是后验推理,而后者的计算效率允许处理非常大量的变量。我们通过模拟研究展示了包含共同数据的好处。最后,我们证明了infHS在两个基因组学应用中优于竞争方法。
{"title":"Informative Co-Data Learning for High-Dimensional Horseshoe Regression","authors":"Claudio Busatto,&nbsp;Mark A. van de Wiel","doi":"10.1002/bimj.70105","DOIUrl":"10.1002/bimj.70105","url":null,"abstract":"<div>\u0000 \u0000 <p>High-dimensional data often arise from clinical genomics research to infer relevant predictors of a particular trait. A way to improve the predictive performance is by incorporating information about the predictors obtained from existing from prior knowledge or previous studies. Such information is also referred to as “co-data.” To this aim, we develop a novel Bayesian model for including co-data in a high-dimensional regression framework, termed informative Horseshoe regression (infHS). The proposed approach regresses the prior variances of the regression parameters on the co-data variables, improving variable selection and prediction. We implement both a Gibbs sampler and a Variational approximation algorithm. The former is suited for applications of moderate dimensions which, besides prediction, target posterior inference, whereas the latter's computational efficiency allows handling a very large number of variables. We show the benefits of including co-data through a simulation study. Lastly, we demonstrate that infHS outperforms competing approaches in two genomics applications.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145859358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modified Skew Discrete Laplace Regression Models for Integer-Valued Data With Applications to Paired Samples 整数数据的修正偏态离散拉普拉斯回归模型及其在成对样本中的应用。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-29 DOI: 10.1002/bimj.70106
Rodrigo M. R. de Medeiros, Marcelo Bourguignon

Modeling events associated with discrete-valued observations arises in several practical situations. Until now, research on statistical methods for discrete data has primarily focused on modeling count data. Nevertheless, discrete observations that may assume any value in the set of integers Z={,2,1,0,1,2,}$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$ are also found in various contexts. This paper introduces a general parametric modeling framework for the analysis of integer-valued data, with applications to paired discrete observations. The proposed model is based on the modified skew discrete Laplace distribution. Our approach enables a straightforward interpretation of regression coefficients in terms of mean and dispersion, while properly accounting for the discrete nature of the data. We adopt a frequentist approach to perform inference and define diagnostic tools to assess goodness-of-fit. Additionally, we conduct several simulation studies to examine the asymptotic properties of the estimators and test statistics, as well as the distribution of the residuals. We illustrate the usefulness of the proposed model with two real datasets: one from an experimental study conducted in a French penitentiary, and another involving diagnostic imaging to assess kidney function through dynamic and static scintigraphy. Estimation and inference procedures for the new regression model are implemented in the R package sdlrm.

与离散值观测相关的建模事件出现在几种实际情况中。到目前为止,离散数据的统计方法研究主要集中在计数数据的建模上。然而,可以在整数集合Z ={…,-2,-1,0,1,2,…}$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$中假设任意值的离散观测值也可以在各种上下文中找到。本文介绍了一种用于整数值数据分析的通用参数化建模框架,并将其应用于成对离散观测。该模型基于修正的偏态离散拉普拉斯分布。我们的方法可以根据平均值和离散度直接解释回归系数,同时适当地考虑数据的离散性。我们采用频率论的方法来进行推理,并定义诊断工具来评估拟合优度。此外,我们进行了一些模拟研究,以检查估计量和检验统计量的渐近性质,以及残差的分布。我们用两个真实的数据集来说明所提出的模型的实用性:一个来自法国监狱进行的实验研究,另一个涉及通过动态和静态闪烁成像来评估肾功能的诊断成像。新回归模型的估计和推理程序在R包sdlrm中实现。
{"title":"Modified Skew Discrete Laplace Regression Models for Integer-Valued Data With Applications to Paired Samples","authors":"Rodrigo M. R. de Medeiros,&nbsp;Marcelo Bourguignon","doi":"10.1002/bimj.70106","DOIUrl":"10.1002/bimj.70106","url":null,"abstract":"<div>\u0000 \u0000 <p>Modeling events associated with discrete-valued observations arises in several practical situations. Until now, research on statistical methods for discrete data has primarily focused on modeling count data. Nevertheless, discrete observations that may assume any value in the set of integers <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>Z</mi>\u0000 <mo>=</mo>\u0000 <mo>{</mo>\u0000 <mtext>…</mtext>\u0000 <mo>,</mo>\u0000 <mo>−</mo>\u0000 <mn>2</mn>\u0000 <mo>,</mo>\u0000 <mo>−</mo>\u0000 <mn>1</mn>\u0000 <mo>,</mo>\u0000 <mn>0</mn>\u0000 <mo>,</mo>\u0000 <mn>1</mn>\u0000 <mo>,</mo>\u0000 <mn>2</mn>\u0000 <mo>,</mo>\u0000 <mtext>…</mtext>\u0000 <mo>}</mo>\u0000 </mrow>\u0000 <annotation>$mathbb {Z} = lbrace ldots, -2, -1, 0, 1, 2, ldots rbrace$</annotation>\u0000 </semantics></math> are also found in various contexts. This paper introduces a general parametric modeling framework for the analysis of integer-valued data, with applications to paired discrete observations. The proposed model is based on the modified skew discrete Laplace distribution. Our approach enables a straightforward interpretation of regression coefficients in terms of mean and dispersion, while properly accounting for the discrete nature of the data. We adopt a frequentist approach to perform inference and define diagnostic tools to assess goodness-of-fit. Additionally, we conduct several simulation studies to examine the asymptotic properties of the estimators and test statistics, as well as the distribution of the residuals. We illustrate the usefulness of the proposed model with two real datasets: one from an experimental study conducted in a French penitentiary, and another involving diagnostic imaging to assess kidney function through dynamic and static scintigraphy. Estimation and inference procedures for the new regression model are implemented in the R package <span>sdlrm</span>.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Covariance-Based Penalty Estimator for Model Assessment With Censored Data 基于协方差的删节数据模型评估惩罚估计。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-29 DOI: 10.1002/bimj.70103
Zhuoran Zhang, Daniel L. Gillen

Prediction model selection and assessment are primary objectives of many statistical analyses. Covariance-based penalty estimators provide analytic estimates of the optimism associated with naive training error estimates for multiple classes of prediction models and error assessment rules. While the majority of work on covariance-based penalties has focused on prediction for uncensored data, little attention has been given to time-to-event data. In this article, we consider estimating the optimism for survival prediction models assessed via the Brier score. We first analytically derive an expression of the optimism in a single group scenario with uncensored data based on a reformulation of the optimism. With the same reformulation, we propose an algorithm to estimate the optimism for Cox's proportional hazards regression under a general prediction setting involving covariates and right censoring. We verify the derived theory and demonstrate the applicability of the proposed algorithm via simulation studies. Finally, we illustrate the utility of our new covariance-based penalty estimator through an application predicting time to hemodialysis access failure among patients with end-stage renal disease using data from the United States Renal Data System.

预测模型的选择和评估是许多统计分析的主要目标。基于协方差的惩罚估计提供了与多类预测模型和误差评估规则的朴素训练误差估计相关的乐观度的分析估计。虽然基于协方差的惩罚的大部分工作都集中在对未经审查的数据的预测上,但对事件时间数据的关注很少。在本文中,我们考虑通过Brier评分评估生存预测模型的乐观度。我们首先基于乐观主义的重新表述,解析地推导出在单个群体场景中使用未经审查的数据的乐观主义表达式。通过同样的重新表述,我们提出了一种算法来估计Cox比例风险回归在涉及协变量和右审查的一般预测设置下的乐观度。我们通过仿真研究验证了所推导的理论,并证明了所提出算法的适用性。最后,我们通过使用美国肾脏数据系统的数据预测终末期肾病患者血液透析获得失败的时间,说明了我们新的基于协方差的惩罚估计器的实用性。
{"title":"A Covariance-Based Penalty Estimator for Model Assessment With Censored Data","authors":"Zhuoran Zhang,&nbsp;Daniel L. Gillen","doi":"10.1002/bimj.70103","DOIUrl":"10.1002/bimj.70103","url":null,"abstract":"<div>\u0000 \u0000 <p>Prediction model selection and assessment are primary objectives of many statistical analyses. Covariance-based penalty estimators provide analytic estimates of the optimism associated with naive training error estimates for multiple classes of prediction models and error assessment rules. While the majority of work on covariance-based penalties has focused on prediction for uncensored data, little attention has been given to time-to-event data. In this article, we consider estimating the optimism for survival prediction models assessed via the Brier score. We first analytically derive an expression of the optimism in a single group scenario with uncensored data based on a reformulation of the optimism. With the same reformulation, we propose an algorithm to estimate the optimism for Cox's proportional hazards regression under a general prediction setting involving covariates and right censoring. We verify the derived theory and demonstrate the applicability of the proposed algorithm via simulation studies. Finally, we illustrate the utility of our new covariance-based penalty estimator through an application predicting time to hemodialysis access failure among patients with end-stage renal disease using data from the United States Renal Data System.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"68 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dimension Reduction for the Conditional Quantiles of Functional Data With Categorical Predictors 具有分类预测因子的功能数据条件分位数的降维。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-18 DOI: 10.1002/bimj.70102
Shanshan Wang, Eliana Christou, Eftychia Solea, Jun Song

Functional data analysis has received significant attention due to its frequent occurrence in modern applications, such as in the medical field, where electrocardiograms or electroencephalograms can be used for a better understanding of various medical conditions. Due to the infinite-dimensional nature of functional elements, the current work focuses on dimension reduction techniques. This study shifts its focus to modeling the conditional quantiles of functional data, noting that existing works are limited to quantitative predictors. Consequently, we introduce the first approach to partial dimension reduction for the conditional quantiles under the presence of both functional and categorical predictors. We present the proposed algorithm and derive the convergence rates of the estimators. Moreover, we demonstrate the finite sample performance of the method using simulation examples and a real dataset based on functional magnetic resonance imaging.

功能数据分析由于其在现代应用中的频繁出现而受到了极大的关注,例如在医疗领域,心电图或脑电图可以用于更好地了解各种医疗状况。由于功能元素的无限维性质,目前的工作重点是降维技术。本研究将重点转移到对功能数据的条件分位数进行建模,注意到现有的工作仅限于定量预测因子。因此,我们引入了第一种方法来部分降维的条件分位数在功能和分类预测的存在。我们给出了该算法,并推导了估计量的收敛速率。此外,我们使用仿真示例和基于功能磁共振成像的真实数据集来证明该方法的有限样本性能。
{"title":"Dimension Reduction for the Conditional Quantiles of Functional Data With Categorical Predictors","authors":"Shanshan Wang,&nbsp;Eliana Christou,&nbsp;Eftychia Solea,&nbsp;Jun Song","doi":"10.1002/bimj.70102","DOIUrl":"10.1002/bimj.70102","url":null,"abstract":"<div>\u0000 \u0000 <p>Functional data analysis has received significant attention due to its frequent occurrence in modern applications, such as in the medical field, where electrocardiograms or electroencephalograms can be used for a better understanding of various medical conditions. Due to the infinite-dimensional nature of functional elements, the current work focuses on dimension reduction techniques. This study shifts its focus to modeling the conditional quantiles of functional data, noting that existing works are limited to quantitative predictors. Consequently, we introduce the first approach to partial dimension reduction for the conditional quantiles under the presence of both functional and categorical predictors. We present the proposed algorithm and derive the convergence rates of the estimators. Moreover, we demonstrate the finite sample performance of the method using simulation examples and a real dataset based on functional magnetic resonance imaging.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 6","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empirical Likelihood Comparison of Absolute Risks 绝对风险的经验似然比较。
IF 1.8 3区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-18 DOI: 10.1002/bimj.70104
Paul Blanche, Frank Eriksson

In the competing risks setting, the t$t$-year absolute risk for a specific time t$t$ (e.g., 2 years), also called the cumulative incidence function at time t$t$, is often interesting to estimate. It is routinely estimated using the nonparametric Aalen–Johansen estimator. This estimator handles right-censored data and has desirable large sample properties, as it is the nonparametric maximum likelihood estimator (NPMLE). Inference for comparing absolute risks, via either a risk difference or a risk ratio, can therefore be done via usual asymptotic normal approximations and the delta method. However, the small sample performances of this approach are not fully satisfactory. Especially, (i) coverage of confidence intervals may be inaccurate and (ii) comparisons made using a risk ratio and a risk difference can lead to inconsistent conclusions, in terms of statistical significance. We, therefore, introduce an alternative empirical likelihood approach. One advantage of this approach is that it always leads to consistent conclusions when comparing absolute risks via a risk ratio and a risk difference, in terms of significance. Simulation results also suggest that small sample inference using this approach can be more accurate. We present the computation of confidence intervals and p-values using this approach and the asymptotic properties that justify them. We provide formulas and algorithms to compute constrained NPMLE, from which empirical likelihood ratios and inference procedures are derived. The novel approach has been implemented in the timeEL package for R, and some of its advantages are demonstrated via reproducible analyses of bone marrow transplant data.

在竞争风险设置中,特定时间t$ t$(例如,2年)的t$ t$年绝对风险,也称为时间t$ t$的累积关联函数,通常是有趣的估计。通常使用非参数aallen - johansen估计量进行估计。该估计器处理右截尾数据,并具有理想的大样本特性,因为它是非参数最大似然估计器(NPMLE)。因此,通过风险差或风险比来比较绝对风险的推理可以通过通常的渐近正态近似和delta方法来完成。然而,这种方法的小样本性能并不完全令人满意。特别是,(i)置信区间的覆盖范围可能不准确,(ii)使用风险比和风险差异进行比较可能导致统计显著性方面的结论不一致。因此,我们引入了另一种经验似然方法。这种方法的一个优点是,当通过风险比和风险差比较绝对风险时,在显著性方面,它总是得出一致的结论。仿真结果也表明,使用该方法进行小样本推理可以获得更准确的结果。我们给出了用这种方法计算置信区间和p值以及证明它们的渐近性质。我们提供了计算约束NPMLE的公式和算法,并由此导出了经验似然比和推理程序。这种新方法已经在R的timeEL软件包中实现,并且通过对骨髓移植数据的可重复分析证明了它的一些优点。
{"title":"Empirical Likelihood Comparison of Absolute Risks","authors":"Paul Blanche,&nbsp;Frank Eriksson","doi":"10.1002/bimj.70104","DOIUrl":"10.1002/bimj.70104","url":null,"abstract":"<div>\u0000 \u0000 <p>In the competing risks setting, the <span></span><math>\u0000 <semantics>\u0000 <mi>t</mi>\u0000 <annotation>$t$</annotation>\u0000 </semantics></math>-year absolute risk for a specific time <span></span><math>\u0000 <semantics>\u0000 <mi>t</mi>\u0000 <annotation>$t$</annotation>\u0000 </semantics></math> (e.g., 2 years), also called the cumulative incidence function at time <span></span><math>\u0000 <semantics>\u0000 <mi>t</mi>\u0000 <annotation>$t$</annotation>\u0000 </semantics></math>, is often interesting to estimate. It is routinely estimated using the nonparametric Aalen–Johansen estimator. This estimator handles right-censored data and has desirable large sample properties, as it is the nonparametric maximum likelihood estimator (NPMLE). Inference for comparing absolute risks, via either a risk difference or a risk ratio, can therefore be done via usual asymptotic normal approximations and the delta method. However, the small sample performances of this approach are not fully satisfactory. Especially, (i) coverage of confidence intervals may be inaccurate and (ii) comparisons made using a risk ratio and a risk difference can lead to inconsistent conclusions, in terms of statistical significance. We, therefore, introduce an alternative empirical likelihood approach. One advantage of this approach is that it always leads to consistent conclusions when comparing absolute risks via a risk ratio and a risk difference, in terms of significance. Simulation results also suggest that small sample inference using this approach can be more accurate. We present the computation of confidence intervals and <i>p</i>-values using this approach and the asymptotic properties that justify them. We provide formulas and algorithms to compute constrained NPMLE, from which empirical likelihood ratios and inference procedures are derived. The novel approach has been implemented in the <span>timeEL</span> package for <span>R</span>, and some of its advantages are demonstrated via reproducible analyses of bone marrow transplant data.</p></div>","PeriodicalId":55360,"journal":{"name":"Biometrical Journal","volume":"67 6","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrical Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1