首页 > 最新文献

Biostatistics最新文献

英文 中文
Shortcomings of deep learning for distributional predictors: a note. 分布预测器深度学习的缺点:注释。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-20 DOI: 10.1093/biostatistics/kxaf051
Bonnie B Smith, Abhirup Datta, Brian Caffo

A number of domains in biomedical research use data with a large number of predictors all representing the same type of measurement. Often, an important summary is the within-person distribution of these predictors. Here we focus on settings where the mean relationship between outcome and predictors is fully captured by this distribution and, more generally, on problems where the goal is to learn a mapping that is invariant under permutations of the input vector. We compare unstructured neural networks, which do not explicitly incorporate the permutation invariance property, versus networks that we call ordered predictors neural networks. We show in simulations that the unstructured deep learning approach can yield higher prediction errors, compared to the approach that explicitly leverages the invariance to simplify the learning task. Additionally, in the context of neural Bayes estimation, in which neural networks are used to construct point estimators, we show that ordered predictors neural networks can yield substantially more precise estimators. We therefore recommend that, when permutation invariance is known or suspected to hold, investigators use a learning or statistical modeling approach that can leverage the invariance, rather than an unstructured deep learning approach.

生物医学研究的许多领域使用具有大量预测因子的数据,所有这些预测因子都代表同一类型的测量。通常,一个重要的总结是这些预测因子的个人内部分布。在这里,我们关注的是结果和预测器之间的平均关系被这个分布完全捕获的设置,更一般地说,我们关注的是目标是学习在输入向量置换下不变的映射的问题。我们比较了没有明确包含排列不变性的非结构化神经网络与我们称之为有序预测神经网络的网络。我们在模拟中表明,与明确利用不变性来简化学习任务的方法相比,非结构化深度学习方法可以产生更高的预测误差。此外,在神经贝叶斯估计的背景下,其中神经网络被用来构造点估计,我们表明有序预测神经网络可以产生更精确的估计。因此,我们建议,当已知或怀疑排列不变性时,研究人员使用可以利用不变性的学习或统计建模方法,而不是非结构化的深度学习方法。
{"title":"Shortcomings of deep learning for distributional predictors: a note.","authors":"Bonnie B Smith, Abhirup Datta, Brian Caffo","doi":"10.1093/biostatistics/kxaf051","DOIUrl":"10.1093/biostatistics/kxaf051","url":null,"abstract":"<p><p>A number of domains in biomedical research use data with a large number of predictors all representing the same type of measurement. Often, an important summary is the within-person distribution of these predictors. Here we focus on settings where the mean relationship between outcome and predictors is fully captured by this distribution and, more generally, on problems where the goal is to learn a mapping that is invariant under permutations of the input vector. We compare unstructured neural networks, which do not explicitly incorporate the permutation invariance property, versus networks that we call ordered predictors neural networks. We show in simulations that the unstructured deep learning approach can yield higher prediction errors, compared to the approach that explicitly leverages the invariance to simplify the learning task. Additionally, in the context of neural Bayes estimation, in which neural networks are used to construct point estimators, we show that ordered predictors neural networks can yield substantially more precise estimators. We therefore recommend that, when permutation invariance is known or suspected to hold, investigators use a learning or statistical modeling approach that can leverage the invariance, rather than an unstructured deep learning approach.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"27 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12815889/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146004720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two-stage approach for segmenting spatial point patterns applied to multiplex imaging. 一种用于多路成像的空间点模式分割的两阶段方法。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-20 DOI: 10.1093/biostatistics/kxaf049
Alvin Sheng, Brian J Reich, Ana-Maria Staicu, Santhoshi N Krishnan, Arvind Rao, Timothy L Frankel

Recent advances in multiplex imaging have enabled researchers to locate different types of cells within a tissue sample. This is especially relevant for tumor immunology, as clinical regimes corresponding to different stages of disease or responses to treatment may manifest as different spatial arrangements of tumor and immune cells. Spatial point pattern modeling can be used to partition multiplex tissue images according to these regimes. To this end, we propose a two-stage approach: first, local intensities and pair correlation functions are estimated from the spatial point pattern of cells within each image, and the pair correlation functions are reduced in dimension via spectral decomposition of the covariance function. Second, the estimates are clustered in a Bayesian hierarchical model with spatially-dependent cluster labels. The clusters correspond to regimes of interest that are present across subjects; the cluster labels segment the spatial point patterns according to those regimes. Through Markov Chain Monte Carlo sampling, we jointly estimate and quantify uncertainty in the cluster assignment and spatial characteristics of each cluster. Simulations demonstrate the performance of the method, and it is applied to a set of multiplex immunofluorescence images of diseased pancreatic tissue.

多重成像技术的最新进展使研究人员能够在组织样本中定位不同类型的细胞。这与肿瘤免疫学尤其相关,因为与疾病不同阶段或对治疗的反应相对应的临床制度可能表现为肿瘤和免疫细胞的不同空间排列。空间点模式建模可以根据这些模式对多重组织图像进行分割。为此,我们提出了一个两阶段的方法:首先,从每个图像内细胞的空间点模式估计局部强度和对相关函数,并通过协方差函数的频谱分解降低对相关函数的维数。其次,用贝叶斯层次模型对估计结果进行聚类,并对聚类标签进行空间依赖。这些集群对应于跨学科存在的兴趣机制;聚类标签根据这些机制对空间点模式进行分割。通过马尔可夫链蒙特卡罗采样,我们共同估计和量化了聚类分配和每个聚类空间特征的不确定性。仿真实验证明了该方法的有效性,并将其应用于一组病变胰腺组织的多重免疫荧光图像。
{"title":"A two-stage approach for segmenting spatial point patterns applied to multiplex imaging.","authors":"Alvin Sheng, Brian J Reich, Ana-Maria Staicu, Santhoshi N Krishnan, Arvind Rao, Timothy L Frankel","doi":"10.1093/biostatistics/kxaf049","DOIUrl":"10.1093/biostatistics/kxaf049","url":null,"abstract":"<p><p>Recent advances in multiplex imaging have enabled researchers to locate different types of cells within a tissue sample. This is especially relevant for tumor immunology, as clinical regimes corresponding to different stages of disease or responses to treatment may manifest as different spatial arrangements of tumor and immune cells. Spatial point pattern modeling can be used to partition multiplex tissue images according to these regimes. To this end, we propose a two-stage approach: first, local intensities and pair correlation functions are estimated from the spatial point pattern of cells within each image, and the pair correlation functions are reduced in dimension via spectral decomposition of the covariance function. Second, the estimates are clustered in a Bayesian hierarchical model with spatially-dependent cluster labels. The clusters correspond to regimes of interest that are present across subjects; the cluster labels segment the spatial point patterns according to those regimes. Through Markov Chain Monte Carlo sampling, we jointly estimate and quantify uncertainty in the cluster assignment and spatial characteristics of each cluster. Simulations demonstrate the performance of the method, and it is applied to a set of multiplex immunofluorescence images of diseased pancreatic tissue.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"27 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12815891/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146004726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk functions with outcome measurement error. 带有结果测量误差的风险函数。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-20 DOI: 10.1093/biostatistics/kxaf052
Jessie K Edwards, Stephen R Cole, Paul N Zivich, Benjamin Ackerman, Sonia Napravnik, Heather Henderson, Timothy Lash, Bonnie E Shook-Sa

Mortality risk estimated from studies that ascertain date of death through linkage to vital statistics registries may be subject to outcome measurement error. As a result, some deaths among study participants may not be captured, some study participants who are alive may be falsely categorized as deceased, and some deaths may be recorded at incorrect times, leading to bias in estimates of mortality risk and survival. Here, we illustrate an extension of the Rogan-Gladen estimator to account for outcome measurement error in risk and survival functions in settings with right censoring. As a motivating application, we consider and account for outcome measurement error that could be induced by incomplete and/or incorrect linkage to death registries when estimating mortality risk among people entering care for HIV in the University of North Carolina Center for AIDS Research HIV Clinical Cohort between 2001 and 2022. A series of simulation studies demonstrates that the approach performed well even when participants selected into the validation study were at higher mortality risk than the main study. The proposed approach may be parameterized using internal or external validation data or used as a form of quantitative bias analysis.

通过与生命统计登记的联系确定死亡日期的研究估计的死亡风险可能受到结果测量误差的影响。因此,研究参与者中的一些死亡可能没有被记录下来,一些活着的研究参与者可能被错误地归类为死亡,一些死亡可能在不正确的时间被记录下来,导致对死亡风险和生存的估计存在偏差。在这里,我们说明了罗根-格拉登估计器的扩展,以解释风险和生存函数在正确审查设置中的结果测量误差。作为一项激励应用,我们考虑并解释了在估计2001年至2022年间北卡罗来纳大学艾滋病研究中心HIV临床队列中进入HIV护理的患者的死亡率风险时,与死亡登记不完整和/或不正确的联系可能引起的结果测量误差。一系列模拟研究表明,即使被选择进入验证研究的参与者死亡率高于主要研究,该方法也表现良好。建议的方法可以使用内部或外部验证数据参数化,或用作定量偏差分析的一种形式。
{"title":"Risk functions with outcome measurement error.","authors":"Jessie K Edwards, Stephen R Cole, Paul N Zivich, Benjamin Ackerman, Sonia Napravnik, Heather Henderson, Timothy Lash, Bonnie E Shook-Sa","doi":"10.1093/biostatistics/kxaf052","DOIUrl":"10.1093/biostatistics/kxaf052","url":null,"abstract":"<p><p>Mortality risk estimated from studies that ascertain date of death through linkage to vital statistics registries may be subject to outcome measurement error. As a result, some deaths among study participants may not be captured, some study participants who are alive may be falsely categorized as deceased, and some deaths may be recorded at incorrect times, leading to bias in estimates of mortality risk and survival. Here, we illustrate an extension of the Rogan-Gladen estimator to account for outcome measurement error in risk and survival functions in settings with right censoring. As a motivating application, we consider and account for outcome measurement error that could be induced by incomplete and/or incorrect linkage to death registries when estimating mortality risk among people entering care for HIV in the University of North Carolina Center for AIDS Research HIV Clinical Cohort between 2001 and 2022. A series of simulation studies demonstrates that the approach performed well even when participants selected into the validation study were at higher mortality risk than the main study. The proposed approach may be parameterized using internal or external validation data or used as a form of quantitative bias analysis.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"27 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12815892/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146004702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional inference for functional regression with an application to the Alzheimer's disease magnetoencephalography study. 功能回归的高维推断与阿尔茨海默病脑磁图研究的应用。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-23 DOI: 10.1093/biostatistics/kxaf050
Huaqing Jin, Fei Jiang

Alzheimer's disease (AD) is a progressive, chronic neurodegenerative disorder affecting millions worldwide. A new clinical magnetoencephalography (MEG) study was conducted to identify neural activity biomarkers and key brain regions in AD. Traditional methods for analyzing MEG data, which typically extract features from power spectral density, suffer from information loss. Furthermore, functional regression with variable selection tends to produce non-robust results, making it less ideal for drawing reliable scientific conclusions. To address these challenges, we propose a high-dimensional hypothesis testing (HDHT) framework for functional covariates and introduce a rigorous inference process to support scientific conclusions. We establish the theoretical properties of the HDHT framework and validate its performance through simulation studies. Applying the HDHT framework to the AD MEG data, we identify 19 important regions associated with cognitive functions that align with established AD pathophysiology. These findings suggest that the non-invasive MEG can be a potential low-risk and low-toxicity modality for monitoring neurodegenerative progression.

阿尔茨海默病(AD)是一种进行性慢性神经退行性疾病,影响全球数百万人。一项新的临床脑磁图(MEG)研究用于识别AD患者的神经活动生物标志物和关键脑区。传统的脑磁图数据分析方法通常是从功率谱密度中提取特征,存在信息丢失的问题。此外,具有变量选择的函数回归往往产生非鲁棒性结果,使其不太适合得出可靠的科学结论。为了解决这些挑战,我们提出了一个功能协变量的高维假设检验(HDHT)框架,并引入了一个严格的推理过程来支持科学结论。建立了HDHT框架的理论特性,并通过仿真研究验证了其性能。将HDHT框架应用于AD MEG数据,我们确定了19个与认知功能相关的重要区域,这些区域与已建立的AD病理生理学相一致。这些发现表明,无创脑磁图可能是一种潜在的低风险和低毒性监测神经退行性进展的方式。
{"title":"High-dimensional inference for functional regression with an application to the Alzheimer's disease magnetoencephalography study.","authors":"Huaqing Jin, Fei Jiang","doi":"10.1093/biostatistics/kxaf050","DOIUrl":"10.1093/biostatistics/kxaf050","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is a progressive, chronic neurodegenerative disorder affecting millions worldwide. A new clinical magnetoencephalography (MEG) study was conducted to identify neural activity biomarkers and key brain regions in AD. Traditional methods for analyzing MEG data, which typically extract features from power spectral density, suffer from information loss. Furthermore, functional regression with variable selection tends to produce non-robust results, making it less ideal for drawing reliable scientific conclusions. To address these challenges, we propose a high-dimensional hypothesis testing (HDHT) framework for functional covariates and introduce a rigorous inference process to support scientific conclusions. We establish the theoretical properties of the HDHT framework and validate its performance through simulation studies. Applying the HDHT framework to the AD MEG data, we identify 19 important regions associated with cognitive functions that align with established AD pathophysiology. These findings suggest that the non-invasive MEG can be a potential low-risk and low-toxicity modality for monitoring neurodegenerative progression.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12728160/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145822185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Random forest for dynamic risk prediction of recurrent events: a pseudo-observation approach. 随机森林用于周期性事件的动态风险预测:一种伪观测方法。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf007
Abigail Loe, Susan Murray, Zhenke Wu

Recurrent events are common in clinical, healthcare, social, and behavioral studies, yet methods for dynamic risk prediction of these events are limited. To overcome some long-standing challenges in analyzing censored recurrent event data, a recent regression analysis framework constructs a censored longitudinal dataset consisting of times to the first recurrent event in multiple pre-specified follow-up windows of length $ tau $(XMT models). Traditional regression models struggle with nonlinear and multiway interactions, with success depending on the skill of the statistical programmer. With a staggering number of potential predictors being generated from genetic, -omic, and electronic health records sources, machine learning approaches such as the random forest regression are growing in popularity, as they can nonparametrically incorporate information from many predictors with nonlinear and multiway interactions involved in prediction. In this article, we (i) develop a random forest approach for dynamically predicting probabilities of remaining event-free during a subsequent $ tau $-duration follow-up period from a reconstructed censored longitudinal data set, (ii) modify the XMT regression approach to predict these same probabilities, subject to the limitations that traditional regression models typically have, and (iii) demonstrate how to incorporate patient-specific history of recurrent events for prediction in settings where this information may be partially missing. We show the increased ability of our random forest algorithm for predicting the probability of remaining event-free over a $ tau $-duration follow-up window when compared to our modified XMT method for prediction in settings where association between predictors and recurrent event outcomes is complex in nature. We also show the importance of incorporating past recurrent event history in prediction algorithms when event times are correlated within a subject. The proposed random forest algorithm is demonstrated using recurrent exacerbation data from the trial of Azithromycin for the Prevention of Exacerbations of Chronic Obstructive Pulmonary Disease.

复发事件在临床、医疗保健、社会和行为研究中很常见,但这些事件的动态风险预测方法有限。为了克服一些长期存在的问题,最近的回归分析框架构建了一个经过审查的纵向数据集,该数据集由多个预先指定的长度为$ tau $的后续窗口(XMT模型)中的第一个重复事件的时间组成。传统的回归模型与非线性和多方向的相互作用作斗争,其成功取决于统计程序员的技能。随着从遗传、基因组学和电子健康记录来源生成的潜在预测因子数量惊人,随机森林回归等机器学习方法越来越受欢迎,因为它们可以将来自许多预测因子的信息与预测中涉及的非线性和多向交互非参数化地结合起来。在本文中,我们(i)开发了一种随机森林方法,用于从重建的经审查的纵向数据集动态预测后续$ tau $持续时间随访期间剩余无事件的概率,(ii)修改XMT回归方法来预测这些相同的概率,但受传统回归模型通常具有的局限性的限制。(iii)演示如何在可能部分缺少这些信息的情况下,将患者特定的复发事件历史纳入预测。与改进的XMT方法相比,我们的随机森林算法在预测因子和复发事件结果之间的关联本质上是复杂的情况下,预测在$ tau $持续时间的随访窗口内剩余事件无概率的能力有所提高。我们还展示了当事件时间在主题内相关时,在预测算法中纳入过去循环事件历史的重要性。该随机森林算法使用阿奇霉素预防慢性阻塞性肺疾病加重试验的复发性加重数据进行了验证。
{"title":"Random forest for dynamic risk prediction of recurrent events: a pseudo-observation approach.","authors":"Abigail Loe, Susan Murray, Zhenke Wu","doi":"10.1093/biostatistics/kxaf007","DOIUrl":"10.1093/biostatistics/kxaf007","url":null,"abstract":"<p><p>Recurrent events are common in clinical, healthcare, social, and behavioral studies, yet methods for dynamic risk prediction of these events are limited. To overcome some long-standing challenges in analyzing censored recurrent event data, a recent regression analysis framework constructs a censored longitudinal dataset consisting of times to the first recurrent event in multiple pre-specified follow-up windows of length $ tau $(XMT models). Traditional regression models struggle with nonlinear and multiway interactions, with success depending on the skill of the statistical programmer. With a staggering number of potential predictors being generated from genetic, -omic, and electronic health records sources, machine learning approaches such as the random forest regression are growing in popularity, as they can nonparametrically incorporate information from many predictors with nonlinear and multiway interactions involved in prediction. In this article, we (i) develop a random forest approach for dynamically predicting probabilities of remaining event-free during a subsequent $ tau $-duration follow-up period from a reconstructed censored longitudinal data set, (ii) modify the XMT regression approach to predict these same probabilities, subject to the limitations that traditional regression models typically have, and (iii) demonstrate how to incorporate patient-specific history of recurrent events for prediction in settings where this information may be partially missing. We show the increased ability of our random forest algorithm for predicting the probability of remaining event-free over a $ tau $-duration follow-up window when compared to our modified XMT method for prediction in settings where association between predictors and recurrent event outcomes is complex in nature. We also show the importance of incorporating past recurrent event history in prediction algorithms when event times are correlated within a subject. The proposed random forest algorithm is demonstrated using recurrent exacerbation data from the trial of Azithromycin for the Prevention of Exacerbations of Chronic Obstructive Pulmonary Disease.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143626883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A marginal structural model for normal tissue complication probability. 正常组织并发症概率的边际结构模型。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae019
Thai-Son Tang, Zhihui Liu, Ali Hosni, John Kim, Olli Saarela

The goal of radiation therapy for cancer is to deliver prescribed radiation dose to the tumor while minimizing dose to the surrounding healthy tissues. To evaluate treatment plans, the dose distribution to healthy organs is commonly summarized as dose-volume histograms (DVHs). Normal tissue complication probability (NTCP) modeling has centered around making patient-level risk predictions with features extracted from the DVHs, but few have considered adapting a causal framework to evaluate the safety of alternative treatment plans. We propose causal estimands for NTCP based on deterministic and stochastic interventions, as well as propose estimators based on marginal structural models that impose bivariable monotonicity between dose, volume, and toxicity risk. The properties of these estimators are studied through simulations, and their use is illustrated in the context of radiotherapy treatment of anal canal cancer patients.

癌症放射治疗的目标是将规定的放射剂量输送到肿瘤,同时尽量减少对周围健康组织的剂量。为了评估治疗计划,通常将健康器官的剂量分布总结为剂量-体积直方图(DVH)。正常组织并发症概率(NTCP)建模的核心是利用从剂量-体积直方图中提取的特征进行患者层面的风险预测,但很少有人考虑采用因果框架来评估替代治疗方案的安全性。我们提出了基于确定性和随机性干预的 NTCP 因果估计值,并提出了基于边际结构模型的估计值,这些模型在剂量、容量和毒性风险之间施加了双变量单调性。通过模拟研究了这些估计器的特性,并以肛管癌患者的放疗治疗为例说明了它们的应用。
{"title":"A marginal structural model for normal tissue complication probability.","authors":"Thai-Son Tang, Zhihui Liu, Ali Hosni, John Kim, Olli Saarela","doi":"10.1093/biostatistics/kxae019","DOIUrl":"10.1093/biostatistics/kxae019","url":null,"abstract":"<p><p>The goal of radiation therapy for cancer is to deliver prescribed radiation dose to the tumor while minimizing dose to the surrounding healthy tissues. To evaluate treatment plans, the dose distribution to healthy organs is commonly summarized as dose-volume histograms (DVHs). Normal tissue complication probability (NTCP) modeling has centered around making patient-level risk predictions with features extracted from the DVHs, but few have considered adapting a causal framework to evaluate the safety of alternative treatment plans. We propose causal estimands for NTCP based on deterministic and stochastic interventions, as well as propose estimators based on marginal structural models that impose bivariable monotonicity between dose, volume, and toxicity risk. The properties of these estimators are studied through simulations, and their use is illustrated in the context of radiotherapy treatment of anal canal cancer patients.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823140/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141565187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling Schizophrenia: a study with generalized functional linear mixed model via the investigation of functional random effects. 揭示精神分裂症:基于功能随机效应的广义泛函线性混合模型研究。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae049
Rongxiang Rui, Wei Xiong, Jianxin Pan, Maozai Tian

Previous studies have identified attenuated pre-speech activity and speech sound suppression in individuals with Schizophrenia, with similar patterns observed in basic tasks entailing button-pressing to perceive a tone. However, it remains unclear whether these patterns are uniform across individuals or vary from person to person. Motivated by electroencephalographic (EEG) data from a Schizophrenia study, we develop a generalized functional linear mixed model (GFLMM) for repeated measurements by incorporating subject-specific functional random effects associated with multiple functional predictors. To assess the significance of these functional effects, we employ two different multivariate functional principal component analysis methods, which transform the GFLMM into a conventional generalized linear mixed model, thereby facilitating its implementation with standard software. Furthermore, we introduce a cutting-edge testing approach utilizing working responses to detect both subject-specific and predictor-specific functional random effects. Monte Carlo simulation studies demonstrate the effectiveness of our proposed testing method. Application of the proposed methods to the Schizophrenia data reveals significant subject-specific effects of human brain activity in the frontal zone (Fz) and the central zone (Cz), providing valuable insights into the potential variations among individuals, from healthy controls to those diagnosed with Schizophrenia.

先前的研究已经发现,精神分裂症患者的言语前活动和语音抑制减弱,在需要按下按钮来感知音调的基本任务中也观察到类似的模式。然而,目前尚不清楚这些模式是否在个体之间是一致的,还是因人而异。受一项精神分裂症研究的脑电图(EEG)数据的启发,我们开发了一种广义功能线性混合模型(GFLMM),通过纳入与多个功能预测因子相关的受试者特异性功能随机效应,用于重复测量。为了评估这些功能效应的重要性,我们采用了两种不同的多元功能主成分分析方法,将GFLMM转换为传统的广义线性混合模型,从而便于在标准软件中实现。此外,我们引入了一种尖端的测试方法,利用工作反应来检测受试者特定和预测者特定的功能随机效应。蒙特卡罗仿真研究证明了我们所提出的测试方法的有效性。将所提出的方法应用于精神分裂症数据,揭示了人类大脑额叶区(Fz)和中央区(Cz)活动的显著主体特异性影响,为从健康对照到被诊断为精神分裂症的个体之间的潜在差异提供了有价值的见解。
{"title":"Unveiling Schizophrenia: a study with generalized functional linear mixed model via the investigation of functional random effects.","authors":"Rongxiang Rui, Wei Xiong, Jianxin Pan, Maozai Tian","doi":"10.1093/biostatistics/kxae049","DOIUrl":"https://doi.org/10.1093/biostatistics/kxae049","url":null,"abstract":"<p><p>Previous studies have identified attenuated pre-speech activity and speech sound suppression in individuals with Schizophrenia, with similar patterns observed in basic tasks entailing button-pressing to perceive a tone. However, it remains unclear whether these patterns are uniform across individuals or vary from person to person. Motivated by electroencephalographic (EEG) data from a Schizophrenia study, we develop a generalized functional linear mixed model (GFLMM) for repeated measurements by incorporating subject-specific functional random effects associated with multiple functional predictors. To assess the significance of these functional effects, we employ two different multivariate functional principal component analysis methods, which transform the GFLMM into a conventional generalized linear mixed model, thereby facilitating its implementation with standard software. Furthermore, we introduce a cutting-edge testing approach utilizing working responses to detect both subject-specific and predictor-specific functional random effects. Monte Carlo simulation studies demonstrate the effectiveness of our proposed testing method. Application of the proposed methods to the Schizophrenia data reveals significant subject-specific effects of human brain activity in the frontal zone (Fz) and the central zone (Cz), providing valuable insights into the potential variations among individuals, from healthy controls to those diagnosed with Schizophrenia.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142933522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensitivity analysis for the probability of benefit in randomized controlled trials with a binary treatment and a binary outcome. 采用二元治疗和二元结局的随机对照试验中获益概率的敏感性分析。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf011
Iuliana Ciocănea-Teodorescu, Erin E Gabriel, Arvid Sjölander

For a comprehensive understanding of the effect of a given treatment on an outcome of interest, quantification of individual treatment heterogeneity is essential, alongside estimation of the average causal effect. However, even in randomized controlled trials, quantities such as the probability of benefit or the probability of harm are not identifiable, since multiple potential outcomes cannot be observed simultaneously for the same individual. We propose a sensitivity analysis for the probability of benefit in randomized controlled trial settings with a binary treatment and a binary outcome, by quantifying the deviation from conditional independence of the two potential outcomes, given a set of measured prognostic baseline covariates. We do this using a marginal sensitivity analysis parameter that does not depend on the number or complexity of the measured covariates. We provide a guide to estimation and interpretation, and illustrate our method in simulations, as well as using a real data example from a randomized controlled trial studying the effect of umbilical vein oxytocin administration on the need for manual removal of the placenta during birth.

为了全面了解给定治疗对目标结果的影响,除了估计平均因果效应外,还必须对个体治疗异质性进行量化。然而,即使在随机对照试验中,也无法确定诸如获益概率或伤害概率之类的数量,因为无法同时观察到同一个体的多种潜在结果。我们提出在随机对照试验设置中采用二元治疗和二元结果的获益概率的敏感性分析,通过量化两种潜在结果的条件独立性偏差,给定一组测量的预后基线协变量。我们使用不依赖于测量协变量的数量或复杂性的边际灵敏度分析参数来做到这一点。我们提供了一个估计和解释的指南,并在模拟中说明了我们的方法,并使用了一个随机对照试验的真实数据示例,研究了脐静脉催产素对分娩时人工移除胎盘需求的影响。
{"title":"Sensitivity analysis for the probability of benefit in randomized controlled trials with a binary treatment and a binary outcome.","authors":"Iuliana Ciocănea-Teodorescu, Erin E Gabriel, Arvid Sjölander","doi":"10.1093/biostatistics/kxaf011","DOIUrl":"10.1093/biostatistics/kxaf011","url":null,"abstract":"<p><p>For a comprehensive understanding of the effect of a given treatment on an outcome of interest, quantification of individual treatment heterogeneity is essential, alongside estimation of the average causal effect. However, even in randomized controlled trials, quantities such as the probability of benefit or the probability of harm are not identifiable, since multiple potential outcomes cannot be observed simultaneously for the same individual. We propose a sensitivity analysis for the probability of benefit in randomized controlled trial settings with a binary treatment and a binary outcome, by quantifying the deviation from conditional independence of the two potential outcomes, given a set of measured prognostic baseline covariates. We do this using a marginal sensitivity analysis parameter that does not depend on the number or complexity of the measured covariates. We provide a guide to estimation and interpretation, and illustrate our method in simulations, as well as using a real data example from a randomized controlled trial studying the effect of umbilical vein oxytocin administration on the need for manual removal of the placenta during birth.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12129078/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144210358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation and inference for causal spillover effects in egocentric-network randomized trials in the presence of network membership misclassification. 存在网络成员错误分类的自我中心网络随机试验中因果溢出效应的估计与推断。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf009
Ariel Chao, Donna Spiegelman, Ashley Buchanan, Laura Forastiere

To leverage peer influence and increase population behavioral changes, behavioral interventions often rely on peer-based strategies. A common study design that assesses such strategies is the egocentric-network randomized trial (ENRT), where index participants receive a behavioral training and are encouraged to disseminate information to their peers. Under this design, a crucial estimand of interest is the Average Spillover Effect (ASpE), which measures the impact of the intervention on participants who do not receive it, but whose outcomes may be affected by others who do. The assessment of the ASpE relies on assumptions about, and correct measurement of, interference sets within which individuals may influence one another's outcomes. It can be challenging to properly specify interference sets, such as networks in ENRTs, and when mismeasured, intervention effects estimated by existing methods will be biased. In studies where social networks play an important role in disease transmission or behavior change, correcting ASpE estimates for bias due to network misclassification is critical for accurately evaluating the full impact of interventions. We combined measurement error and causal inference methods to bias-correct the ASpE estimate for network misclassification in ENRTs, when surrogate networks are recorded in place of true ones, and validation data that relate the misclassified to the true networks are available. We investigated finite sample properties of our methods in an extensive simulation study and illustrated our methods in the HIV Prevention Trials Network (HPTN) 037 study.

为了利用同伴影响和增加人口行为改变,行为干预往往依赖于基于同伴的策略。评估这些策略的一种常见的研究设计是自我中心网络随机试验(ENRT),在该试验中,指数参与者接受行为训练,并鼓励他们向同伴传播信息。在这种设计下,一个重要的估计是平均溢出效应(ASpE),它衡量干预对没有接受干预的参与者的影响,但其结果可能受到其他参与者的影响。ASpE的评估依赖于对干扰集的假设和对干扰集的正确测量,在这些干扰集中,个体可能会影响彼此的结果。适当地指定干扰集(例如ENRTs中的网络)可能具有挑战性,并且当测量错误时,用现有方法估计的干预效果将存在偏差。在社会网络在疾病传播或行为改变中发挥重要作用的研究中,纠正由于网络错误分类而导致的ASpE估计偏差对于准确评估干预措施的全部影响至关重要。我们将测量误差和因果推理方法结合起来,对ENRTs中网络错误分类的ASpE估计进行偏差校正,当记录替代网络代替真实网络时,并且可以获得将错误分类与真实网络联系起来的验证数据。我们在广泛的模拟研究中研究了我们方法的有限样本特性,并在HIV预防试验网络(HPTN) 037研究中说明了我们的方法。
{"title":"Estimation and inference for causal spillover effects in egocentric-network randomized trials in the presence of network membership misclassification.","authors":"Ariel Chao, Donna Spiegelman, Ashley Buchanan, Laura Forastiere","doi":"10.1093/biostatistics/kxaf009","DOIUrl":"10.1093/biostatistics/kxaf009","url":null,"abstract":"<p><p>To leverage peer influence and increase population behavioral changes, behavioral interventions often rely on peer-based strategies. A common study design that assesses such strategies is the egocentric-network randomized trial (ENRT), where index participants receive a behavioral training and are encouraged to disseminate information to their peers. Under this design, a crucial estimand of interest is the Average Spillover Effect (ASpE), which measures the impact of the intervention on participants who do not receive it, but whose outcomes may be affected by others who do. The assessment of the ASpE relies on assumptions about, and correct measurement of, interference sets within which individuals may influence one another's outcomes. It can be challenging to properly specify interference sets, such as networks in ENRTs, and when mismeasured, intervention effects estimated by existing methods will be biased. In studies where social networks play an important role in disease transmission or behavior change, correcting ASpE estimates for bias due to network misclassification is critical for accurately evaluating the full impact of interventions. We combined measurement error and causal inference methods to bias-correct the ASpE estimate for network misclassification in ENRTs, when surrogate networks are recorded in place of true ones, and validation data that relate the misclassified to the true networks are available. We investigated finite sample properties of our methods in an extensive simulation study and illustrated our methods in the HIV Prevention Trials Network (HPTN) 037 study.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11955068/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143755648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decomposition of longitudinal disparities: an application to the fetal growth-singletons study. 纵向差异分解:在胎儿生长-单胎研究中的应用。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf044
Sang Kyu Lee, Seonjin Kim, Mi-Ok Kim, Katherine L Grantz, Hyokyoung G Hong

Addressing health disparities across demographic groups remains a critical challenge in public health, with significant gaps in understanding how these disparities evolve over time. This paper extends the traditional Peters-Belson decomposition to a longitudinal setting, focusing on the role of a single explanatory variable, referred to as a modifier, that captures complex interactions with other covariates. The proposed method partitions disparities into 3 components: (i) the portion associated with differences in the conditional distribution of covariates, evaluated under a common distribution of the modifier across groups; (ii) the portion arising from differences in the distribution of the modifier and its interactions with other covariates; and (iii) the unexplained disparity not accounted for by observed covariates. Rather than aggregating the first 2 components into one "explained disparity," the proposed method allows for a separate characterization of temporal patterns in disparities, distinguishing those that are unassociated with the modifier from those that are associated with it. We illustrate the method using a fetal growth study, examining disparities in fetal development trajectories across racial and ethnic groups during pregnancy.

解决人口群体之间的健康差异仍然是公共卫生领域的一项重大挑战,在了解这些差异如何随时间演变方面存在重大差距。本文将传统的彼得斯-贝尔森分解扩展到纵向设置,重点关注单个解释变量的作用,称为修饰符,它捕获了与其他协变量的复杂相互作用。该方法将差异划分为3个部分:(i)与协变量条件分布差异相关的部分,在组间修饰符的共同分布下进行评估;(ii)修饰语分布的差异及其与其他协变量的相互作用所产生的部分;(iii)未被观测到的协变量解释的无法解释的差异。与其将前两个成分聚合成一个“可解释的差异”,建议的方法允许对差异中的时间模式进行单独的表征,区分那些与修饰语无关的和那些与之相关的。我们用胎儿生长研究来说明这种方法,研究了怀孕期间不同种族和民族群体胎儿发育轨迹的差异。
{"title":"Decomposition of longitudinal disparities: an application to the fetal growth-singletons study.","authors":"Sang Kyu Lee, Seonjin Kim, Mi-Ok Kim, Katherine L Grantz, Hyokyoung G Hong","doi":"10.1093/biostatistics/kxaf044","DOIUrl":"10.1093/biostatistics/kxaf044","url":null,"abstract":"<p><p>Addressing health disparities across demographic groups remains a critical challenge in public health, with significant gaps in understanding how these disparities evolve over time. This paper extends the traditional Peters-Belson decomposition to a longitudinal setting, focusing on the role of a single explanatory variable, referred to as a modifier, that captures complex interactions with other covariates. The proposed method partitions disparities into 3 components: (i) the portion associated with differences in the conditional distribution of covariates, evaluated under a common distribution of the modifier across groups; (ii) the portion arising from differences in the distribution of the modifier and its interactions with other covariates; and (iii) the unexplained disparity not accounted for by observed covariates. Rather than aggregating the first 2 components into one \"explained disparity,\" the proposed method allows for a separate characterization of temporal patterns in disparities, distinguishing those that are unassociated with the modifier from those that are associated with it. We illustrate the method using a fetal growth study, examining disparities in fetal development trajectories across racial and ethnic groups during pregnancy.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12701353/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145744404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1