首页 > 最新文献

Biostatistics最新文献

英文 中文
Determining vaccine responders in the presence of baseline immunity using single-cell assays and paired control samples. 在基线免疫存在的情况下,使用单细胞试验和配对对照样本确定疫苗应答者。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf045
Zhe Chen, Siyu Heng, Asa Tapley, Stephen De Rosa, Bo Zhang

A key objective in vaccine studies is to evaluate vaccine-induced immunogenicity and determine whether participants have mounted a response to the vaccine. Cellular immune responses are essential for assessing vaccine-induced immunogenicity, and single-cell assays, such as intracellular cytokine staining (ICS) and B-cell phenotyping (BCP), are commonly employed to profile individual immune cell phenotypes and the cytokines they produce after stimulation. In this article, we introduce a novel statistical framework for identifying vaccine responders using ICS data collected before and after vaccination. This framework incorporates paired control data to account for potential unintended variations between assay runs, such as batch effects, that could lead to misclassification of participants as vaccine responders or non-responders. To formally integrate paired control data for accounting for assay variation across different time points (ie before and after vaccination), our proposed framework calculates and reports two $ P $-values, both adjusting for paired control data but in distinct ways: (i) the maximally adjusted $ P $-value, which applies the most conservative adjustment to the unadjusted $ P $-value, ensuring validity over all plausible batch effects consistent with the paired control samples' data, and (ii) the minimally adjusted $ P $-value, which imposes only the minimal adjustment to the unadjusted $ P $-value, such that the adjusted $ P $-value cannot be falsified by the paired control samples' data. Minimally and maximally adjusted $ P $-values offer a balanced approach to managing Type I error rates and statistical power in the presence of batch effects. We apply this framework to analyze ICS data collected at baseline and 4 wks post-vaccination from the COVID-19 Prevention Network (CoVPN) 3008 study. Our analysis helps address two clinical questions: (i) which participants exhibited evidence of an incident Omicron infection between baseline and 4 wks after receiving the final dose of the primary vaccination series, and (ii) which participants showed vaccine-induced T cell responses against the Omicron BA.4/5 Spike protein.

疫苗研究的一个关键目标是评估疫苗诱导的免疫原性,并确定参与者是否对疫苗产生了反应。细胞免疫应答对于评估疫苗诱导的免疫原性至关重要,单细胞试验,如细胞内细胞因子染色(ICS)和b细胞表型(BCP),通常用于分析个体免疫细胞表型及其在刺激后产生的细胞因子。在本文中,我们介绍了一种新的统计框架,用于使用接种前后收集的ICS数据来识别疫苗应答者。该框架纳入了成对对照数据,以解释分析运行之间潜在的意外变化,例如批量效应,这可能导致将参与者错误分类为疫苗应答者或无应答者。为了正式整合成对对照数据,以解释不同时间点(即接种疫苗之前和之后)的检测变化,我们提出的框架计算并报告两个P值,它们都对成对对照数据进行了调整,但方式不同:(i)最大调整的$ P $值,它对未调整的$ P $值应用最保守的调整,确保与成对对照样本数据一致的所有似是而非的批效应的有效性;(ii)最小调整的$ P $值,它只对未调整的$ P $值施加最小的调整,这样调整后的$ P $值就不会被成对对照样本的数据伪造。最小和最大调整的$ P $值提供了一种平衡的方法来管理第一类错误率和存在批处理效应的统计能力。我们应用这一框架分析了COVID-19预防网络(CoVPN) 3008研究在基线和接种疫苗后4周收集的ICS数据。我们的分析有助于解决两个临床问题:(i)哪些参与者在接受一次疫苗系列的最后剂量后的基线和4周之间表现出意外的Omicron感染的证据,以及(ii)哪些参与者表现出疫苗诱导的针对Omicron BA.4/5刺突蛋白的T细胞反应。
{"title":"Determining vaccine responders in the presence of baseline immunity using single-cell assays and paired control samples.","authors":"Zhe Chen, Siyu Heng, Asa Tapley, Stephen De Rosa, Bo Zhang","doi":"10.1093/biostatistics/kxaf045","DOIUrl":"https://doi.org/10.1093/biostatistics/kxaf045","url":null,"abstract":"<p><p>A key objective in vaccine studies is to evaluate vaccine-induced immunogenicity and determine whether participants have mounted a response to the vaccine. Cellular immune responses are essential for assessing vaccine-induced immunogenicity, and single-cell assays, such as intracellular cytokine staining (ICS) and B-cell phenotyping (BCP), are commonly employed to profile individual immune cell phenotypes and the cytokines they produce after stimulation. In this article, we introduce a novel statistical framework for identifying vaccine responders using ICS data collected before and after vaccination. This framework incorporates paired control data to account for potential unintended variations between assay runs, such as batch effects, that could lead to misclassification of participants as vaccine responders or non-responders. To formally integrate paired control data for accounting for assay variation across different time points (ie before and after vaccination), our proposed framework calculates and reports two $ P $-values, both adjusting for paired control data but in distinct ways: (i) the maximally adjusted $ P $-value, which applies the most conservative adjustment to the unadjusted $ P $-value, ensuring validity over all plausible batch effects consistent with the paired control samples' data, and (ii) the minimally adjusted $ P $-value, which imposes only the minimal adjustment to the unadjusted $ P $-value, such that the adjusted $ P $-value cannot be falsified by the paired control samples' data. Minimally and maximally adjusted $ P $-values offer a balanced approach to managing Type I error rates and statistical power in the presence of batch effects. We apply this framework to analyze ICS data collected at baseline and 4 wks post-vaccination from the COVID-19 Prevention Network (CoVPN) 3008 study. Our analysis helps address two clinical questions: (i) which participants exhibited evidence of an incident Omicron infection between baseline and 4 wks after receiving the final dose of the primary vaccination series, and (ii) which participants showed vaccine-induced T cell responses against the Omicron BA.4/5 Spike protein.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145643009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decomposition of longitudinal disparities: an application to the fetal growth-singletons study. 纵向差异分解:在胎儿生长-单胎研究中的应用。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf044
Sang Kyu Lee, Seonjin Kim, Mi-Ok Kim, Katherine L Grantz, Hyokyoung G Hong

Addressing health disparities across demographic groups remains a critical challenge in public health, with significant gaps in understanding how these disparities evolve over time. This paper extends the traditional Peters-Belson decomposition to a longitudinal setting, focusing on the role of a single explanatory variable, referred to as a modifier, that captures complex interactions with other covariates. The proposed method partitions disparities into 3 components: (i) the portion associated with differences in the conditional distribution of covariates, evaluated under a common distribution of the modifier across groups; (ii) the portion arising from differences in the distribution of the modifier and its interactions with other covariates; and (iii) the unexplained disparity not accounted for by observed covariates. Rather than aggregating the first 2 components into one "explained disparity," the proposed method allows for a separate characterization of temporal patterns in disparities, distinguishing those that are unassociated with the modifier from those that are associated with it. We illustrate the method using a fetal growth study, examining disparities in fetal development trajectories across racial and ethnic groups during pregnancy.

解决人口群体之间的健康差异仍然是公共卫生领域的一项重大挑战,在了解这些差异如何随时间演变方面存在重大差距。本文将传统的彼得斯-贝尔森分解扩展到纵向设置,重点关注单个解释变量的作用,称为修饰符,它捕获了与其他协变量的复杂相互作用。该方法将差异划分为3个部分:(i)与协变量条件分布差异相关的部分,在组间修饰符的共同分布下进行评估;(ii)修饰语分布的差异及其与其他协变量的相互作用所产生的部分;(iii)未被观测到的协变量解释的无法解释的差异。与其将前两个成分聚合成一个“可解释的差异”,建议的方法允许对差异中的时间模式进行单独的表征,区分那些与修饰语无关的和那些与之相关的。我们用胎儿生长研究来说明这种方法,研究了怀孕期间不同种族和民族群体胎儿发育轨迹的差异。
{"title":"Decomposition of longitudinal disparities: an application to the fetal growth-singletons study.","authors":"Sang Kyu Lee, Seonjin Kim, Mi-Ok Kim, Katherine L Grantz, Hyokyoung G Hong","doi":"10.1093/biostatistics/kxaf044","DOIUrl":"10.1093/biostatistics/kxaf044","url":null,"abstract":"<p><p>Addressing health disparities across demographic groups remains a critical challenge in public health, with significant gaps in understanding how these disparities evolve over time. This paper extends the traditional Peters-Belson decomposition to a longitudinal setting, focusing on the role of a single explanatory variable, referred to as a modifier, that captures complex interactions with other covariates. The proposed method partitions disparities into 3 components: (i) the portion associated with differences in the conditional distribution of covariates, evaluated under a common distribution of the modifier across groups; (ii) the portion arising from differences in the distribution of the modifier and its interactions with other covariates; and (iii) the unexplained disparity not accounted for by observed covariates. Rather than aggregating the first 2 components into one \"explained disparity,\" the proposed method allows for a separate characterization of temporal patterns in disparities, distinguishing those that are unassociated with the modifier from those that are associated with it. We illustrate the method using a fetal growth study, examining disparities in fetal development trajectories across racial and ethnic groups during pregnancy.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12701353/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145744404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding the opioid syndemic in North Carolina: A novel approach to modeling and identifying factors. 了解北卡罗莱纳州的阿片类药物综合征:一种建模和识别因素的新方法。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae052
Eva Murphy, David Kline, Kathleen L Egan, Kathryn E Lancaster, William C Miller, Lance A Waller, Staci A Hepler

The opioid epidemic is a significant public health challenge in North Carolina, but limited data restrict our understanding of its complexity. Examining trends and relationships among different outcomes believed to reflect opioid misuse provides an alternative perspective to understand the opioid epidemic. We use a Bayesian dynamic spatial factor model to capture the interrelated dynamics within six different county-level outcomes, such as illicit opioid overdose deaths, emergency department visits related to drug overdose, treatment counts for opioid use disorder, patients receiving prescriptions for buprenorphine, and newly diagnosed cases of acute and chronic hepatitis C virus and human immunodeficiency virus. We design the factor model to yield meaningful interactions among predefined subsets of these outcomes, causing a departure from the conventional lower triangular structure in the loadings matrix and leading to familiar identifiability issues. To address this challenge, we propose a novel approach that involves decomposing the loadings matrix within a Markov chain Monte Carlo algorithm, allowing us to estimate the loadings and factors uniquely. As a result, we gain a better understanding of the spatio-temporal dynamics of the opioid epidemic in North Carolina.

阿片类药物流行是北卡罗来纳州重大的公共卫生挑战,但有限的数据限制了我们对其复杂性的理解。研究被认为反映阿片类药物滥用的不同结果之间的趋势和关系,为了解阿片类药物流行提供了另一种视角。我们使用贝叶斯动态空间因子模型来捕捉六个不同县级结果的相关动态,例如非法阿片类药物过量死亡,与药物过量相关的急诊就诊,阿片类药物使用障碍的治疗计数,接受丁丙诺啡处方的患者,以及新诊断的急性和慢性丙型肝炎病毒和人类免疫缺陷病毒病例。我们设计了因子模型,以在这些结果的预定义子集之间产生有意义的相互作用,从而导致负载矩阵中传统的下三角形结构的偏离,并导致熟悉的可识别性问题。为了解决这一挑战,我们提出了一种新的方法,该方法涉及在马尔可夫链蒙特卡罗算法中分解负载矩阵,使我们能够唯一地估计负载和因素。因此,我们对北卡罗来纳州阿片类药物流行的时空动态有了更好的了解。
{"title":"Understanding the opioid syndemic in North Carolina: A novel approach to modeling and identifying factors.","authors":"Eva Murphy, David Kline, Kathleen L Egan, Kathryn E Lancaster, William C Miller, Lance A Waller, Staci A Hepler","doi":"10.1093/biostatistics/kxae052","DOIUrl":"10.1093/biostatistics/kxae052","url":null,"abstract":"<p><p>The opioid epidemic is a significant public health challenge in North Carolina, but limited data restrict our understanding of its complexity. Examining trends and relationships among different outcomes believed to reflect opioid misuse provides an alternative perspective to understand the opioid epidemic. We use a Bayesian dynamic spatial factor model to capture the interrelated dynamics within six different county-level outcomes, such as illicit opioid overdose deaths, emergency department visits related to drug overdose, treatment counts for opioid use disorder, patients receiving prescriptions for buprenorphine, and newly diagnosed cases of acute and chronic hepatitis C virus and human immunodeficiency virus. We design the factor model to yield meaningful interactions among predefined subsets of these outcomes, causing a departure from the conventional lower triangular structure in the loadings matrix and leading to familiar identifiability issues. To address this challenge, we propose a novel approach that involves decomposing the loadings matrix within a Markov chain Monte Carlo algorithm, allowing us to estimate the loadings and factors uniquely. As a result, we gain a better understanding of the spatio-temporal dynamics of the opioid epidemic in North Carolina.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823283/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143048855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mediation analysis with graph mediator. 使用图中介的中介分析。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf004
Yixi Xu, Yi Zhao

This study introduces a mediation analysis framework when the mediator is a graph. A Gaussian covariance graph model is assumed for graph presentation. Causal estimands and assumptions are discussed under this presentation. With a covariance matrix as the mediator, a low-rank representation is introduced and parametric mediation models are considered under the structural equation modeling framework. Assuming Gaussian random errors, likelihood-based estimators are introduced to simultaneously identify the low-rank representation and causal parameters. An efficient computational algorithm is proposed and asymptotic properties of the estimators are investigated. Via simulation studies, the performance of the proposed approach is evaluated. Applying to a resting-state fMRI study, a brain network is identified within which functional connectivity mediates the sex difference in the performance of a motor task.

本研究引入了一个以图为中介的中介分析框架。图的表示采用高斯协方差图模型。本报告将讨论因果估计和假设。以协方差矩阵为中介,引入低秩表示,在结构方程建模框架下考虑参数化中介模型。在假设高斯随机误差的情况下,引入基于似然的估计器来同时识别低秩表示和因果参数。提出了一种有效的计算算法,并研究了估计量的渐近性质。通过仿真研究,对该方法的性能进行了评价。应用静息状态fMRI研究,确定了一个大脑网络,其中功能连接介导了运动任务表现的性别差异。
{"title":"Mediation analysis with graph mediator.","authors":"Yixi Xu, Yi Zhao","doi":"10.1093/biostatistics/kxaf004","DOIUrl":"10.1093/biostatistics/kxaf004","url":null,"abstract":"<p><p>This study introduces a mediation analysis framework when the mediator is a graph. A Gaussian covariance graph model is assumed for graph presentation. Causal estimands and assumptions are discussed under this presentation. With a covariance matrix as the mediator, a low-rank representation is introduced and parametric mediation models are considered under the structural equation modeling framework. Assuming Gaussian random errors, likelihood-based estimators are introduced to simultaneously identify the low-rank representation and causal parameters. An efficient computational algorithm is proposed and asymptotic properties of the estimators are investigated. Via simulation studies, the performance of the proposed approach is evaluated. Applying to a resting-state fMRI study, a brain network is identified within which functional connectivity mediates the sex difference in the performance of a motor task.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11979487/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143626882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Markov switching zero-inflated space-time multinomial models for comparing multiple infectious diseases. 比较多种传染病的马尔可夫切换零膨胀时空多项模型。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf034
Dirk Douwes-Schultz, Alexandra M Schmidt, Laís Picinini Freitas, Marilia Sá Carvalho

Univariate zero-inflated models are increasingly being used to account for excess zeros in spatio-temporal infectious disease counts. However, the multivariate case is challenging due to the need to account for correlations across space, time and disease in both the count and zero-inflated components of the model. We are interested in comparing the transmission dynamics of several co-circulating infectious diseases across space and time, where some of the diseases can be absent for long periods. We first assume there is a baseline disease that is well-established and always present in the region. The other diseases switch between periods of presence and absence in each area through a series of coupled Markov chains, which account for long periods of disease absence, disease interactions and disease spread from neighboring areas. Since we are mainly interested in comparing the diseases, we assume the cases of the present diseases in an area jointly follow an autoregressive multinomial model. We use the multinomial model to investigate whether there are associations between certain factors, such as temperature, and differences in the transmission intensity of the diseases. Inference is performed using efficient Bayesian Markov chain Monte Carlo methods based on jointly sampling all unknown presence indicators. We apply the model to spatio-temporal counts of dengue, Zika, and chikungunya cases in Rio de Janeiro, during the first triple epidemic there.

单变量零膨胀模型越来越多地被用于解释时空传染病计数中的超额零。然而,多变量情况具有挑战性,因为需要在模型的计数和零膨胀成分中考虑到空间、时间和疾病之间的相关性。我们感兴趣的是比较几种共循环传染病在空间和时间上的传播动力学,其中一些疾病可以长时间不存在。我们首先假设存在一种基线疾病,该疾病在该地区得到确认并一直存在。其他疾病通过一系列耦合的马尔可夫链在每个地区存在和不存在的时期之间切换,这解释了长时间的疾病缺失,疾病相互作用和疾病从邻近地区传播。由于我们主要对疾病的比较感兴趣,我们假设一个地区的现有疾病病例共同遵循自回归多项式模型。我们使用多项模型来研究某些因素(如温度)与疾病传播强度的差异之间是否存在关联。基于联合采样所有未知存在指标,使用有效的贝叶斯马尔可夫链蒙特卡罗方法进行推理。我们将该模型应用于巴西里约热内卢首次三重流行期间登革热、寨卡和基孔肯雅病例的时空计数。
{"title":"Markov switching zero-inflated space-time multinomial models for comparing multiple infectious diseases.","authors":"Dirk Douwes-Schultz, Alexandra M Schmidt, Laís Picinini Freitas, Marilia Sá Carvalho","doi":"10.1093/biostatistics/kxaf034","DOIUrl":"10.1093/biostatistics/kxaf034","url":null,"abstract":"<p><p>Univariate zero-inflated models are increasingly being used to account for excess zeros in spatio-temporal infectious disease counts. However, the multivariate case is challenging due to the need to account for correlations across space, time and disease in both the count and zero-inflated components of the model. We are interested in comparing the transmission dynamics of several co-circulating infectious diseases across space and time, where some of the diseases can be absent for long periods. We first assume there is a baseline disease that is well-established and always present in the region. The other diseases switch between periods of presence and absence in each area through a series of coupled Markov chains, which account for long periods of disease absence, disease interactions and disease spread from neighboring areas. Since we are mainly interested in comparing the diseases, we assume the cases of the present diseases in an area jointly follow an autoregressive multinomial model. We use the multinomial model to investigate whether there are associations between certain factors, such as temperature, and differences in the transmission intensity of the diseases. Inference is performed using efficient Bayesian Markov chain Monte Carlo methods based on jointly sampling all unknown presence indicators. We apply the model to spatio-temporal counts of dengue, Zika, and chikungunya cases in Rio de Janeiro, during the first triple epidemic there.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12596980/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wastewater-based reproduction rates for epidemic curve reconstruction. 流行病曲线重建中基于废水的繁殖率。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf033
Emily Somerset, Justin J Slater, Patrick E Brown

We introduce a hierarchical Bayesian framework for reconstructing epidemic curves using under-reported case counts and wastewater data. Our approach models wastewater signals as differentiable Gaussian processes, enabling inference on their relative growth rates, which are used to define a wastewater-based reproduction rate. These estimates are incorporated into a binomially thinned Poisson autoregressive model for case counts using a modular inference strategy. We apply this framework to reconstruct the Covid-19 epidemic curve in Toronto, validating our model through out-of-sample forecasts and comparisons with independent serosurvey-based cumulative incidence estimates. We also apply the framework to New Zealand's Covid-19 data to reconstruct its epidemic curve and demonstrate improvements over an existing joint model for wastewater and case data. A key advantage of our framework, highlighted in this comparison, is that it does not rely on pre-specified constant parameters, allowing the model to better adapt to evolving pandemic conditions.

我们引入了一个层次贝叶斯框架,用于利用未报告的病例数和废水数据重建流行病曲线。我们的方法将废水信号建模为可微的高斯过程,从而可以推断其相对增长率,从而用于定义基于废水的繁殖率。这些估计被纳入一个二项稀释泊松自回归模型的情况下计数使用模块化推理策略。我们将该框架应用于重建多伦多的Covid-19流行曲线,通过样本外预测和与基于独立血清调查的累积发病率估计的比较来验证我们的模型。我们还将该框架应用于新西兰的Covid-19数据,以重建其流行曲线,并展示对现有废水和病例数据联合模型的改进。这一比较突出表明,我们的框架的一个关键优势是,它不依赖于预先指定的恒定参数,从而使模型能够更好地适应不断变化的大流行情况。
{"title":"Wastewater-based reproduction rates for epidemic curve reconstruction.","authors":"Emily Somerset, Justin J Slater, Patrick E Brown","doi":"10.1093/biostatistics/kxaf033","DOIUrl":"10.1093/biostatistics/kxaf033","url":null,"abstract":"<p><p>We introduce a hierarchical Bayesian framework for reconstructing epidemic curves using under-reported case counts and wastewater data. Our approach models wastewater signals as differentiable Gaussian processes, enabling inference on their relative growth rates, which are used to define a wastewater-based reproduction rate. These estimates are incorporated into a binomially thinned Poisson autoregressive model for case counts using a modular inference strategy. We apply this framework to reconstruct the Covid-19 epidemic curve in Toronto, validating our model through out-of-sample forecasts and comparisons with independent serosurvey-based cumulative incidence estimates. We also apply the framework to New Zealand's Covid-19 data to reconstruct its epidemic curve and demonstrate improvements over an existing joint model for wastewater and case data. A key advantage of our framework, highlighted in this comparison, is that it does not rely on pre-specified constant parameters, allowing the model to better adapt to evolving pandemic conditions.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12533577/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145314108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional quantile principal component analysis. 功能量化主成分分析
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae040
Álvaro Méndez-Civieta, Ying Wei, Keith M Diaz, Jeff Goldsmith

This paper introduces functional quantile principal component analysis (FQPCA), a dimensionality reduction technique that extends the concept of functional principal components analysis (FPCA) to the examination of participant-specific quantiles curves. Our approach borrows strength across participants to estimate patterns in quantiles, and uses participant-level data to estimate loadings on those patterns. As a result, FQPCA is able to capture shifts in the scale and distribution of data that affect participant-level quantile curves, and is also a robust methodology suitable for dealing with outliers, heteroscedastic data or skewed data. The need for such methodology is exemplified by physical activity data collected using wearable devices. Participants often differ in the timing and intensity of physical activity behaviors, and capturing information beyond the participant-level expected value curves produced by FPCA is necessary for a robust quantification of diurnal patterns of activity. We illustrate our methods using accelerometer data from the National Health and Nutrition Examination Survey, and produce participant-level 10%, 50%, and 90% quantile curves over 24 h of activity. The proposed methodology is supported by simulation results, and is available as an R package.

本文介绍了功能量化主成分分析(FQPCA),这是一种降维技术,它将功能主成分分析(FPCA)的概念扩展到了对特定参与者量化曲线的研究。我们的方法借用不同参与者的力量来估计量化曲线的模式,并使用参与者层面的数据来估计这些模式的载荷。因此,FQPCA 能够捕捉到数据规模和分布中影响参与者水平量化曲线的变化,也是一种适用于处理异常值、异方差数据或倾斜数据的稳健方法。使用可穿戴设备收集的身体活动数据就说明了对这种方法的需求。参与者的体力活动行为在时间和强度上往往各不相同,要想对昼夜活动模式进行稳健的量化,就必须捕捉 FPCA 生成的参与者级预期值曲线以外的信息。我们使用美国国家健康与营养调查的加速度计数据来说明我们的方法,并生成了参与者水平的 10%、50% 和 90% 的 24 小时活动量定量曲线。我们提出的方法得到了模拟结果的支持,并以 R 软件包的形式提供。
{"title":"Functional quantile principal component analysis.","authors":"Álvaro Méndez-Civieta, Ying Wei, Keith M Diaz, Jeff Goldsmith","doi":"10.1093/biostatistics/kxae040","DOIUrl":"10.1093/biostatistics/kxae040","url":null,"abstract":"<p><p>This paper introduces functional quantile principal component analysis (FQPCA), a dimensionality reduction technique that extends the concept of functional principal components analysis (FPCA) to the examination of participant-specific quantiles curves. Our approach borrows strength across participants to estimate patterns in quantiles, and uses participant-level data to estimate loadings on those patterns. As a result, FQPCA is able to capture shifts in the scale and distribution of data that affect participant-level quantile curves, and is also a robust methodology suitable for dealing with outliers, heteroscedastic data or skewed data. The need for such methodology is exemplified by physical activity data collected using wearable devices. Participants often differ in the timing and intensity of physical activity behaviors, and capturing information beyond the participant-level expected value curves produced by FPCA is necessary for a robust quantification of diurnal patterns of activity. We illustrate our methods using accelerometer data from the National Health and Nutrition Examination Survey, and produce participant-level 10%, 50%, and 90% quantile curves over 24 h of activity. The proposed methodology is supported by simulation results, and is available as an R package.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":" ","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823270/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142513407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian model averaging for partial ordering continual reassessment methods. 偏序连续重评价的贝叶斯模型平均方法。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf035
Luka Kovačević, Weishi Chen, Helen Barnett, Thomas Jaki, Pavel Mozgunov

Phase I clinical trials are essential to bringing novel therapies from chemical development to widespread use. Traditional approaches to dose-finding in Phase I trials, such as the '3 + 3' method and the continual reassessment method (CRM), provide a principled approach for escalating across dose levels. However, these methods lack the ability to incorporate uncertainty regarding the dose-toxicity ordering as found in combination drug trials. Under this setting, dose levels vary across multiple drugs simultaneously, leading to multiple possible dose-toxicity orderings. The CRM for partial ordering (POCRM) extends to these settings by allowing for multiple dose-toxicity orderings. In this work, it is shown that the POCRM is vulnerable to 'estimation incoherency' whereby toxicity estimates shift in an illogical way, threatening patient safety and undermining clinician trust in dose-finding models. To this end, the Bayesian model averaged POCRM (BMA-POCRM) is formalized. BMA-POCRM uses Bayesian model averaging to take into account all possible orderings simultaneously, reducing the frequency of estimation incoherencies. We derive novel theoretical guarantees on the estimation coherency of the POCRM and BMA-POCRM. The effectiveness of BMA-POCRM in drug combination settings is demonstrated through a specific instance of estimate incoherency of POCRM and simulation studies. The results highlight the improved safety, accuracy, and reduced occurrence of estimate incoherency in trials applying the BMA-POCRM relative to the POCRM model.

I期临床试验对于将新疗法从化学研发推向广泛应用至关重要。传统的I期试验剂量测定方法,如“3 + 3”方法和持续重新评估方法(CRM),提供了一种跨剂量水平递增的原则性方法。然而,这些方法缺乏结合在联合药物试验中发现的剂量-毒性顺序的不确定性的能力。在这种情况下,多种药物的剂量水平同时变化,导致多种可能的剂量毒性顺序。部分排序的CRM (POCRM)通过允许多个剂量毒性排序扩展到这些设置。在这项工作中,研究表明POCRM容易受到“估计不连贯”的影响,即毒性估计以一种不合逻辑的方式转移,威胁患者安全并破坏临床医生对剂量发现模型的信任。为此,将贝叶斯模型平均POCRM (BMA-POCRM)形式化。BMA-POCRM使用贝叶斯模型平均同时考虑所有可能的排序,减少了估计不相干的频率。我们对POCRM和BMA-POCRM的估计相干性给出了新的理论保证。BMA-POCRM在药物联合环境中的有效性通过POCRM的估计不一致性和仿真研究的具体实例得到了证明。结果表明,相对于POCRM模型,应用BMA-POCRM的试验提高了安全性、准确性,并减少了估计不一致性的发生。
{"title":"Bayesian model averaging for partial ordering continual reassessment methods.","authors":"Luka Kovačević, Weishi Chen, Helen Barnett, Thomas Jaki, Pavel Mozgunov","doi":"10.1093/biostatistics/kxaf035","DOIUrl":"10.1093/biostatistics/kxaf035","url":null,"abstract":"<p><p>Phase I clinical trials are essential to bringing novel therapies from chemical development to widespread use. Traditional approaches to dose-finding in Phase I trials, such as the '3 + 3' method and the continual reassessment method (CRM), provide a principled approach for escalating across dose levels. However, these methods lack the ability to incorporate uncertainty regarding the dose-toxicity ordering as found in combination drug trials. Under this setting, dose levels vary across multiple drugs simultaneously, leading to multiple possible dose-toxicity orderings. The CRM for partial ordering (POCRM) extends to these settings by allowing for multiple dose-toxicity orderings. In this work, it is shown that the POCRM is vulnerable to 'estimation incoherency' whereby toxicity estimates shift in an illogical way, threatening patient safety and undermining clinician trust in dose-finding models. To this end, the Bayesian model averaged POCRM (BMA-POCRM) is formalized. BMA-POCRM uses Bayesian model averaging to take into account all possible orderings simultaneously, reducing the frequency of estimation incoherencies. We derive novel theoretical guarantees on the estimation coherency of the POCRM and BMA-POCRM. The effectiveness of BMA-POCRM in drug combination settings is demonstrated through a specific instance of estimate incoherency of POCRM and simulation studies. The results highlight the improved safety, accuracy, and reduced occurrence of estimate incoherency in trials applying the BMA-POCRM relative to the POCRM model.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12538209/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145338280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variable-based probabilistic calibration with binary outcome. 二元结果的基于变量的概率校准。
IF 2 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf026
Hiroe Seto, Shuji Kitora, Asuka Oyama, Hiroshi Toki, Ryohei Yamamoto, Michio Yamamoto

In developing risk prediction models for specific diseases, it is essential to evaluate the calibration performance of the prediction model. Various methods have been proposed to assess the calibration of prediction models, but it has been pointed out that conventional methods based on the predicted probability of the model are insufficient to detect miscalibration. Another problem is that a method for evaluating calibration for continuous variables of interest has not yet been established. We therefore propose two methods to evaluate the calibration of the variable of interest: the variable-based probabilistic calibration plot (VPC-Plot), which is a visual assessment, and the variable-based probabilistic calibration error (VPCE), which is a corresponding evaluation metric. We conducted theoretical and simulation studies to investigate the properties and effectiveness of the proposed method. Theoretical and simulation studies demonstrated that the proposed methods can detect miscalibration by evaluating the calibration based on the variable of interest, even when conventional methods fail to detect miscalibration. To show the usefulness in the real-world data analysis, we evaluated diabetes prediction models developed using the national health insurance database for Osaka, Japan. We show that the proposed method can identify miscalibration of key covariate in a diabetes prediction model.

在建立特定疾病的风险预测模型时,评估预测模型的校准性能至关重要。人们提出了各种方法来评估预测模型的校准,但有人指出,传统的基于模型预测概率的方法不足以检测误校准。另一个问题是,对感兴趣的连续变量的校准评估方法尚未建立。因此,我们提出了两种评估感兴趣变量校准的方法:基于变量的概率校准图(vc - plot),这是一种视觉评估,以及基于变量的概率校准误差(VPCE),这是一个相应的评估指标。我们进行了理论和仿真研究,以调查所提出的方法的性质和有效性。理论和仿真研究表明,即使传统方法无法检测到误校准,该方法也可以通过评估基于感兴趣变量的校准来检测误校准。为了显示在现实世界数据分析中的有用性,我们评估了使用日本大阪国家健康保险数据库开发的糖尿病预测模型。我们表明,该方法可以识别糖尿病预测模型中关键协变量的误校正。
{"title":"Variable-based probabilistic calibration with binary outcome.","authors":"Hiroe Seto, Shuji Kitora, Asuka Oyama, Hiroshi Toki, Ryohei Yamamoto, Michio Yamamoto","doi":"10.1093/biostatistics/kxaf026","DOIUrl":"https://doi.org/10.1093/biostatistics/kxaf026","url":null,"abstract":"<p><p>In developing risk prediction models for specific diseases, it is essential to evaluate the calibration performance of the prediction model. Various methods have been proposed to assess the calibration of prediction models, but it has been pointed out that conventional methods based on the predicted probability of the model are insufficient to detect miscalibration. Another problem is that a method for evaluating calibration for continuous variables of interest has not yet been established. We therefore propose two methods to evaluate the calibration of the variable of interest: the variable-based probabilistic calibration plot (VPC-Plot), which is a visual assessment, and the variable-based probabilistic calibration error (VPCE), which is a corresponding evaluation metric. We conducted theoretical and simulation studies to investigate the properties and effectiveness of the proposed method. Theoretical and simulation studies demonstrated that the proposed methods can detect miscalibration by evaluating the calibration based on the variable of interest, even when conventional methods fail to detect miscalibration. To show the usefulness in the real-world data analysis, we evaluated diabetes prediction models developed using the national health insurance database for Osaka, Japan. We show that the proposed method can identify miscalibration of key covariate in a diabetes prediction model.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":2.0,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145314132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed lag interaction model with index modification. 具有索引修改的分布式滞后交互模型。
IF 1.8 3区 数学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf017
Danielle Demateis, Sandra India-Aldana, Robert O Wright, Rosalind J Wright, Andrea Baccarelli, Elena Colicino, Ander Wilson, Kayleigh P Keller

Epidemiological evidence supports an association between exposure to air pollution during pregnancy and birth and child health outcomes. Typically, such associations are estimated by regressing an outcome on daily or weekly measures of exposure during pregnancy using a distributed lag model. However, these associations may be modified by multiple factors. We propose a distributed lag interaction model with index modification that allows for effect modification of a functional predictor by a weighted average of multiple modifiers. Our model allows for simultaneous estimation of modifier index weights and the exposure-time-response function via a spline cross-basis in a Bayesian hierarchical framework. Through simulations, we showed that our model out-performs competing methods when there are multiple modifiers of unknown importance. We applied our proposed method to a Colorado birth cohort to estimate the association between birth weight and air pollution modified by a neighborhood-vulnerability index and to a Mexican birth cohort to estimate the association between birthing-parent cardio-metabolic endpoints and air pollution modified by a birthing-parent lifetime stress index.

流行病学证据支持在怀孕和分娩期间接触空气污染与儿童健康结果之间存在关联。通常,这种关联是通过使用分布滞后模型对怀孕期间每日或每周暴露量的结果进行回归来估计的。然而,这些关联可能受到多种因素的影响。我们提出了一个具有指数修正的分布式滞后交互模型,该模型允许通过多个修正因子的加权平均值对功能预测因子进行效果修正。我们的模型允许在贝叶斯层次框架中通过样条交叉基同时估计修正指标权重和暴露-时间-响应函数。通过仿真,我们发现当存在多个未知重要度的修饰符时,我们的模型优于竞争方法。我们将我们提出的方法应用于科罗拉多州的一个出生队列,通过邻居脆弱性指数来估计出生体重与空气污染之间的关系,并将其应用于墨西哥的一个出生队列,通过出生父母一生压力指数来估计出生父母心脏代谢终点与空气污染之间的关系。
{"title":"Distributed lag interaction model with index modification.","authors":"Danielle Demateis, Sandra India-Aldana, Robert O Wright, Rosalind J Wright, Andrea Baccarelli, Elena Colicino, Ander Wilson, Kayleigh P Keller","doi":"10.1093/biostatistics/kxaf017","DOIUrl":"10.1093/biostatistics/kxaf017","url":null,"abstract":"<p><p>Epidemiological evidence supports an association between exposure to air pollution during pregnancy and birth and child health outcomes. Typically, such associations are estimated by regressing an outcome on daily or weekly measures of exposure during pregnancy using a distributed lag model. However, these associations may be modified by multiple factors. We propose a distributed lag interaction model with index modification that allows for effect modification of a functional predictor by a weighted average of multiple modifiers. Our model allows for simultaneous estimation of modifier index weights and the exposure-time-response function via a spline cross-basis in a Bayesian hierarchical framework. Through simulations, we showed that our model out-performs competing methods when there are multiple modifiers of unknown importance. We applied our proposed method to a Colorado birth cohort to estimate the association between birth weight and air pollution modified by a neighborhood-vulnerability index and to a Mexican birth cohort to estimate the association between birthing-parent cardio-metabolic endpoints and air pollution modified by a birthing-parent lifetime stress index.</p>","PeriodicalId":55357,"journal":{"name":"Biostatistics","volume":"26 1","pages":""},"PeriodicalIF":1.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12205949/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144369549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biostatistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1