首页 > 最新文献

Biometrics最新文献

英文 中文
On exact randomization-based covariate-adjusted confidence intervals. 关于基于精确随机化的协变量调整置信区间。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae051
Jacob Fiksel

Randomization-based inference using the Fisher randomization test allows for the computation of Fisher-exact P-values, making it an attractive option for the analysis of small, randomized experiments with non-normal outcomes. Two common test statistics used to perform Fisher randomization tests are the difference-in-means between the treatment and control groups and the covariate-adjusted version of the difference-in-means using analysis of covariance. Modern computing allows for fast computation of the Fisher-exact P-value, but confidence intervals have typically been obtained by inverting the Fisher randomization test over a range of possible effect sizes. The test inversion procedure is computationally expensive, limiting the usage of randomization-based inference in applied work. A recent paper by Zhu and Liu developed a closed form expression for the randomization-based confidence interval using the difference-in-means statistic. We develop an important extension of Zhu and Liu to obtain a closed form expression for the randomization-based covariate-adjusted confidence interval and give practitioners a sufficiency condition that can be checked using observed data and that guarantees that these confidence intervals have correct coverage. Simulations show that our procedure generates randomization-based covariate-adjusted confidence intervals that are robust to non-normality and that can be calculated in nearly the same time as it takes to calculate the Fisher-exact P-value, thus removing the computational barrier to performing randomization-based inference when adjusting for covariates. We also demonstrate our method on a re-analysis of phase I clinical trial data.

使用费舍尔随机化检验进行基于随机化的推断,可以计算费舍尔精确 P 值,使其成为分析具有非正态性结果的小型随机实验的一个有吸引力的选择。进行费舍尔随机化检验常用的两个检验统计量是治疗组和对照组的均值差和使用协方差分析的均值差的协方差调整版本。现代计算技术可以快速计算费雪精确 P 值,但置信区间通常是通过在一系列可能的效应大小范围内反转费雪随机检验来获得的。检验反演过程的计算成本很高,限制了基于随机化的推断在应用工作中的使用。Zhu 和 Liu 最近发表的一篇论文利用均值差统计量开发了基于随机化的置信区间的封闭式表达式。我们在 Zhu 和 Liu 的基础上进行了重要扩展,得到了基于随机化的协变量调整置信区间的闭合形式表达式,并为实践者提供了一个充分条件,该条件可以使用观测数据进行检验,并保证这些置信区间具有正确的覆盖范围。模拟结果表明,我们的程序生成的基于随机化协变量调整的置信区间对非正态性具有稳健性,计算时间几乎与计算 Fisher 精确 P 值的时间相同,从而消除了在调整协变量时进行基于随机化推断的计算障碍。我们还在一期临床试验数据的重新分析中演示了我们的方法。
{"title":"On exact randomization-based covariate-adjusted confidence intervals.","authors":"Jacob Fiksel","doi":"10.1093/biomtc/ujae051","DOIUrl":"https://doi.org/10.1093/biomtc/ujae051","url":null,"abstract":"<p><p>Randomization-based inference using the Fisher randomization test allows for the computation of Fisher-exact P-values, making it an attractive option for the analysis of small, randomized experiments with non-normal outcomes. Two common test statistics used to perform Fisher randomization tests are the difference-in-means between the treatment and control groups and the covariate-adjusted version of the difference-in-means using analysis of covariance. Modern computing allows for fast computation of the Fisher-exact P-value, but confidence intervals have typically been obtained by inverting the Fisher randomization test over a range of possible effect sizes. The test inversion procedure is computationally expensive, limiting the usage of randomization-based inference in applied work. A recent paper by Zhu and Liu developed a closed form expression for the randomization-based confidence interval using the difference-in-means statistic. We develop an important extension of Zhu and Liu to obtain a closed form expression for the randomization-based covariate-adjusted confidence interval and give practitioners a sufficiency condition that can be checked using observed data and that guarantees that these confidence intervals have correct coverage. Simulations show that our procedure generates randomization-based covariate-adjusted confidence intervals that are robust to non-normality and that can be calculated in nearly the same time as it takes to calculate the Fisher-exact P-value, thus removing the computational barrier to performing randomization-based inference when adjusting for covariates. We also demonstrate our method on a re-analysis of phase I clinical trial data.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141260413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian meta-analysis of penetrance for cancer risk. 癌症风险渗透性的贝叶斯荟萃分析。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae038
Thanthirige Lakshika M Ruberu, Danielle Braun, Giovanni Parmigiani, Swati Biswas

Multi-gene panel testing allows many cancer susceptibility genes to be tested quickly at a lower cost making such testing accessible to a broader population. Thus, more patients carrying pathogenic germline mutations in various cancer-susceptibility genes are being identified. This creates a great opportunity, as well as an urgent need, to counsel these patients about appropriate risk-reducing management strategies. Counseling hinges on accurate estimates of age-specific risks of developing various cancers associated with mutations in a specific gene, ie, penetrance estimation. We propose a meta-analysis approach based on a Bayesian hierarchical random-effects model to obtain penetrance estimates by integrating studies reporting different types of risk measures (eg, penetrance, relative risk, odds ratio) while accounting for the associated uncertainties. After estimating posterior distributions of the parameters via a Markov chain Monte Carlo algorithm, we estimate penetrance and credible intervals. We investigate the proposed method and compare with an existing approach via simulations based on studies reporting risks for two moderate-risk breast cancer susceptibility genes, ATM and PALB2. Our proposed method is far superior in terms of coverage probability of credible intervals and mean square error of estimates. Finally, we apply our method to estimate the penetrance of breast cancer among carriers of pathogenic mutations in the ATM gene.

多基因面板检测能以较低的成本快速检测许多癌症易感基因,使更多的人可以接受这种检测。因此,越来越多携带各种癌症易感基因致病性种系突变的患者被发现。这为我们提供了一个绝佳的机会,同时也迫切需要为这些患者提供咨询,使其了解适当的降低风险管理策略。咨询取决于对与特定基因突变相关的各种癌症的特定年龄风险的准确估计,即渗透率估计。我们提出了一种基于贝叶斯分层随机效应模型的荟萃分析方法,通过整合报告不同类型风险度量(如渗透率、相对风险、几率比率)的研究,同时考虑相关的不确定性,来获得渗透率估计值。通过马尔科夫链蒙特卡洛算法估计参数的后验分布后,我们估计了渗透率和可信区间。我们根据对两个中度风险乳腺癌易感基因(ATM 和 PALB2)的风险报告研究,对所提出的方法进行了研究,并通过模拟与现有方法进行了比较。就可信区间的覆盖概率和估计值的均方误差而言,我们提出的方法要优越得多。最后,我们将我们的方法应用于估计 ATM 基因致病突变携带者患乳腺癌的渗透率。
{"title":"Bayesian meta-analysis of penetrance for cancer risk.","authors":"Thanthirige Lakshika M Ruberu, Danielle Braun, Giovanni Parmigiani, Swati Biswas","doi":"10.1093/biomtc/ujae038","DOIUrl":"10.1093/biomtc/ujae038","url":null,"abstract":"<p><p>Multi-gene panel testing allows many cancer susceptibility genes to be tested quickly at a lower cost making such testing accessible to a broader population. Thus, more patients carrying pathogenic germline mutations in various cancer-susceptibility genes are being identified. This creates a great opportunity, as well as an urgent need, to counsel these patients about appropriate risk-reducing management strategies. Counseling hinges on accurate estimates of age-specific risks of developing various cancers associated with mutations in a specific gene, ie, penetrance estimation. We propose a meta-analysis approach based on a Bayesian hierarchical random-effects model to obtain penetrance estimates by integrating studies reporting different types of risk measures (eg, penetrance, relative risk, odds ratio) while accounting for the associated uncertainties. After estimating posterior distributions of the parameters via a Markov chain Monte Carlo algorithm, we estimate penetrance and credible intervals. We investigate the proposed method and compare with an existing approach via simulations based on studies reporting risks for two moderate-risk breast cancer susceptibility genes, ATM and PALB2. Our proposed method is far superior in terms of coverage probability of credible intervals and mean square error of estimates. Finally, we apply our method to estimate the penetrance of breast cancer among carriers of pathogenic mutations in the ATM gene.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11140851/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141178675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Single proxy control. 单一代理控制。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae027
Chan Park, David B Richardson, Eric J Tchetgen Tchetgen

Negative control variables are sometimes used in nonexperimental studies to detect the presence of confounding by hidden factors. A negative control outcome (NCO) is an outcome that is influenced by unobserved confounders of the exposure effects on the outcome in view, but is not causally impacted by the exposure. Tchetgen Tchetgen (2013) introduced the Control Outcome Calibration Approach (COCA) as a formal NCO counterfactual method to detect and correct for residual confounding bias. For identification, COCA treats the NCO as an error-prone proxy of the treatment-free counterfactual outcome of interest, and involves regressing the NCO on the treatment-free counterfactual, together with a rank-preserving structural model, which assumes a constant individual-level causal effect. In this work, we establish nonparametric COCA identification for the average causal effect for the treated, without requiring rank-preservation, therefore accommodating unrestricted effect heterogeneity across units. This nonparametric identification result has important practical implications, as it provides single-proxy confounding control, in contrast to recently proposed proximal causal inference, which relies for identification on a pair of confounding proxies. For COCA estimation we propose 3 separate strategies: (i) an extended propensity score approach, (ii) an outcome bridge function approach, and (iii) a doubly-robust approach. Finally, we illustrate the proposed methods in an application evaluating the causal impact of a Zika virus outbreak on birth rate in Brazil.

在非实验研究中,有时会使用负控制变量来检测是否存在隐藏因素的干扰。负控制结果(NCO)是指受暴露对结果影响的未观测混杂因素影响,但不受暴露因果影响的结果。Tchetgen Tchetgen(2013 年)介绍了控制结果校准方法(COCA),作为一种正式的 NCO 反事实方法,用于检测和纠正残余混杂偏差。为了进行识别,COCA 将 NCO 视为相关无治疗反事实结果的易出错替代物,并将 NCO 与无治疗反事实结果进行回归,同时采用等级保护结构模型,该模型假定个体水平的因果效应不变。在这项工作中,我们为受治疗者的平均因果效应建立了非参数 COCA 识别,而不需要等级保留,因此可以适应各单位间不受限制的效应异质性。这一非参数识别结果具有重要的实际意义,因为它提供了单一代理混杂控制,这与最近提出的近似因果推断不同,后者依赖于一对混杂代理进行识别。对于 COCA 估算,我们提出了三种不同的策略:(i) 扩展倾向得分法,(ii) 结果桥函数法,(iii) 双重稳健法。最后,我们在评估寨卡病毒爆发对巴西出生率的因果影响的应用中说明了所提出的方法。
{"title":"Single proxy control.","authors":"Chan Park, David B Richardson, Eric J Tchetgen Tchetgen","doi":"10.1093/biomtc/ujae027","DOIUrl":"https://doi.org/10.1093/biomtc/ujae027","url":null,"abstract":"<p><p>Negative control variables are sometimes used in nonexperimental studies to detect the presence of confounding by hidden factors. A negative control outcome (NCO) is an outcome that is influenced by unobserved confounders of the exposure effects on the outcome in view, but is not causally impacted by the exposure. Tchetgen Tchetgen (2013) introduced the Control Outcome Calibration Approach (COCA) as a formal NCO counterfactual method to detect and correct for residual confounding bias. For identification, COCA treats the NCO as an error-prone proxy of the treatment-free counterfactual outcome of interest, and involves regressing the NCO on the treatment-free counterfactual, together with a rank-preserving structural model, which assumes a constant individual-level causal effect. In this work, we establish nonparametric COCA identification for the average causal effect for the treated, without requiring rank-preservation, therefore accommodating unrestricted effect heterogeneity across units. This nonparametric identification result has important practical implications, as it provides single-proxy confounding control, in contrast to recently proposed proximal causal inference, which relies for identification on a pair of confounding proxies. For COCA estimation we propose 3 separate strategies: (i) an extended propensity score approach, (ii) an outcome bridge function approach, and (iii) a doubly-robust approach. Finally, we illustrate the proposed methods in an application evaluating the causal impact of a Zika virus outbreak on birth rate in Brazil.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11033710/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140847741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discussion on "Bayesian meta-analysis of penetrance for cancer risk" by Thanthirige Lakshika M. Ruberu, Danielle Braun, Giovanni Parmigiani, and Swati Biswas. Thanthirige Lakshika M. Ruberu、Danielle Braun、Giovanni Parmigiani 和 Swati Biswas 关于 "癌症风险渗透的贝叶斯元分析 "的讨论。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae042
Peter Müller, Bernardo Flores

Ruberu et al. (2023) introduce an elegant approach to fit a complicated meta-analysis problem with diverse reporting modalities into the framework of hierarchical Bayesian inference. We discuss issues related to some of the involved parametric model assumptions.

Ruberu 等人(2023 年)介绍了一种优雅的方法,将具有不同报告模式的复杂荟萃分析问题纳入分层贝叶斯推断框架。我们讨论了涉及参数模型假设的一些相关问题。
{"title":"Discussion on \"Bayesian meta-analysis of penetrance for cancer risk\" by Thanthirige Lakshika M. Ruberu, Danielle Braun, Giovanni Parmigiani, and Swati Biswas.","authors":"Peter Müller, Bernardo Flores","doi":"10.1093/biomtc/ujae042","DOIUrl":"https://doi.org/10.1093/biomtc/ujae042","url":null,"abstract":"<p><p>Ruberu et al. (2023) introduce an elegant approach to fit a complicated meta-analysis problem with diverse reporting modalities into the framework of hierarchical Bayesian inference. We discuss issues related to some of the involved parametric model assumptions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141178723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Confounder-dependent Bayesian mixture model: Characterizing heterogeneity of causal effects in air pollution epidemiology. 依赖于混杂因素的贝叶斯混合物模型:描述空气污染流行病学中因果效应的异质性。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae025
Dafne Zorzetto, Falco J Bargagli-Stoffi, Antonio Canale, Francesca Dominici

Several epidemiological studies have provided evidence that long-term exposure to fine particulate matter (pm2.5) increases mortality rate. Furthermore, some population characteristics (e.g., age, race, and socioeconomic status) might play a crucial role in understanding vulnerability to air pollution. To inform policy, it is necessary to identify groups of the population that are more or less vulnerable to air pollution. In causal inference literature, the group average treatment effect (GATE) is a distinctive facet of the conditional average treatment effect. This widely employed metric serves to characterize the heterogeneity of a treatment effect based on some population characteristics. In this paper, we introduce a novel Confounder-Dependent Bayesian Mixture Model (CDBMM) to characterize causal effect heterogeneity. More specifically, our method leverages the flexibility of the dependent Dirichlet process to model the distribution of the potential outcomes conditionally to the covariates and the treatment levels, thus enabling us to: (i) identify heterogeneous and mutually exclusive population groups defined by similar GATEs in a data-driven way, and (ii) estimate and characterize the causal effects within each of the identified groups. Through simulations, we demonstrate the effectiveness of our method in uncovering key insights about treatment effects heterogeneity. We apply our method to claims data from Medicare enrollees in Texas. We found six mutually exclusive groups where the causal effects of pm2.5 on mortality rate are heterogeneous.

多项流行病学研究证明,长期暴露于细颗粒物(pm2.5)会增加死亡率。此外,一些人口特征(如年龄、种族和社会经济地位)可能在了解易受空气污染影响的程度方面起着至关重要的作用。为了给政策提供依据,有必要确定哪些人群更容易或不容易受到空气污染的影响。在因果推理文献中,群体平均治疗效果 (GATE) 是条件平均治疗效果的一个独特方面。这个被广泛使用的指标用于描述基于某些人口特征的治疗效果的异质性。在本文中,我们引入了一种新颖的混杂因素依赖贝叶斯混杂模型(CDBMM)来描述因果效应异质性。更具体地说,我们的方法利用了依赖性 Dirichlet 过程的灵活性,对协变因素和治疗水平条件下的潜在结果分布进行建模,从而使我们能够:(i) 以数据驱动的方式识别由相似 GATEs 定义的异质性和相互排斥的人群组;(ii) 估计和描述每个已识别人群组内的因果效应。通过模拟,我们证明了我们的方法在揭示治疗效果异质性的关键见解方面的有效性。我们将我们的方法应用于德克萨斯州医疗保险参保者的报销数据。我们发现 pm2.5 对死亡率的因果效应具有异质性的六个相互排斥的组别。
{"title":"Confounder-dependent Bayesian mixture model: Characterizing heterogeneity of causal effects in air pollution epidemiology.","authors":"Dafne Zorzetto, Falco J Bargagli-Stoffi, Antonio Canale, Francesca Dominici","doi":"10.1093/biomtc/ujae025","DOIUrl":"https://doi.org/10.1093/biomtc/ujae025","url":null,"abstract":"<p><p>Several epidemiological studies have provided evidence that long-term exposure to fine particulate matter (pm2.5) increases mortality rate. Furthermore, some population characteristics (e.g., age, race, and socioeconomic status) might play a crucial role in understanding vulnerability to air pollution. To inform policy, it is necessary to identify groups of the population that are more or less vulnerable to air pollution. In causal inference literature, the group average treatment effect (GATE) is a distinctive facet of the conditional average treatment effect. This widely employed metric serves to characterize the heterogeneity of a treatment effect based on some population characteristics. In this paper, we introduce a novel Confounder-Dependent Bayesian Mixture Model (CDBMM) to characterize causal effect heterogeneity. More specifically, our method leverages the flexibility of the dependent Dirichlet process to model the distribution of the potential outcomes conditionally to the covariates and the treatment levels, thus enabling us to: (i) identify heterogeneous and mutually exclusive population groups defined by similar GATEs in a data-driven way, and (ii) estimate and characterize the causal effects within each of the identified groups. Through simulations, we demonstrate the effectiveness of our method in uncovering key insights about treatment effects heterogeneity. We apply our method to claims data from Medicare enrollees in Texas. We found six mutually exclusive groups where the causal effects of pm2.5 on mortality rate are heterogeneous.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11028589/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140874452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying the HIV reservoir with dilution assays and deep viral sequencing. 利用稀释测定和深度病毒测序量化艾滋病毒库。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-01-29 DOI: 10.1093/biomtc/ujad018
Sarah C Lotspeich, Brian D Richardson, Pedro L Baldoni, Kimberly P Enders, Michael G Hudgens

People living with HIV on antiretroviral therapy often have undetectable virus levels by standard assays, but "latent" HIV still persists in viral reservoirs. Eliminating these reservoirs is the goal of HIV cure research. The quantitative viral outgrowth assay (QVOA) is commonly used to estimate the reservoir size, that is, the infectious units per million (IUPM) of HIV-persistent resting CD4+ T cells. A new variation of the QVOA, the ultra deep sequencing assay of the outgrowth virus (UDSA), was recently developed that further quantifies the number of viral lineages within a subset of infected wells. Performing the UDSA on a subset of wells provides additional information that can improve IUPM estimation. This paper considers statistical inference about the IUPM from combined dilution assay (QVOA) and deep viral sequencing (UDSA) data, even when some deep sequencing data are missing. Methods are proposed to accommodate assays with wells sequenced at multiple dilution levels and with imperfect sensitivity and specificity, and a novel bias-corrected estimator is included for small samples. The proposed methods are evaluated in a simulation study, applied to data from the University of North Carolina HIV Cure Center, and implemented in the open-source R package SLDeepAssay.

接受抗逆转录病毒治疗的艾滋病病毒感染者通常在标准检测方法中检测不到病毒,但 "潜伏 "的艾滋病病毒仍然存在于病毒库中。消除这些病毒库是艾滋病治愈研究的目标。定量病毒外生测定法(QVOA)通常用于估算病毒库的规模,即静息 CD4+ T 细胞中每百万感染单位(IUPM)的 HIV 感染率。最近又开发了一种 QVOA 的新变体--生长期病毒超深度测序分析法(UDSA),它能进一步量化感染井子集内的病毒系数量。在感染井子集上进行 UDSA 可提供额外信息,从而改进 IUPM 估算。本文考虑从联合稀释测定(QVOA)和深度病毒测序(UDSA)数据中对 IUPM 进行统计推断,即使某些深度测序数据缺失。本文提出的方法适用于多稀释水平测序井、灵敏度和特异性不完善的检测,还包括一种适用于小样本的新型偏差校正估算器。我们在模拟研究中对所提出的方法进行了评估,并将其应用于北卡罗来纳大学艾滋病治疗中心的数据,并在开源 R 软件包 SLDeepAssay 中实现。
{"title":"Quantifying the HIV reservoir with dilution assays and deep viral sequencing.","authors":"Sarah C Lotspeich, Brian D Richardson, Pedro L Baldoni, Kimberly P Enders, Michael G Hudgens","doi":"10.1093/biomtc/ujad018","DOIUrl":"10.1093/biomtc/ujad018","url":null,"abstract":"<p><p>People living with HIV on antiretroviral therapy often have undetectable virus levels by standard assays, but \"latent\" HIV still persists in viral reservoirs. Eliminating these reservoirs is the goal of HIV cure research. The quantitative viral outgrowth assay (QVOA) is commonly used to estimate the reservoir size, that is, the infectious units per million (IUPM) of HIV-persistent resting CD4+ T cells. A new variation of the QVOA, the ultra deep sequencing assay of the outgrowth virus (UDSA), was recently developed that further quantifies the number of viral lineages within a subset of infected wells. Performing the UDSA on a subset of wells provides additional information that can improve IUPM estimation. This paper considers statistical inference about the IUPM from combined dilution assay (QVOA) and deep viral sequencing (UDSA) data, even when some deep sequencing data are missing. Methods are proposed to accommodate assays with wells sequenced at multiple dilution levels and with imperfect sensitivity and specificity, and a novel bias-corrected estimator is included for small samples. The proposed methods are evaluated in a simulation study, applied to data from the University of North Carolina HIV Cure Center, and implemented in the open-source R package SLDeepAssay.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10873562/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139745915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-phase designs with failure time processes subject to nonsusceptibility. 两阶段设计的失效时间过程受非敏感性影响。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-01-29 DOI: 10.1093/biomtc/ujad038
Fangya Mao, Li C Cheung, Richard J Cook

Epidemiological studies based on 2-phase designs help ensure efficient use of limited resources in situations where certain covariates are prohibitively expensive to measure for a full cohort. Typically, these designs involve 2 steps: In phase I, data on an outcome and inexpensive covariates are acquired, and in phase II, a subsample is chosen in which the costly variable of interest is measured. For right-censored data, 2-phase designs have been primarily based on the Cox model. We develop efficient 2-phase design strategies for settings involving a fraction of long-term survivors due to nonsusceptibility. Using mixture models accommodating a nonsusceptible fraction, we consider 3 regression frameworks, including (a) a logistic "cure" model, (b) a proportional hazards model for those who are susceptible, and (c) regression models for susceptibility and failure time in those susceptible. Importantly, we introduce a novel class of bivariate residual-dependent designs to address the unique challenges presented in scenario (c), which involves 2 parameters of interest. Extensive simulation studies demonstrate the superiority of our approach over various phase II subsampling schemes. We illustrate the method through applications to the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial.

基于两阶段设计的流行病学研究有助于确保在某些协变量的测量成本过高而无法对整个队列进行测量的情况下,有效利用有限的资源。通常,这些设计包括两个步骤:在第一阶段,获取结果和廉价协变量的数据;在第二阶段,选择一个子样本,测量昂贵的相关变量。对于右删失数据,两阶段设计主要基于 Cox 模型。我们开发了高效的两阶段设计策略,适用于因非易感性而涉及部分长期存活者的情况。通过使用可容纳非易感人群的混合模型,我们考虑了 3 种回归框架,包括:(a) 逻辑 "治愈 "模型;(b) 易感人群的比例危险模型;(c) 易感人群的易感性和失败时间回归模型。重要的是,我们引入了一类新型的双变量残差依赖设计,以应对方案(c)中涉及两个相关参数的独特挑战。广泛的模拟研究证明,我们的方法优于各种第二阶段子采样方案。我们通过应用于前列腺癌、肺癌、结肠直肠癌和卵巢癌筛查试验来说明这种方法。
{"title":"Two-phase designs with failure time processes subject to nonsusceptibility.","authors":"Fangya Mao, Li C Cheung, Richard J Cook","doi":"10.1093/biomtc/ujad038","DOIUrl":"10.1093/biomtc/ujad038","url":null,"abstract":"<p><p>Epidemiological studies based on 2-phase designs help ensure efficient use of limited resources in situations where certain covariates are prohibitively expensive to measure for a full cohort. Typically, these designs involve 2 steps: In phase I, data on an outcome and inexpensive covariates are acquired, and in phase II, a subsample is chosen in which the costly variable of interest is measured. For right-censored data, 2-phase designs have been primarily based on the Cox model. We develop efficient 2-phase design strategies for settings involving a fraction of long-term survivors due to nonsusceptibility. Using mixture models accommodating a nonsusceptible fraction, we consider 3 regression frameworks, including (a) a logistic \"cure\" model, (b) a proportional hazards model for those who are susceptible, and (c) regression models for susceptibility and failure time in those susceptible. Importantly, we introduce a novel class of bivariate residual-dependent designs to address the unique challenges presented in scenario (c), which involves 2 parameters of interest. Extensive simulation studies demonstrate the superiority of our approach over various phase II subsampling schemes. We illustrate the method through applications to the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140038631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rejoinder to the discussion on "The central role of the identifying assumption in population size estimation". 对 "识别假设在种群规模估计中的核心作用 "讨论的再评论。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-01-29 DOI: 10.1093/biomtc/ujad033
Serge Aleshin-Guendel, Mauricio Sadinle, Jon Wakefield

We organize the discussants' major comments into the following categories: sensitivity analyses, zero counts, model selection, the marginal no-highest-order interaction (NHOI) assumption, and the usefulness of our proposed framework.

我们将讨论者的主要意见归纳为以下几类:敏感性分析、零计数、模型选择、边际无最高阶交互作用(NHOI)假设以及我们提出的框架的实用性。
{"title":"Rejoinder to the discussion on \"The central role of the identifying assumption in population size estimation\".","authors":"Serge Aleshin-Guendel, Mauricio Sadinle, Jon Wakefield","doi":"10.1093/biomtc/ujad033","DOIUrl":"10.1093/biomtc/ujad033","url":null,"abstract":"<p><p>We organize the discussants' major comments into the following categories: sensitivity analyses, zero counts, model selection, the marginal no-highest-order interaction (NHOI) assumption, and the usefulness of our proposed framework.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140058584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Soft classification and regression analysis of audiometric phenotypes of age-related hearing loss. 老年性听力损失听力表型的软分类和回归分析。
IF 1.9 4区 数学 Q1 Mathematics Pub Date : 2024-01-29 DOI: 10.1093/biomtc/ujae013
Ce Yang, Benjamin Langworthy, Sharon Curhan, Kenneth I Vaden, Gary Curhan, Judy R Dubno, Molin Wang

Age-related hearing loss has a complex etiology. Researchers have made efforts to classify relevant audiometric phenotypes, aiming to enhance medical interventions and improve hearing health. We leveraged existing pattern analyses of age-related hearing loss and implemented the phenotype classification via quadratic discriminant analysis (QDA). We herein propose a method for analyzing the exposure effects on the soft classification probabilities of the phenotypes via estimating equations. Under reasonable assumptions, the estimating equations are unbiased and lead to consistent estimators. The resulting estimator had good finite sample performances in simulation studies. As an illustrative example, we applied our proposed methods to assess the association between a dietary intake pattern, assessed as adherence scores for the dietary approaches to stop hypertension diet calculated using validated food-frequency questionnaires, and audiometric phenotypes (older-normal, metabolic, sensory, and metabolic plus sensory), determined based on data obtained in the Nurses' Health Study II Conservation of Hearing Study, the Audiology Assessment Arm. Our findings suggested that participants with a more healthful dietary pattern were less likely to develop the metabolic plus sensory phenotype of age-related hearing loss.

老年性听力损失的病因复杂。研究人员努力对相关听力表型进行分类,旨在加强医疗干预和改善听力健康。我们利用现有的老年性听力损失模式分析,通过二次判别分析(QDA)实现了表型分类。我们在此提出一种方法,通过估计方程分析暴露对表型软分类概率的影响。在合理的假设条件下,估计方程是无偏的,并能得到一致的估计值。在模拟研究中,所得到的估计器具有良好的有限样本性能。举例说明,我们应用所提出的方法评估了膳食摄入模式与听力表型(老年正常听力、代谢听力、感官听力和代谢加感官听力)之间的关系,膳食摄入模式是通过有效的食物频率调查问卷计算出的高血压防治膳食方法的依从性得分,而听力表型则是根据护士健康研究 II 听力保护研究听力评估臂中获得的数据确定的。我们的研究结果表明,饮食模式更健康的参与者不太可能出现老年性听力损失的代谢加感觉表型。
{"title":"Soft classification and regression analysis of audiometric phenotypes of age-related hearing loss.","authors":"Ce Yang, Benjamin Langworthy, Sharon Curhan, Kenneth I Vaden, Gary Curhan, Judy R Dubno, Molin Wang","doi":"10.1093/biomtc/ujae013","DOIUrl":"10.1093/biomtc/ujae013","url":null,"abstract":"<p><p>Age-related hearing loss has a complex etiology. Researchers have made efforts to classify relevant audiometric phenotypes, aiming to enhance medical interventions and improve hearing health. We leveraged existing pattern analyses of age-related hearing loss and implemented the phenotype classification via quadratic discriminant analysis (QDA). We herein propose a method for analyzing the exposure effects on the soft classification probabilities of the phenotypes via estimating equations. Under reasonable assumptions, the estimating equations are unbiased and lead to consistent estimators. The resulting estimator had good finite sample performances in simulation studies. As an illustrative example, we applied our proposed methods to assess the association between a dietary intake pattern, assessed as adherence scores for the dietary approaches to stop hypertension diet calculated using validated food-frequency questionnaires, and audiometric phenotypes (older-normal, metabolic, sensory, and metabolic plus sensory), determined based on data obtained in the Nurses' Health Study II Conservation of Hearing Study, the Audiology Assessment Arm. Our findings suggested that participants with a more healthful dietary pattern were less likely to develop the metabolic plus sensory phenotype of age-related hearing loss.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10941322/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140130630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient computation of high-dimensional penalized generalized linear mixed models by latent factor modeling of the random effects. 通过随机效应的潜在因素建模,高效计算高维惩罚性广义线性混合模型。
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-01-29 DOI: 10.1093/biomtc/ujae016
Hillary M Heiling, Naim U Rashid, Quefeng Li, Xianlu L Peng, Jen Jen Yeh, Joseph G Ibrahim

Modern biomedical datasets are increasingly high-dimensional and exhibit complex correlation structures. Generalized linear mixed models (GLMMs) have long been employed to account for such dependencies. However, proper specification of the fixed and random effects in GLMMs is increasingly difficult in high dimensions, and computational complexity grows with increasing dimension of the random effects. We present a novel reformulation of the GLMM using a factor model decomposition of the random effects, enabling scalable computation of GLMMs in high dimensions by reducing the latent space from a large number of random effects to a smaller set of latent factors. We also extend our prior work to estimate model parameters using a modified Monte Carlo Expectation Conditional Minimization algorithm, allowing us to perform variable selection on both the fixed and random effects simultaneously. We show through simulation that through this factor model decomposition, our method can fit high-dimensional penalized GLMMs faster than comparable methods and more easily scale to larger dimensions not previously seen in existing approaches.

现代生物医学数据集的维度越来越高,并呈现出复杂的相关结构。长期以来,人们一直采用广义线性混合模型(GLMM)来解释这种相关性。然而,在高维情况下,GLMMs 中固定效应和随机效应的正确规范越来越困难,计算复杂性也随着随机效应维度的增加而增加。我们提出了一种新的 GLMM 重构方法,使用随机效应的因子模型分解,通过将潜空间从大量随机效应减少到较小的潜因子集,实现了高维 GLMM 的可扩展计算。我们还扩展了之前的工作,使用改进的蒙特卡罗期望条件最小化算法来估计模型参数,使我们能够同时对固定效应和随机效应进行变量选择。我们通过仿真表明,通过这种因素模型分解,我们的方法比同类方法更快地拟合出高维的受惩罚 GLMM,而且更容易扩展到现有方法所没有的更大维度。
{"title":"Efficient computation of high-dimensional penalized generalized linear mixed models by latent factor modeling of the random effects.","authors":"Hillary M Heiling, Naim U Rashid, Quefeng Li, Xianlu L Peng, Jen Jen Yeh, Joseph G Ibrahim","doi":"10.1093/biomtc/ujae016","DOIUrl":"10.1093/biomtc/ujae016","url":null,"abstract":"<p><p>Modern biomedical datasets are increasingly high-dimensional and exhibit complex correlation structures. Generalized linear mixed models (GLMMs) have long been employed to account for such dependencies. However, proper specification of the fixed and random effects in GLMMs is increasingly difficult in high dimensions, and computational complexity grows with increasing dimension of the random effects. We present a novel reformulation of the GLMM using a factor model decomposition of the random effects, enabling scalable computation of GLMMs in high dimensions by reducing the latent space from a large number of random effects to a smaller set of latent factors. We also extend our prior work to estimate model parameters using a modified Monte Carlo Expectation Conditional Minimization algorithm, allowing us to perform variable selection on both the fixed and random effects simultaneously. We show through simulation that through this factor model decomposition, our method can fit high-dimensional penalized GLMMs faster than comparable methods and more easily scale to larger dimensions not previously seen in existing approaches.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10946237/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140142743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1