首页 > 最新文献

Biometrics最新文献

英文 中文
A Bayesian convolutional neural network-based generalized linear model. 基于贝叶斯卷积神经网络的广义线性模型。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae057
Yeseul Jeon, Won Chang, Seonghyun Jeong, Sanghoon Han, Jaewoo Park

Convolutional neural networks (CNNs) provide flexible function approximations for a wide variety of applications when the input variables are in the form of images or spatial data. Although CNNs often outperform traditional statistical models in prediction accuracy, statistical inference, such as estimating the effects of covariates and quantifying the prediction uncertainty, is not trivial due to the highly complicated model structure and overparameterization. To address this challenge, we propose a new Bayesian approach by embedding CNNs within the generalized linear models (GLMs) framework. We use extracted nodes from the last hidden layer of CNN with Monte Carlo (MC) dropout as informative covariates in GLM. This improves accuracy in prediction and regression coefficient inference, allowing for the interpretation of coefficients and uncertainty quantification. By fitting ensemble GLMs across multiple realizations from MC dropout, we can account for uncertainties in extracting the features. We apply our methods to biological and epidemiological problems, which have both high-dimensional correlated inputs and vector covariates. Specifically, we consider malaria incidence data, brain tumor image data, and fMRI data. By extracting information from correlated inputs, the proposed method can provide an interpretable Bayesian analysis. The algorithm can be broadly applicable to image regressions or correlated data analysis by enabling accurate Bayesian inference quickly.

当输入变量为图像或空间数据时,卷积神经网络(CNN)可为各种应用提供灵活的函数近似。虽然卷积神经网络在预测准确性上往往优于传统统计模型,但由于模型结构非常复杂且参数过多,统计推断(如估计协变量的影响和量化预测的不确定性)并非易事。为了应对这一挑战,我们提出了一种新的贝叶斯方法,即在广义线性模型(GLM)框架内嵌入 CNN。我们将从 CNN 最后一个隐藏层提取的节点与蒙特卡罗(MC)剔除作为广义线性模型中的信息协变量。这提高了预测和回归系数推断的准确性,允许对系数进行解释和不确定性量化。通过拟合来自 MC 丢失的多个变现的集合 GLM,我们可以考虑提取特征时的不确定性。我们将我们的方法应用于生物和流行病学问题,这些问题既有高维相关输入,也有向量协变量。具体来说,我们考虑了疟疾发病率数据、脑肿瘤图像数据和 fMRI 数据。通过从相关输入中提取信息,所提出的方法可以提供可解释的贝叶斯分析。通过快速实现准确的贝叶斯推理,该算法可广泛应用于图像回归或相关数据分析。
{"title":"A Bayesian convolutional neural network-based generalized linear model.","authors":"Yeseul Jeon, Won Chang, Seonghyun Jeong, Sanghoon Han, Jaewoo Park","doi":"10.1093/biomtc/ujae057","DOIUrl":"https://doi.org/10.1093/biomtc/ujae057","url":null,"abstract":"<p><p>Convolutional neural networks (CNNs) provide flexible function approximations for a wide variety of applications when the input variables are in the form of images or spatial data. Although CNNs often outperform traditional statistical models in prediction accuracy, statistical inference, such as estimating the effects of covariates and quantifying the prediction uncertainty, is not trivial due to the highly complicated model structure and overparameterization. To address this challenge, we propose a new Bayesian approach by embedding CNNs within the generalized linear models (GLMs) framework. We use extracted nodes from the last hidden layer of CNN with Monte Carlo (MC) dropout as informative covariates in GLM. This improves accuracy in prediction and regression coefficient inference, allowing for the interpretation of coefficients and uncertainty quantification. By fitting ensemble GLMs across multiple realizations from MC dropout, we can account for uncertainties in extracting the features. We apply our methods to biological and epidemiological problems, which have both high-dimensional correlated inputs and vector covariates. Specifically, we consider malaria incidence data, brain tumor image data, and fMRI data. By extracting information from correlated inputs, the proposed method can provide an interpretable Bayesian analysis. The algorithm can be broadly applicable to image regressions or correlated data analysis by enabling accurate Bayesian inference quickly.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141417569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dissecting the colocalized GWAS and eQTLs with mediation analysis for high-dimensional exposures and confounders. 通过对高维暴露和混杂因素进行中介分析,剖析定位的 GWAS 和 eQTL。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae050
Qi Zhang, Zhikai Yang, Jinliang Yang

To leverage the advancements in genome-wide association studies (GWAS) and quantitative trait loci (QTL) mapping for traits and molecular phenotypes to gain mechanistic understanding of the genetic regulation, biological researchers often investigate the expression QTLs (eQTLs) that colocalize with QTL or GWAS peaks. Our research is inspired by 2 such studies. One aims to identify the causal single nucleotide polymorphisms that are responsible for the phenotypic variation and whose effects can be explained by their impacts at the transcriptomic level in maize. The other study in mouse focuses on uncovering the cis-driver genes that induce phenotypic changes by regulating trans-regulated genes. Both studies can be formulated as mediation problems with potentially high-dimensional exposures, confounders, and mediators that seek to estimate the overall indirect effect (IE) for each exposure. In this paper, we propose MedDiC, a novel procedure to estimate the overall IE based on difference-in-coefficients approach. Our simulation studies find that MedDiC offers valid inference for the IE with higher power, shorter confidence intervals, and faster computing time than competing methods. We apply MedDiC to the 2 aforementioned motivating datasets and find that MedDiC yields reproducible outputs across the analysis of closely related traits, with results supported by external biological evidence. The code and additional information are available on our GitHub page (https://github.com/QiZhangStat/MedDiC).

为了充分利用全基因组关联研究(GWAS)和性状与分子表型的数量性状位点(QTL)图谱的进步,从机理上理解遗传调控,生物研究人员经常调查与 QTL 或 GWAS 峰值共定位的表达 QTL(eQTL)。我们的研究受到了两项此类研究的启发。一项研究的目的是找出导致表型变异的因果单核苷酸多态性,这些单核苷酸多态性在玉米转录组水平的影响可以解释其效应。另一项以小鼠为对象的研究侧重于发现通过调控反式调控基因诱导表型变化的顺式驱动基因。这两项研究都可以表述为具有潜在高维暴露、混杂因素和中介因素的中介问题,旨在估算每种暴露的总体间接效应(IE)。在本文中,我们提出了 MedDiC,这是一种基于差异系数法估算总体 IE 的新程序。我们的模拟研究发现,与其他竞争方法相比,MedDiC 能提供有效的 IE 推断,具有更高的功率、更短的置信区间和更快的计算时间。我们将 MedDiC 应用于上述两个激励性数据集,发现 MedDiC 在分析密切相关的性状时能产生可重复的输出结果,并得到外部生物学证据的支持。代码和其他信息可在我们的 GitHub 页面 (https://github.com/QiZhangStat/MedDiC) 上获取。
{"title":"Dissecting the colocalized GWAS and eQTLs with mediation analysis for high-dimensional exposures and confounders.","authors":"Qi Zhang, Zhikai Yang, Jinliang Yang","doi":"10.1093/biomtc/ujae050","DOIUrl":"https://doi.org/10.1093/biomtc/ujae050","url":null,"abstract":"<p><p>To leverage the advancements in genome-wide association studies (GWAS) and quantitative trait loci (QTL) mapping for traits and molecular phenotypes to gain mechanistic understanding of the genetic regulation, biological researchers often investigate the expression QTLs (eQTLs) that colocalize with QTL or GWAS peaks. Our research is inspired by 2 such studies. One aims to identify the causal single nucleotide polymorphisms that are responsible for the phenotypic variation and whose effects can be explained by their impacts at the transcriptomic level in maize. The other study in mouse focuses on uncovering the cis-driver genes that induce phenotypic changes by regulating trans-regulated genes. Both studies can be formulated as mediation problems with potentially high-dimensional exposures, confounders, and mediators that seek to estimate the overall indirect effect (IE) for each exposure. In this paper, we propose MedDiC, a novel procedure to estimate the overall IE based on difference-in-coefficients approach. Our simulation studies find that MedDiC offers valid inference for the IE with higher power, shorter confidence intervals, and faster computing time than competing methods. We apply MedDiC to the 2 aforementioned motivating datasets and find that MedDiC yields reproducible outputs across the analysis of closely related traits, with results supported by external biological evidence. The code and additional information are available on our GitHub page (https://github.com/QiZhangStat/MedDiC).</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141155115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Case weighted power priors for hybrid control analyses with time-to-event data. 利用时间到事件数据进行混合控制分析的案例加权幂先验。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae019
Evan Kwiatkowski, Jiawen Zhu, Xiao Li, Herbert Pang, Grazyna Lieberman, Matthew A Psioda

We develop a method for hybrid analyses that uses external controls to augment internal control arms in randomized controlled trials (RCTs) where the degree of borrowing is determined based on similarity between RCT and external control patients to account for systematic differences (e.g., unmeasured confounders). The method represents a novel extension of the power prior where discounting weights are computed separately for each external control based on compatibility with the randomized control data. The discounting weights are determined using the predictive distribution for the external controls derived via the posterior distribution for time-to-event parameters estimated from the RCT. This method is applied using a proportional hazards regression model with piecewise constant baseline hazard. A simulation study and a real-data example are presented based on a completed trial in non-small cell lung cancer. It is shown that the case weighted power prior provides robust inference under various forms of incompatibility between the external controls and RCT population.

我们开发了一种混合分析方法,利用外部对照来增强随机对照试验(RCT)中的内部对照臂,根据 RCT 和外部对照患者的相似性来确定借用程度,以考虑系统性差异(如未测量的混杂因素)。该方法是对功率先验的新扩展,根据与随机对照数据的兼容性,分别计算每个外部对照的贴现权重。贴现权重是利用外部对照的预测分布确定的,预测分布是通过 RCT 估计的时间到事件参数的后验分布得出的。该方法使用的是具有片断恒定基线危害的比例危害回归模型。本文以一项已完成的非小细胞肺癌试验为基础,介绍了一项模拟研究和一个真实数据示例。结果表明,在外部对照和 RCT 群体之间存在各种形式的不相容性的情况下,病例加权功率先验可提供稳健的推断。
{"title":"Case weighted power priors for hybrid control analyses with time-to-event data.","authors":"Evan Kwiatkowski, Jiawen Zhu, Xiao Li, Herbert Pang, Grazyna Lieberman, Matthew A Psioda","doi":"10.1093/biomtc/ujae019","DOIUrl":"10.1093/biomtc/ujae019","url":null,"abstract":"<p><p>We develop a method for hybrid analyses that uses external controls to augment internal control arms in randomized controlled trials (RCTs) where the degree of borrowing is determined based on similarity between RCT and external control patients to account for systematic differences (e.g., unmeasured confounders). The method represents a novel extension of the power prior where discounting weights are computed separately for each external control based on compatibility with the randomized control data. The discounting weights are determined using the predictive distribution for the external controls derived via the posterior distribution for time-to-event parameters estimated from the RCT. This method is applied using a proportional hazards regression model with piecewise constant baseline hazard. A simulation study and a real-data example are presented based on a completed trial in non-small cell lung cancer. It is shown that the case weighted power prior provides robust inference under various forms of incompatibility between the external controls and RCT population.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10968526/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140304678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep partially linear cox model for current status data. 针对现状数据的深度部分线性 Cox 模型。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae024
Qiang Wu, Xingwei Tong, Xingqiu Zhao

Deep learning has continuously attained huge success in diverse fields, while its application to survival data analysis remains limited and deserves further exploration. For the analysis of current status data, a deep partially linear Cox model is proposed to circumvent the curse of dimensionality. Modeling flexibility is attained by using deep neural networks (DNNs) to accommodate nonlinear covariate effects and monotone splines to approximate the baseline cumulative hazard function. We establish the convergence rate of the proposed maximum likelihood estimators. Moreover, we derive that the finite-dimensional estimator for treatment covariate effects is $sqrt{n}$-consistent, asymptotically normal, and attains semiparametric efficiency. Finally, we demonstrate the performance of our procedures through extensive simulation studies and application to real-world data on news popularity.

深度学习在多个领域不断取得巨大成功,但其在生存数据分析中的应用仍然有限,值得进一步探索。为分析现状数据,我们提出了一种深度部分线性 Cox 模型,以规避维度诅咒。通过使用深度神经网络(DNN)来适应非线性协变量效应,并使用单调样条来近似基线累积危险函数,从而实现建模的灵活性。我们确定了所提出的最大似然估计值的收敛率。此外,我们还推导出治疗协变量效应的有限维估计器是$sqrt{n}$一致的、渐近正态的,并且达到了半参数效率。最后,我们通过大量的模拟研究并应用于现实世界的新闻流行度数据,证明了我们的程序的性能。
{"title":"Deep partially linear cox model for current status data.","authors":"Qiang Wu, Xingwei Tong, Xingqiu Zhao","doi":"10.1093/biomtc/ujae024","DOIUrl":"10.1093/biomtc/ujae024","url":null,"abstract":"<p><p>Deep learning has continuously attained huge success in diverse fields, while its application to survival data analysis remains limited and deserves further exploration. For the analysis of current status data, a deep partially linear Cox model is proposed to circumvent the curse of dimensionality. Modeling flexibility is attained by using deep neural networks (DNNs) to accommodate nonlinear covariate effects and monotone splines to approximate the baseline cumulative hazard function. We establish the convergence rate of the proposed maximum likelihood estimators. Moreover, we derive that the finite-dimensional estimator for treatment covariate effects is $sqrt{n}$-consistent, asymptotically normal, and attains semiparametric efficiency. Finally, we demonstrate the performance of our procedures through extensive simulation studies and application to real-world data on news popularity.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140334555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addressing age measurement errors in fish growth estimation from length-stratified samples. 利用长度分层样本估算鱼类生长过程中的年龄测量误差。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae029
Nan Zheng, Atefeh Kheirollahi, Yildiz Yilmaz

Fish growth models are crucial for fisheries stock assessments and are commonly estimated using fish length-at-age data. This data is widely collected using length-stratified age sampling (LSAS), a cost-effective two-phase response-selective sampling method. The data may contain age measurement errors (MEs). We propose a methodology that accounts for both LSAS and age MEs to accurately estimate fish growth. The proposed methods use empirical proportion likelihood methodology for LSAS and the structural errors in variables methodology for age MEs. We provide a measure of uncertainty for parameter estimates and standardized residuals for model validation. To model the age distribution, we employ a continuation ratio-logit model that is consistent with the random nature of the true age distribution. We also apply a discretization approach for age and length distributions, which significantly improves computational efficiency and is consistent with the discrete age and length data typically encountered in practice. Our simulation study shows that neglecting age MEs can lead to significant bias in growth estimation, even with small but non-negligible age MEs. However, our new approach performs well regardless of the magnitude of age MEs and accurately estimates SEs of parameter estimators. Real data analysis demonstrates the effectiveness of the proposed model validation device. Computer codes to implement the methodology are provided.

鱼类生长模型对渔业资源评估至关重要,通常使用鱼类的年龄长度数据进行估算。这些数据广泛采用长度分层年龄取样法(LSAS)收集,这是一种具有成本效益的两阶段反应选择取样法。这些数据可能包含年龄测量误差(ME)。我们提出了一种既考虑 LSAS 又考虑年龄测量误差的方法,以准确估计鱼类的生长情况。建议的方法对 LSAS 采用经验比例似然法,对年龄 ME 采用变量结构误差法。我们为参数估计提供了不确定性度量,并为模型验证提供了标准化残差。为了建立年龄分布模型,我们采用了与真实年龄分布的随机性相一致的延续比对数模型。我们还对年龄和身长分布采用了离散化方法,这大大提高了计算效率,并与实践中通常遇到的离散年龄和身长数据相一致。我们的模拟研究表明,忽略年龄 ME 会导致生长估计出现明显偏差,即使年龄 ME 较小但不可忽略。然而,无论年龄中位数的大小如何,我们的新方法都能表现出色,并能准确估计参数估计值的 SE。实际数据分析证明了所提出的模型验证方法的有效性。本文还提供了实现该方法的计算机代码。
{"title":"Addressing age measurement errors in fish growth estimation from length-stratified samples.","authors":"Nan Zheng, Atefeh Kheirollahi, Yildiz Yilmaz","doi":"10.1093/biomtc/ujae029","DOIUrl":"https://doi.org/10.1093/biomtc/ujae029","url":null,"abstract":"<p><p>Fish growth models are crucial for fisheries stock assessments and are commonly estimated using fish length-at-age data. This data is widely collected using length-stratified age sampling (LSAS), a cost-effective two-phase response-selective sampling method. The data may contain age measurement errors (MEs). We propose a methodology that accounts for both LSAS and age MEs to accurately estimate fish growth. The proposed methods use empirical proportion likelihood methodology for LSAS and the structural errors in variables methodology for age MEs. We provide a measure of uncertainty for parameter estimates and standardized residuals for model validation. To model the age distribution, we employ a continuation ratio-logit model that is consistent with the random nature of the true age distribution. We also apply a discretization approach for age and length distributions, which significantly improves computational efficiency and is consistent with the discrete age and length data typically encountered in practice. Our simulation study shows that neglecting age MEs can lead to significant bias in growth estimation, even with small but non-negligible age MEs. However, our new approach performs well regardless of the magnitude of age MEs and accurately estimates SEs of parameter estimators. Real data analysis demonstrates the effectiveness of the proposed model validation device. Computer codes to implement the methodology are provided.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140849395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Doubly robust estimation and sensitivity analysis for marginal structural quantile models. 边际结构量化模型的双稳健估计和敏感性分析。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae045
Chao Cheng, Liangyuan Hu, Fan Li

The marginal structure quantile model (MSQM) provides a unique lens to understand the causal effect of a time-varying treatment on the full distribution of potential outcomes. Under the semiparametric framework, we derive the efficiency influence function for the MSQM, from which a new doubly robust estimator is proposed for point estimation and inference. We show that the doubly robust estimator is consistent if either of the models associated with treatment assignment or the potential outcome distributions is correctly specified, and is semiparametric efficient if both models are correct. To implement the doubly robust MSQM estimator, we propose to solve a smoothed estimating equation to facilitate efficient computation of the point and variance estimates. In addition, we develop a confounding function approach to investigate the sensitivity of several MSQM estimators when the sequential ignorability assumption is violated. Extensive simulations are conducted to examine the finite-sample performance characteristics of the proposed methods. We apply the proposed methods to the Yale New Haven Health System Electronic Health Record data to study the effect of antihypertensive medications to patients with severe hypertension and assess the robustness of the findings to unmeasured baseline and time-varying confounding.

边际结构量子模型(MSQM)提供了一个独特的视角来理解时变处理对潜在结果的全部分布的因果效应。在半参数框架下,我们推导出了 MSQM 的效率影响函数,并据此提出了一种新的双重稳健估计器,用于点估计和推断。我们证明,如果与治疗分配或潜在结果分布相关的模型中的任何一个模型指定正确,则双重稳健估计器是一致的;如果两个模型都正确,则双重稳健估计器是半参数有效的。为了实现双重稳健 MSQM 估计器,我们建议求解一个平滑估计方程,以便高效计算点估计值和方差估计值。此外,我们还开发了一种混杂函数方法,用于研究违反顺序无知假设时多个 MSQM 估计器的敏感性。我们进行了大量模拟,以检验所提出方法的有限样本性能特征。我们将所提出的方法应用于耶鲁大学纽黑文健康系统电子健康记录数据,研究抗高血压药物对严重高血压患者的影响,并评估研究结果对未测量基线和时变混杂因素的稳健性。
{"title":"Doubly robust estimation and sensitivity analysis for marginal structural quantile models.","authors":"Chao Cheng, Liangyuan Hu, Fan Li","doi":"10.1093/biomtc/ujae045","DOIUrl":"https://doi.org/10.1093/biomtc/ujae045","url":null,"abstract":"<p><p>The marginal structure quantile model (MSQM) provides a unique lens to understand the causal effect of a time-varying treatment on the full distribution of potential outcomes. Under the semiparametric framework, we derive the efficiency influence function for the MSQM, from which a new doubly robust estimator is proposed for point estimation and inference. We show that the doubly robust estimator is consistent if either of the models associated with treatment assignment or the potential outcome distributions is correctly specified, and is semiparametric efficient if both models are correct. To implement the doubly robust MSQM estimator, we propose to solve a smoothed estimating equation to facilitate efficient computation of the point and variance estimates. In addition, we develop a confounding function approach to investigate the sensitivity of several MSQM estimators when the sequential ignorability assumption is violated. Extensive simulations are conducted to examine the finite-sample performance characteristics of the proposed methods. We apply the proposed methods to the Yale New Haven Health System Electronic Health Record data to study the effect of antihypertensive medications to patients with severe hypertension and assess the robustness of the findings to unmeasured baseline and time-varying confounding.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141330357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional multisubject time series transition matrix inference with application to brain connectivity analysis. 高维多受试者时间序列转换矩阵推理在大脑连接性分析中的应用。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae021
Xiang Lyu, Jian Kang, Lexin Li

Brain-effective connectivity analysis quantifies directed influence of one neural element or region over another, and it is of great scientific interest to understand how effective connectivity pattern is affected by variations of subject conditions. Vector autoregression (VAR) is a useful tool for this type of problems. However, there is a paucity of solutions when there is measurement error, when there are multiple subjects, and when the focus is the inference of the transition matrix. In this article, we study the problem of transition matrix inference under the high-dimensional VAR model with measurement error and multiple subjects. We propose a simultaneous testing procedure, with three key components: a modified expectation-maximization (EM) algorithm, a test statistic based on the tensor regression of a bias-corrected estimator of the lagged auto-covariance given the covariates, and a properly thresholded simultaneous test. We establish the uniform consistency for the estimators of our modified EM, and show that the subsequent test achieves both a consistent false discovery control, and its power approaches one asymptotically. We demonstrate the efficacy of our method through both simulations and a brain connectivity study of task-evoked functional magnetic resonance imaging.

大脑有效连接分析量化了一个神经元素或区域对另一个神经元素或区域的定向影响,了解有效连接模式如何受主体条件变化的影响具有重大的科学意义。向量自回归(VAR)是解决这类问题的有效工具。然而,当存在测量误差、有多个受试者以及重点是推断过渡矩阵时,解决方案却非常匮乏。本文研究了具有测量误差和多主体的高维 VAR 模型下的转换矩阵推断问题。我们提出了一种同步检验程序,包括三个关键部分:改进的期望最大化(EM)算法、基于给定协变量的滞后自协方差偏差校正估计器的张量回归的检验统计量,以及适当阈值化的同步检验。我们建立了修正 EM 估计数的统一一致性,并证明随后的检验既实现了一致的误发现控制,其功率也渐近于 1。我们通过模拟和任务诱发功能磁共振成像的大脑连接研究证明了我们方法的有效性。
{"title":"High-dimensional multisubject time series transition matrix inference with application to brain connectivity analysis.","authors":"Xiang Lyu, Jian Kang, Lexin Li","doi":"10.1093/biomtc/ujae021","DOIUrl":"10.1093/biomtc/ujae021","url":null,"abstract":"<p><p>Brain-effective connectivity analysis quantifies directed influence of one neural element or region over another, and it is of great scientific interest to understand how effective connectivity pattern is affected by variations of subject conditions. Vector autoregression (VAR) is a useful tool for this type of problems. However, there is a paucity of solutions when there is measurement error, when there are multiple subjects, and when the focus is the inference of the transition matrix. In this article, we study the problem of transition matrix inference under the high-dimensional VAR model with measurement error and multiple subjects. We propose a simultaneous testing procedure, with three key components: a modified expectation-maximization (EM) algorithm, a test statistic based on the tensor regression of a bias-corrected estimator of the lagged auto-covariance given the covariates, and a properly thresholded simultaneous test. We establish the uniform consistency for the estimators of our modified EM, and show that the subsequent test achieves both a consistent false discovery control, and its power approaches one asymptotically. We demonstrate the efficacy of our method through both simulations and a brain connectivity study of task-evoked functional magnetic resonance imaging.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10988359/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140850327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal inference for time-to-event data with a cured subpopulation. 具有固化子群的时间到事件数据的因果推理。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae028
Yi Wang, Yuhao Deng, Xiao-Hua Zhou

When studying the treatment effect on time-to-event outcomes, it is common that some individuals never experience failure events, which suggests that they have been cured. However, the cure status may not be observed due to censoring which makes it challenging to define treatment effects. Current methods mainly focus on estimating model parameters in various cure models, ultimately leading to a lack of causal interpretations. To address this issue, we propose 2 causal estimands, the timewise risk difference and mean survival time difference, in the always-uncured based on principal stratification as a complement to the treatment effect on cure rates. These estimands allow us to study the treatment effects on failure times in the always-uncured subpopulation. We show the identifiability using a substitutional variable for the potential cure status under ignorable treatment assignment mechanism, these 2 estimands are identifiable. We also provide estimation methods using mixture cure models. We applied our approach to an observational study that compared the leukemia-free survival rates of different transplantation types to cure acute lymphoblastic leukemia. Our proposed approach yielded insightful results that can be used to inform future treatment decisions.

在研究治疗对时间到事件结果的影响时,常见的情况是有些人从未发生过失败事件,这表明他们已经治愈。然而,由于普查的原因,可能无法观察到治愈状态,这就给确定治疗效果带来了挑战。目前的方法主要侧重于估计各种治愈模型的模型参数,最终导致缺乏因果解释。为解决这一问题,我们提出了基于主分层的始终未治愈者的两个因果估计值,即时间风险差异和平均生存时间差异,作为治疗效果对治愈率的补充。通过这些估计值,我们可以研究治疗对始终未治愈亚群中失败时间的影响。我们展示了在可忽略的治疗分配机制下,使用潜在治愈状态的替代变量的可识别性,这两个估计值是可识别的。我们还提供了使用混合治愈模型的估计方法。我们将我们的方法应用于一项观察性研究,该研究比较了不同移植类型治愈急性淋巴细胞白血病的无白血病生存率。我们提出的方法得出了具有洞察力的结果,可为未来的治疗决策提供依据。
{"title":"Causal inference for time-to-event data with a cured subpopulation.","authors":"Yi Wang, Yuhao Deng, Xiao-Hua Zhou","doi":"10.1093/biomtc/ujae028","DOIUrl":"https://doi.org/10.1093/biomtc/ujae028","url":null,"abstract":"<p><p>When studying the treatment effect on time-to-event outcomes, it is common that some individuals never experience failure events, which suggests that they have been cured. However, the cure status may not be observed due to censoring which makes it challenging to define treatment effects. Current methods mainly focus on estimating model parameters in various cure models, ultimately leading to a lack of causal interpretations. To address this issue, we propose 2 causal estimands, the timewise risk difference and mean survival time difference, in the always-uncured based on principal stratification as a complement to the treatment effect on cure rates. These estimands allow us to study the treatment effects on failure times in the always-uncured subpopulation. We show the identifiability using a substitutional variable for the potential cure status under ignorable treatment assignment mechanism, these 2 estimands are identifiable. We also provide estimation methods using mixture cure models. We applied our approach to an observational study that compared the leukemia-free survival rates of different transplantation types to cure acute lymphoblastic leukemia. Our proposed approach yielded insightful results that can be used to inform future treatment decisions.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140856016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression models for average hazard. 平均危害回归模型
IF 1.4 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae037
Hajime Uno, Lu Tian, Miki Horiguchi, Satoshi Hattori, Kenneth L Kehl

Limitations of using the traditional Cox's hazard ratio for summarizing the magnitude of the treatment effect on time-to-event outcomes have been widely discussed, and alternative measures that do not have such limitations are gaining attention. One of the alternative methods recently proposed, in a simple 2-sample comparison setting, uses the average hazard with survival weight (AH), which can be interpreted as the general censoring-free person-time incidence rate on a given time window. In this paper, we propose a new regression analysis approach for the AH with a truncation time τ. We investigate 3 versions of AH regression analysis, assuming (1) independent censoring, (2) group-specific censoring, and (3) covariate-dependent censoring. The proposed AH regression methods are closely related to robust Poisson regression. While the new approach needs to require a truncation time τ explicitly, it can be more robust than Poisson regression in the presence of censoring. With the AH regression approach, one can summarize the between-group treatment difference in both absolute difference and relative terms, adjusting for covariates that are associated with the outcome. This property will increase the likelihood that the treatment effect magnitude is correctly interpreted. The AH regression approach can be a useful alternative to the traditional Cox's hazard ratio approach for estimating and reporting the magnitude of the treatment effect on time-to-event outcomes.

使用传统的 Cox 危险比来概括治疗对时间到事件结果的影响程度的局限性已被广泛讨论,而没有这些局限性的替代测量方法正受到越来越多的关注。最近提出的一种替代方法是,在简单的双样本比较设置中,使用带生存权重的平均危险度(AH),它可以解释为给定时间窗上的一般无删减人时发病率。本文提出了一种新的截断时间为 τ 的 AH 回归分析方法。我们研究了 3 个版本的 AH 回归分析,分别假定:(1)独立普查;(2)特定组普查;(3)依赖于协变量的普查。所提出的 AH 回归方法与稳健泊松回归密切相关。虽然新方法需要明确要求截断时间 τ,但在存在剔除的情况下,它比泊松回归更稳健。采用 AH 回归方法,我们可以用绝对差异和相对差异来概括组间治疗差异,并对与结果相关的协变量进行调整。这一特性将增加正确解释治疗效果大小的可能性。在估计和报告治疗对时间到事件结果的影响程度时,AH 回归方法可以替代传统的 Cox 危险比方法。
{"title":"Regression models for average hazard.","authors":"Hajime Uno, Lu Tian, Miki Horiguchi, Satoshi Hattori, Kenneth L Kehl","doi":"10.1093/biomtc/ujae037","DOIUrl":"10.1093/biomtc/ujae037","url":null,"abstract":"<p><p>Limitations of using the traditional Cox's hazard ratio for summarizing the magnitude of the treatment effect on time-to-event outcomes have been widely discussed, and alternative measures that do not have such limitations are gaining attention. One of the alternative methods recently proposed, in a simple 2-sample comparison setting, uses the average hazard with survival weight (AH), which can be interpreted as the general censoring-free person-time incidence rate on a given time window. In this paper, we propose a new regression analysis approach for the AH with a truncation time τ. We investigate 3 versions of AH regression analysis, assuming (1) independent censoring, (2) group-specific censoring, and (3) covariate-dependent censoring. The proposed AH regression methods are closely related to robust Poisson regression. While the new approach needs to require a truncation time τ explicitly, it can be more robust than Poisson regression in the presence of censoring. With the AH regression approach, one can summarize the between-group treatment difference in both absolute difference and relative terms, adjusting for covariates that are associated with the outcome. This property will increase the likelihood that the treatment effect magnitude is correctly interpreted. The AH regression approach can be a useful alternative to the traditional Cox's hazard ratio approach for estimating and reporting the magnitude of the treatment effect on time-to-event outcomes.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11107592/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141074620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discussion on "Bayesian meta-analysis of penetrance for cancer risk" by Thanthirige Lakshika M. Ruberu, Danielle Braun, Giovanni Parmigiani, and Swati Biswas. Thanthirige Lakshika M. Ruberu、Danielle Braun、Giovanni Parmigiani 和 Swati Biswas 关于 "癌症风险渗透的贝叶斯元分析 "的讨论。
IF 1.9 4区 数学 Q3 BIOLOGY Pub Date : 2024-03-27 DOI: 10.1093/biomtc/ujae041
Gianluca Baio
{"title":"Discussion on \"Bayesian meta-analysis of penetrance for cancer risk\" by Thanthirige Lakshika M. Ruberu, Danielle Braun, Giovanni Parmigiani, and Swati Biswas.","authors":"Gianluca Baio","doi":"10.1093/biomtc/ujae041","DOIUrl":"https://doi.org/10.1093/biomtc/ujae041","url":null,"abstract":"","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"80 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141178720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1