首页 > 最新文献

Journal of the Royal Statistical Society Series C-Applied Statistics最新文献

英文 中文
Precision Mental Health: Predicting Heterogeneous Treatment Effects for Depression through Data Integration. 精准心理健康:通过数据整合预测抑郁症的异质性治疗效果。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2025-12-12 DOI: 10.1093/jrsssc/qlaf068
Carly L Brantner, Trang Quynh Nguyen, Harsh Parikh, Congwen Zhao, Hwanhee Hong, Elizabeth A Stuart

When treating depression, clinicians are interested in determining the optimal treatment for a given patient, which is challenging given the amount of treatments available. To advance individualized treatment allocation, integrating data across multiple randomized controlled trials (RCTs) can enhance our understanding of treatment effect heterogeneity by increasing available information. However, extending these inferences to individuals outside of the original RCTs remains crucial for clinical decision-making. We introduce a two-stage meta-analytic method that predicts conditional average treatment effects (CATEs) in target patient populations by leveraging the distribution of CATEs across RCTs. Our approach generates 95% prediction intervals for CATEs in target settings using first-stage models that can incorporate parametric regression or non-parametric methods such as causal forests or Bayesian additive regression trees (BART). We validate our method through simulation studies and operationalize it to integrate multiple RCTs comparing depression treatments, duloxetine and vortioxetine, to generate prediction intervals for target patient profiles. Our analysis reveals no strong evidence of effect heterogeneity across trials, with the exception of potential age-related variability. Importantly, we show that CATE prediction intervals capture broader uncertainty than study-specific confidence intervals when warranted, reflecting both within-study and between-study variability.

在治疗抑郁症时,临床医生感兴趣的是确定针对特定患者的最佳治疗方法,考虑到现有的治疗方法数量,这是具有挑战性的。为了推进个体化治疗分配,整合多个随机对照试验(RCTs)的数据可以通过增加可用信息来增强我们对治疗效果异质性的理解。然而,将这些推论扩展到原始随机对照试验之外的个体,对于临床决策仍然至关重要。我们引入了一种两阶段荟萃分析方法,通过利用随机对照试验中条件平均治疗效果(CATEs)的分布来预测目标患者群体的条件平均治疗效果(CATEs)。我们的方法使用第一阶段模型在目标设置中生成95%的CATEs预测区间,该模型可以结合参数回归或非参数方法,如因果森林或贝叶斯加性回归树(BART)。我们通过模拟研究验证了我们的方法,并将其应用于多个比较抑郁症治疗、度洛西汀和沃替西汀的随机对照试验,以生成目标患者概况的预测区间。我们的分析显示,除了潜在的年龄相关变异性外,没有强有力的证据表明不同试验的效果存在异质性。重要的是,我们表明,在有保证的情况下,CATE预测区间比研究特定置信区间捕捉到更大的不确定性,反映了研究内和研究间的可变性。
{"title":"Precision Mental Health: Predicting Heterogeneous Treatment Effects for Depression through Data Integration.","authors":"Carly L Brantner, Trang Quynh Nguyen, Harsh Parikh, Congwen Zhao, Hwanhee Hong, Elizabeth A Stuart","doi":"10.1093/jrsssc/qlaf068","DOIUrl":"10.1093/jrsssc/qlaf068","url":null,"abstract":"<p><p>When treating depression, clinicians are interested in determining the optimal treatment for a given patient, which is challenging given the amount of treatments available. To advance individualized treatment allocation, integrating data across multiple randomized controlled trials (RCTs) can enhance our understanding of treatment effect heterogeneity by increasing available information. However, extending these inferences to individuals outside of the original RCTs remains crucial for clinical decision-making. We introduce a two-stage meta-analytic method that predicts conditional average treatment effects (CATEs) in target patient populations by leveraging the distribution of CATEs across RCTs. Our approach generates 95% prediction intervals for CATEs in target settings using first-stage models that can incorporate parametric regression or non-parametric methods such as causal forests or Bayesian additive regression trees (BART). We validate our method through simulation studies and operationalize it to integrate multiple RCTs comparing depression treatments, duloxetine and vortioxetine, to generate prediction intervals for target patient profiles. Our analysis reveals no strong evidence of effect heterogeneity across trials, with the exception of potential age-related variability. Importantly, we show that CATE prediction intervals capture broader uncertainty than study-specific confidence intervals when warranted, reflecting both within-study and between-study variability.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":" ","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12768387/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145913599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A patient similarity-embedded Bayesian approach to prognostic biomarker inference with application to thoracic cancer immunity. 患者相似性嵌入贝叶斯方法在预后生物标志物推断中的应用与胸部癌症免疫。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2025-06-01 Epub Date: 2025-01-23 DOI: 10.1093/jrsssc/qlaf001
Duo Yu, Meilin Huang, Michael J Kane, Brian P Hobbs

This paper introduces a novel statistical methodology integrating machine learning (ML) and Bayesian modelling to facilitate personalized prognostic predictions with application to oncology. Utilizing power priors, we construct 'patient-similarity embeddings' that identify localized patterns of prognosis. The methodology is applied to study the prognostic value of markers of anticancer immunity within the tumour microenvironment of nonsmall cell lung cancer while adjusting for established clinical characteristics. The method outperforms traditional regression and ML models, while accurately identifying subgroup patterns, thereby enhancing statistical inference and hypothesis testing.

本文介绍了一种集成机器学习(ML)和贝叶斯模型的新型统计方法,以促进个性化预后预测并应用于肿瘤学。利用权力先验,我们构建了“患者相似嵌入”来识别局部预后模式。该方法用于研究非小细胞肺癌肿瘤微环境中抗癌免疫标志物的预后价值,同时根据已建立的临床特征进行调整。该方法优于传统的回归模型和ML模型,同时准确地识别子群模式,从而增强了统计推断和假设检验。
{"title":"A patient similarity-embedded Bayesian approach to prognostic biomarker inference with application to thoracic cancer immunity.","authors":"Duo Yu, Meilin Huang, Michael J Kane, Brian P Hobbs","doi":"10.1093/jrsssc/qlaf001","DOIUrl":"10.1093/jrsssc/qlaf001","url":null,"abstract":"<p><p>This paper introduces a novel statistical methodology integrating machine learning (ML) and Bayesian modelling to facilitate personalized prognostic predictions with application to oncology. Utilizing power priors, we construct 'patient-similarity embeddings' that identify localized patterns of prognosis. The methodology is applied to study the prognostic value of markers of anticancer immunity within the tumour microenvironment of nonsmall cell lung cancer while adjusting for established clinical characteristics. The method outperforms traditional regression and ML models, while accurately identifying subgroup patterns, thereby enhancing statistical inference and hypothesis testing.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 3","pages":"800-823"},"PeriodicalIF":1.3,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12449836/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145115019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correcting for bias due to mismeasured exposure in mediation analysis with a survival outcome. 校正因存在生存结果的中介分析中暴露量测量错误造成的偏倚。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2025-02-14 eCollection Date: 2025-11-01 DOI: 10.1093/jrsssc/qlaf010
Chao Cheng, Donna Spiegelman, Fan Li

We investigate the impact of exposure measurement error on assessing mediation with a survival outcome modelled by Cox regression. We first derive the bias formulas of natural indirect and direct effects with a rare outcome and no exposure-mediator interaction. We then develop several calibration approaches to correct for the measurement error-induced bias, and generalize our methods to accommodate a common outcome and an exposure-mediator interaction. We apply the proposed methods to analyse the Health Professionals Follow-up Study (1986-2016) and evaluate the extent to which reduced body mass index mediates the protective effect of physical activity on risk of cardiovascular diseases.

我们通过Cox回归模型研究了暴露测量误差对评估中介作用的影响。我们首先推导了自然间接效应和直接效应的偏差公式,这些影响具有罕见的结果,没有暴露-中介相互作用。然后,我们开发了几种校准方法来纠正测量误差引起的偏差,并推广了我们的方法,以适应共同的结果和暴露-中介相互作用。我们应用提出的方法分析了卫生专业人员随访研究(1986-2016),并评估了身体质量指数降低在多大程度上介导了身体活动对心血管疾病风险的保护作用。
{"title":"Correcting for bias due to mismeasured exposure in mediation analysis with a survival outcome.","authors":"Chao Cheng, Donna Spiegelman, Fan Li","doi":"10.1093/jrsssc/qlaf010","DOIUrl":"10.1093/jrsssc/qlaf010","url":null,"abstract":"<p><p>We investigate the impact of exposure measurement error on assessing mediation with a survival outcome modelled by Cox regression. We first derive the bias formulas of natural indirect and direct effects with a rare outcome and no exposure-mediator interaction. We then develop several calibration approaches to correct for the measurement error-induced bias, and generalize our methods to accommodate a common outcome and an exposure-mediator interaction. We apply the proposed methods to analyse the Health Professionals Follow-up Study (1986-2016) and evaluate the extent to which reduced body mass index mediates the protective effect of physical activity on risk of cardiovascular diseases.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 4","pages":"969-993"},"PeriodicalIF":1.3,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12364547/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144976833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synergistic self-learning approach to establishing individualized treatment rules from multiple benefit outcomes in a calcium supplementation trial. 从补钙试验的多重获益结果建立个体化治疗规则的协同自学习方法。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2025-02-14 eCollection Date: 2025-11-01 DOI: 10.1093/jrsssc/qlaf008
Yiwang Zhou, Peter X K Song

In utero lead exposure poses risks to children's neurobehavioral development. The Early Life Exposure in Mexico to ENvironmental Toxicants' calcium supplementation trial studies the effect of calcium supplement in reducing maternal lead exposure to infants during pregnancy. An individualized treatment rule (ITR) is needed to guide pregnant women on taking calcium supplement. This article introduces a statistical learning method, synergistic self-learning (SS-learning), to tackle two challenges in deriving ITR with multiple outcomes, including heterogeneous multidimensional outcomes and complex missing data patterns. Applying SS-learning to the trial, important covariates were identified to form an ITR, expected to lead to higher lead reduction if implemented across the study population.

在子宫内接触铅会对儿童的神经行为发育造成风险。墨西哥环境毒物的早期生命暴露钙补充试验研究了钙补充在减少怀孕期间母亲对婴儿的铅暴露方面的影响。需要制定个体化治疗规则(ITR)来指导孕妇服用钙补充剂。本文介绍了一种统计学习方法,即协同自学习(SS-learning),以解决在推导具有多种结果的ITR时面临的两个挑战,包括异构多维结果和复杂的缺失数据模式。将ss学习应用于试验,确定了重要的协变量以形成ITR,如果在整个研究人群中实施,预计将导致更高的铅减少。
{"title":"Synergistic self-learning approach to establishing individualized treatment rules from multiple benefit outcomes in a calcium supplementation trial.","authors":"Yiwang Zhou, Peter X K Song","doi":"10.1093/jrsssc/qlaf008","DOIUrl":"10.1093/jrsssc/qlaf008","url":null,"abstract":"<p><p>In utero lead exposure poses risks to children's neurobehavioral development. The Early Life Exposure in Mexico to ENvironmental Toxicants' calcium supplementation trial studies the effect of calcium supplement in reducing maternal lead exposure to infants during pregnancy. An individualized treatment rule (ITR) is needed to guide pregnant women on taking calcium supplement. This article introduces a statistical learning method, synergistic self-learning (SS-learning), to tackle two challenges in deriving ITR with multiple outcomes, including heterogeneous multidimensional outcomes and complex missing data patterns. Applying SS-learning to the trial, important covariates were identified to form an ITR, expected to lead to higher lead reduction if implemented across the study population.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 4","pages":"925-945"},"PeriodicalIF":1.3,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12364546/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144976799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building absolute breast cancer risk prediction models for women treated with chest radiation for Hodgkin lymphoma. 建立霍奇金淋巴瘤胸部放射治疗女性乳腺癌绝对风险预测模型。
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2025-01-03 eCollection Date: 2025-03-01 DOI: 10.1093/jrsssc/qlae063
Sander Roberti, Flora E van Leeuwen, Michael Hauptmann, Ruth M Pfeiffer

We built models to predict absolute breast cancer (BC) risk in women treated with radiotherapy for Hodgkin lymphoma (HL). We first estimated relative risks (RRs) for risk factors, including radiation dose to 10 breast segments to accommodate heterogeneity of treatment effects, using a case-control sample nested in an HL survivor cohort. To estimate RRs of case-control matching factors we developed novel weighting approaches. We then combined RRs with age-specific BC incidence and competing mortality rates from the HL survivor cohort and a population-based registry, accommodating differences between them. We compared the performance of models using segment-specific doses with using mean dose only.

我们建立了模型来预测接受放射治疗的霍奇金淋巴瘤(HL)女性患乳腺癌(BC)的绝对风险。我们首先估算了风险因素的相对风险(rr),包括对10个乳腺节段的辐射剂量,以适应治疗效果的异质性,使用了一个嵌套在HL幸存者队列中的病例对照样本。为了估计病例对照匹配因素的rr,我们开发了新的加权方法。然后,我们将rr与来自HL幸存者队列和基于人群的登记的年龄特异性BC发病率和竞争死亡率相结合,以适应它们之间的差异。我们比较了使用分段特定剂量和仅使用平均剂量的模型的性能。
{"title":"Building absolute breast cancer risk prediction models for women treated with chest radiation for Hodgkin lymphoma.","authors":"Sander Roberti, Flora E van Leeuwen, Michael Hauptmann, Ruth M Pfeiffer","doi":"10.1093/jrsssc/qlae063","DOIUrl":"https://doi.org/10.1093/jrsssc/qlae063","url":null,"abstract":"<p><p>We built models to predict absolute breast cancer (BC) risk in women treated with radiotherapy for Hodgkin lymphoma (HL). We first estimated relative risks (RRs) for risk factors, including radiation dose to 10 breast segments to accommodate heterogeneity of treatment effects, using a case-control sample nested in an HL survivor cohort. To estimate RRs of case-control matching factors we developed novel weighting approaches. We then combined RRs with age-specific BC incidence and competing mortality rates from the HL survivor cohort and a population-based registry, accommodating differences between them. We compared the performance of models using segment-specific doses with using mean dose only.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 2","pages":"466-490"},"PeriodicalIF":1.0,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905882/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143651706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring bivariate associations with continuous data from studies using respondent-driven sampling. 利用调查对象驱动的抽样研究的连续数据推断双变量关联。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-11-26 eCollection Date: 2025-03-01 DOI: 10.1093/jrsssc/qlae061
Samantha Malatesta, Karen R Jacobson, Tara Carney, Eric D Kolaczyk, Krista J Gile, Laura F White

Respondent-driven sampling (RDS) is a link-tracing sampling design that was developed to sample from hidden populations. Although associations between variables are of great interest in epidemiological research, there has been little statistical work on inference on relationships between variables collected through RDS. The link-tracing design, combined with homophily, the tendency for people to connect to others with whom they share characteristics, induces similarity between linked individuals. This dependence inflates the Type 1 error of conventional statistical methods (e.g. t-tests, regression, etc.). A semiparametric randomization test for bivariate association was developed to test for association between two categorical variables. We directly extend this work and propose a semiparametric randomization test for relationships between two variables, when one or both are continuous. We apply our method to variables that are important for understanding tuberculosis epidemiology among people who smoke illicit drugs in Worcester, South Africa.

被调查者驱动抽样(RDS)是一种链接追踪抽样设计,旨在从隐藏人群中进行抽样。尽管流行病学研究对变量之间的关联非常感兴趣,但通过RDS收集的变量之间的关系进行推断的统计工作很少。链接追踪设计与同质性相结合,即人们倾向于与有共同特征的人建立联系,从而在有联系的个体之间产生相似性。这种依赖性放大了传统统计方法(如t检验、回归等)的第一类误差。建立了双变量关联的半参数随机化检验来检验两个分类变量之间的关联。我们直接扩展了这项工作,并提出了两个变量之间关系的半参数随机化检验,当一个或两个变量是连续的。我们将我们的方法应用于变量,这些变量对于了解南非伍斯特的非法吸毒者之间的结核病流行病学很重要。
{"title":"Inferring bivariate associations with continuous data from studies using respondent-driven sampling.","authors":"Samantha Malatesta, Karen R Jacobson, Tara Carney, Eric D Kolaczyk, Krista J Gile, Laura F White","doi":"10.1093/jrsssc/qlae061","DOIUrl":"10.1093/jrsssc/qlae061","url":null,"abstract":"<p><p>Respondent-driven sampling (RDS) is a link-tracing sampling design that was developed to sample from hidden populations. Although associations between variables are of great interest in epidemiological research, there has been little statistical work on inference on relationships between variables collected through RDS. The link-tracing design, combined with homophily, the tendency for people to connect to others with whom they share characteristics, induces similarity between linked individuals. This dependence inflates the Type 1 error of conventional statistical methods (e.g. <i>t</i>-tests, regression, etc.). A semiparametric randomization test for bivariate association was developed to test for association between two categorical variables. We directly extend this work and propose a semiparametric randomization test for relationships between two variables, when one or both are continuous. We apply our method to variables that are important for understanding tuberculosis epidemiology among people who smoke illicit drugs in Worcester, South Africa.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 2","pages":"429-446"},"PeriodicalIF":1.3,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905883/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143651715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate Bayesian variable selection for multi-trait genetic fine mapping. 多性状遗传精细定位的多元贝叶斯变量选择。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-10-28 eCollection Date: 2025-03-01 DOI: 10.1093/jrsssc/qlae055
Travis Canida, Hongjie Ke, Shuo Chen, Zhenyao Ye, Tianzhou Ma

Genome-wide association studies (GWAS) have identified thousands of single-nucleotide polymorphisms (SNPs) associated with complex traits, but determining the underlying causal variants remains challenging. Fine mapping aims to pinpoint the potentially causal variants from a large number of correlated SNPs possibly with group structure in GWAS-enriched genomic regions using variable selection approaches. In multi-trait fine mapping, we are interested in identifying the causal variants for multiple related traits. Existing multivariate variable selection methods for fine mapping select variables for all responses without considering the possible heterogeneity across different responses. Here, we develop a novel multivariate Bayesian variable selection method for multi-trait fine mapping to select causal variants from a large number of grouped SNPs that target at multiple correlated and possibly heterogeneous traits. Our new method is featured by its selection at multiple levels, incorporation of prior biological knowledge to guide selection and identification of best subset of traits the variants target at. We showed the advantage of our method over existing methods via comprehensive simulations that mimic typical fine-mapping settings and a real-world fine-mapping example in UK Biobank, where we identified critical causal variants potentially targeting at different subsets of addictive behaviours and risk factors.

全基因组关联研究(GWAS)已经确定了数千种与复杂性状相关的单核苷酸多态性(snp),但确定潜在的因果变异仍然具有挑战性。精细定位的目的是利用变量选择方法,从大量可能与gwas富集基因组区域的群体结构相关的snp中找出潜在的因果变异。在多性状精细映射中,我们感兴趣的是识别多个相关性状的因果变异。现有的多变量选择方法用于精细映射选择所有响应的变量,而没有考虑不同响应之间可能存在的异质性。在这里,我们开发了一种新的多元贝叶斯变量选择方法,用于多性状精细映射,从大量针对多个相关和可能异质性状的snp分组中选择因果变异。我们的新方法的特点是多层次的选择,结合先前的生物学知识来指导选择和识别变异所针对的最佳性状子集。我们通过全面模拟模拟典型的精细映射设置和英国生物银行的现实世界精细映射示例,展示了我们的方法优于现有方法的优势,在英国生物银行中,我们确定了潜在针对不同成瘾行为和风险因素子集的关键因果变异。
{"title":"Multivariate Bayesian variable selection for multi-trait genetic fine mapping.","authors":"Travis Canida, Hongjie Ke, Shuo Chen, Zhenyao Ye, Tianzhou Ma","doi":"10.1093/jrsssc/qlae055","DOIUrl":"10.1093/jrsssc/qlae055","url":null,"abstract":"<p><p>Genome-wide association studies (GWAS) have identified thousands of single-nucleotide polymorphisms (SNPs) associated with complex traits, but determining the underlying causal variants remains challenging. Fine mapping aims to pinpoint the potentially causal variants from a large number of correlated SNPs possibly with group structure in GWAS-enriched genomic regions using variable selection approaches. In multi-trait fine mapping, we are interested in identifying the causal variants for multiple related traits. Existing multivariate variable selection methods for fine mapping select variables for all responses without considering the possible heterogeneity across different responses. Here, we develop a novel multivariate Bayesian variable selection method for multi-trait fine mapping to select causal variants from a large number of grouped SNPs that target at multiple correlated and possibly heterogeneous traits. Our new method is featured by its selection at multiple levels, incorporation of prior biological knowledge to guide selection and identification of best subset of traits the variants target at. We showed the advantage of our method over existing methods via comprehensive simulations that mimic typical fine-mapping settings and a real-world fine-mapping example in UK Biobank, where we identified critical causal variants potentially targeting at different subsets of addictive behaviours and risk factors.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 2","pages":"331-351"},"PeriodicalIF":1.3,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905884/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143651720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
tdCoxSNN: Time-dependent Cox survival neural network for continuous-time dynamic prediction. tdCoxSNN:用于连续时间动态预测的时变Cox生存神经网络。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-10-11 eCollection Date: 2025-01-01 DOI: 10.1093/jrsssc/qlae051
Lang Zeng, Jipeng Zhang, Wei Chen, Ying Ding

The aim of dynamic prediction is to provide individualized risk predictions over time, which are updated as new data become available. In pursuit of constructing a dynamic prediction model for a progressive eye disorder, age-related macular degeneration (AMD), we propose a time-dependent Cox survival neural network (tdCoxSNN) to predict its progression using longitudinal fundus images. tdCoxSNN builds upon the time-dependent Cox model by utilizing a neural network to capture the nonlinear effect of time-dependent covariates on the survival outcome. Moreover, by concurrently integrating a convolutional neural network with the survival network, tdCoxSNN can directly take longitudinal images as input. We evaluate and compare our proposed method with joint modelling and landmarking approaches through extensive simulations. We applied the proposed approach to two real datasets. One is a large AMD study, the Age-Related Eye Disease Study, in which more than 50,000 fundus images were captured over a period of 12 years for more than 4,000 participants. Another is a public dataset of the primary biliary cirrhosis disease, where multiple laboratory tests were longitudinally collected to predict the time-to-liver transplant. Our approach demonstrates commendable predictive performance in both simulation studies and the analysis of the two real datasets.

动态预测的目的是随着时间的推移提供个性化的风险预测,并随着新数据的出现而更新。为了构建进行性眼病——年龄相关性黄斑变性(AMD)的动态预测模型,我们提出了一种时间依赖的Cox生存神经网络(tdCoxSNN),利用眼底纵向图像预测其进展。tdCoxSNN建立在时间依赖的Cox模型上,利用神经网络捕捉时间依赖协变量对生存结果的非线性影响。此外,通过卷积神经网络与生存网络的并行集成,tdCoxSNN可以直接将纵向图像作为输入。我们通过广泛的模拟来评估和比较我们提出的方法与联合建模和地标方法。我们将提出的方法应用于两个真实的数据集。其中一个是一项大型的AMD研究,即与年龄相关的眼病研究,在12年的时间里,4000多名参与者拍摄了5万多张眼底图像。另一个是原发性胆汁性肝硬化疾病的公共数据集,其中纵向收集多项实验室测试以预测肝移植时间。我们的方法在模拟研究和两个真实数据集的分析中都显示出值得称赞的预测性能。
{"title":"tdCoxSNN: Time-dependent Cox survival neural network for continuous-time dynamic prediction.","authors":"Lang Zeng, Jipeng Zhang, Wei Chen, Ying Ding","doi":"10.1093/jrsssc/qlae051","DOIUrl":"10.1093/jrsssc/qlae051","url":null,"abstract":"<p><p>The aim of dynamic prediction is to provide individualized risk predictions over time, which are updated as new data become available. In pursuit of constructing a dynamic prediction model for a progressive eye disorder, age-related macular degeneration (AMD), we propose a time-dependent Cox survival neural network (tdCoxSNN) to predict its progression using longitudinal fundus images. tdCoxSNN builds upon the time-dependent Cox model by utilizing a neural network to capture the nonlinear effect of time-dependent covariates on the survival outcome. Moreover, by concurrently integrating a convolutional neural network with the survival network, tdCoxSNN can directly take longitudinal images as input. We evaluate and compare our proposed method with joint modelling and landmarking approaches through extensive simulations. We applied the proposed approach to two real datasets. One is a large AMD study, the Age-Related Eye Disease Study, in which more than 50,000 fundus images were captured over a period of 12 years for more than 4,000 participants. Another is a public dataset of the primary biliary cirrhosis disease, where multiple laboratory tests were longitudinally collected to predict the time-to-liver transplant. Our approach demonstrates commendable predictive performance in both simulation studies and the analysis of the two real datasets.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 1","pages":"187-203"},"PeriodicalIF":1.3,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring the impact of new risk factors within survival models. 在生存模型中测量新的风险因素的影响。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-09-03 eCollection Date: 2025-01-01 DOI: 10.1093/jrsssc/qlae045
Glenn Heller, Sean M Devlin

Survival is poor for patients with metastatic cancer, and it is vital to examine new biomarkers that can improve patient prognostication and identify those who would benefit from more aggressive therapy. In metastatic prostate cancer, 2 new assays have become available: one that quantifies the number of cancer cells circulating in the peripheral blood, and the other a marker of the aggressiveness of the disease. It is critical to determine the magnitude of the effect of these biomarkers on the discrimination of a model-based risk score. To do so, most analysts frequently consider the discrimination of 2 separate survival models: one that includes both the new and standard factors and a second that includes the standard factors alone. However, this analysis is ultimately incorrect for many of the scale-transformation models ubiquitous in survival, as the reduced model is misspecified if the full model is specified correctly. To circumvent this issue, we developed a projection-based approach to estimate the impact of the 2 prostate cancer biomarkers. The results indicate that the new biomarkers can influence model discrimination and justify their inclusion in the risk model; however, the hunt remains for an applicable model to risk-stratify patients with metastatic prostate cancer.

转移性癌症患者的生存率很低,因此检查新的生物标志物以改善患者预后并确定哪些患者将从更积极的治疗中受益至关重要。在转移性前列腺癌中,有两种新的检测方法可用:一种是量化外周血中循环的癌细胞数量,另一种是疾病侵袭性的标志。确定这些生物标志物对基于模型的风险评分的区分的影响程度是至关重要的。为了做到这一点,大多数分析师经常考虑两种独立生存模型的区别:一种既包括新因素又包括标准因素,另一种只包括标准因素。然而,对于生存中普遍存在的许多尺度转换模型来说,这种分析最终是不正确的,因为如果正确地指定了完整模型,则错误地指定了简化模型。为了避免这个问题,我们开发了一种基于预测的方法来估计两种前列腺癌生物标志物的影响。结果表明,新的生物标志物可以影响模型判别,并证明将其纳入风险模型是合理的;然而,对转移性前列腺癌患者进行风险分层的适用模型仍有待研究。
{"title":"Measuring the impact of new risk factors within survival models.","authors":"Glenn Heller, Sean M Devlin","doi":"10.1093/jrsssc/qlae045","DOIUrl":"10.1093/jrsssc/qlae045","url":null,"abstract":"<p><p>Survival is poor for patients with metastatic cancer, and it is vital to examine new biomarkers that can improve patient prognostication and identify those who would benefit from more aggressive therapy. In metastatic prostate cancer, 2 new assays have become available: one that quantifies the number of cancer cells circulating in the peripheral blood, and the other a marker of the aggressiveness of the disease. It is critical to determine the magnitude of the effect of these biomarkers on the discrimination of a model-based risk score. To do so, most analysts frequently consider the discrimination of 2 separate survival models: one that includes both the new and standard factors and a second that includes the standard factors alone. However, this analysis is ultimately incorrect for many of the scale-transformation models ubiquitous in survival, as the reduced model is misspecified if the full model is specified correctly. To circumvent this issue, we developed a projection-based approach to estimate the impact of the 2 prostate cancer biomarkers. The results indicate that the new biomarkers can influence model discrimination and justify their inclusion in the risk model; however, the hunt remains for an applicable model to risk-stratify patients with metastatic prostate cancer.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 1","pages":"83-99"},"PeriodicalIF":1.3,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725343/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-parametric Bayesian approach to multiple treatment comparisons in network meta-analysis with application to comparisons of anti-depressants. 网络荟萃分析中多重治疗比较的非参数贝叶斯方法,应用于抗抑郁药的比较。
IF 1.3 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-09-02 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae038
Andrés F Barrientos, Garritt L Page, Lifeng Lin

Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issues, and rarely permit possible ties of treatments with similar performance. These issues make interpreting rankings problematic as they are often treated as absolute metrics. To address these shortcomings, we formulate a ranking strategy that adapts to scenarios with high-order uncertainty by producing more conservative results. This improves the interpretability while simultaneously accounting for multiple comparisons. To admit ties between treatment effects in cases where differences between treatment effects are negligible, we also develop a Bayesian non-parametric approach for network meta-analysis. The approach capitalizes on the induced clustering mechanism of Bayesian non-parametric methods, producing a positive probability that two treatment effects are equal. We demonstrate the utility of the procedure through numerical experiments and a network meta-analysis designed to study antidepressant treatments.

网络荟萃分析是一种强大的工具,可以综合来自独立研究的证据,同时比较多种治疗方法。进行网络荟萃分析的一项关键任务是提供针对特定疾病结果的所有可用治疗方案的排名。通常情况下,估算出的治疗排名存在很大的不确定性,存在多重性问题,而且很少允许性能相似的治疗方案并列。这些问题使得排名的解释成为问题,因为它们通常被视为绝对指标。为了解决这些缺陷,我们制定了一种排名策略,通过产生更保守的结果来适应具有高阶不确定性的情景。这在提高可解释性的同时,也考虑到了多重比较。为了在治疗效果之间的差异可以忽略不计的情况下承认治疗效果之间的联系,我们还开发了一种用于网络荟萃分析的贝叶斯非参数方法。该方法利用了贝叶斯非参数方法的诱导聚类机制,产生了两个治疗效果相等的正概率。我们通过数值实验和一项旨在研究抗抑郁治疗的网络荟萃分析,展示了该方法的实用性。
{"title":"Non-parametric Bayesian approach to multiple treatment comparisons in network meta-analysis with application to comparisons of anti-depressants.","authors":"Andrés F Barrientos, Garritt L Page, Lifeng Lin","doi":"10.1093/jrsssc/qlae038","DOIUrl":"10.1093/jrsssc/qlae038","url":null,"abstract":"<p><p>Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issues, and rarely permit possible ties of treatments with similar performance. These issues make interpreting rankings problematic as they are often treated as absolute metrics. To address these shortcomings, we formulate a ranking strategy that adapts to scenarios with high-order uncertainty by producing more conservative results. This improves the interpretability while simultaneously accounting for multiple comparisons. To admit ties between treatment effects in cases where differences between treatment effects are negligible, we also develop a Bayesian non-parametric approach for network meta-analysis. The approach capitalizes on the induced clustering mechanism of Bayesian non-parametric methods, producing a positive probability that two treatment effects are equal. We demonstrate the utility of the procedure through numerical experiments and a network meta-analysis designed to study antidepressant treatments.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1333-1354"},"PeriodicalIF":1.3,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Royal Statistical Society Series C-Applied Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1