Journal of the Royal Statistical Society Series C-Applied Statistics最新文献

Precision Mental Health: Predicting Heterogeneous Treatment Effects for Depression through Data Integration. 精准心理健康：通过数据整合预测抑郁症的异质性治疗效果。

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2025-12-12 DOI: 10.1093/jrsssc/qlaf068

Carly L Brantner, Trang Quynh Nguyen, Harsh Parikh, Congwen Zhao, Hwanhee Hong, Elizabeth A Stuart

When treating depression, clinicians are interested in determining the optimal treatment for a given patient, which is challenging given the amount of treatments available. To advance individualized treatment allocation, integrating data across multiple randomized controlled trials (RCTs) can enhance our understanding of treatment effect heterogeneity by increasing available information. However, extending these inferences to individuals outside of the original RCTs remains crucial for clinical decision-making. We introduce a two-stage meta-analytic method that predicts conditional average treatment effects (CATEs) in target patient populations by leveraging the distribution of CATEs across RCTs. Our approach generates 95% prediction intervals for CATEs in target settings using first-stage models that can incorporate parametric regression or non-parametric methods such as causal forests or Bayesian additive regression trees (BART). We validate our method through simulation studies and operationalize it to integrate multiple RCTs comparing depression treatments, duloxetine and vortioxetine, to generate prediction intervals for target patient profiles. Our analysis reveals no strong evidence of effect heterogeneity across trials, with the exception of potential age-related variability. Importantly, we show that CATE prediction intervals capture broader uncertainty than study-specific confidence intervals when warranted, reflecting both within-study and between-study variability.

在治疗抑郁症时，临床医生感兴趣的是确定针对特定患者的最佳治疗方法，考虑到现有的治疗方法数量，这是具有挑战性的。为了推进个体化治疗分配，整合多个随机对照试验（RCTs）的数据可以通过增加可用信息来增强我们对治疗效果异质性的理解。然而，将这些推论扩展到原始随机对照试验之外的个体，对于临床决策仍然至关重要。我们引入了一种两阶段荟萃分析方法，通过利用随机对照试验中条件平均治疗效果（CATEs）的分布来预测目标患者群体的条件平均治疗效果（CATEs）。我们的方法使用第一阶段模型在目标设置中生成95%的CATEs预测区间，该模型可以结合参数回归或非参数方法，如因果森林或贝叶斯加性回归树（BART）。我们通过模拟研究验证了我们的方法，并将其应用于多个比较抑郁症治疗、度洛西汀和沃替西汀的随机对照试验，以生成目标患者概况的预测区间。我们的分析显示，除了潜在的年龄相关变异性外，没有强有力的证据表明不同试验的效果存在异质性。重要的是，我们表明，在有保证的情况下，CATE预测区间比研究特定置信区间捕捉到更大的不确定性，反映了研究内和研究间的可变性。

{"title":"Precision Mental Health: Predicting Heterogeneous Treatment Effects for Depression through Data Integration.","authors":"Carly L Brantner, Trang Quynh Nguyen, Harsh Parikh, Congwen Zhao, Hwanhee Hong, Elizabeth A Stuart","doi":"10.1093/jrsssc/qlaf068","DOIUrl":"10.1093/jrsssc/qlaf068","url":null,"abstract":"When treating depression, clinicians are interested in determining the optimal treatment for a given patient, which is challenging given the amount of treatments available. To advance individualized treatment allocation, integrating data across multiple randomized controlled trials (RCTs) can enhance our understanding of treatment effect heterogeneity by increasing available information. However, extending these inferences to individuals outside of the original RCTs remains crucial for clinical decision-making. We introduce a two-stage meta-analytic method that predicts conditional average treatment effects (CATEs) in target patient populations by leveraging the distribution of CATEs across RCTs. Our approach generates 95% prediction intervals for CATEs in target settings using first-stage models that can incorporate parametric regression or non-parametric methods such as causal forests or Bayesian additive regression trees (BART). We validate our method through simulation studies and operationalize it to integrate multiple RCTs comparing depression treatments, duloxetine and vortioxetine, to generate prediction intervals for target patient profiles. Our analysis reveals no strong evidence of effect heterogeneity across trials, with the exception of potential age-related variability. Importantly, we show that CATE prediction intervals capture broader uncertainty than study-specific confidence intervals when warranted, reflecting both within-study and between-study variability.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":" ","pages":""},"PeriodicalIF":1.3,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12768387/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145913599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2025-06-01 Epub Date: 2025-01-23 DOI: 10.1093/jrsssc/qlaf001

Duo Yu, Meilin Huang, Michael J Kane, Brian P Hobbs

This paper introduces a novel statistical methodology integrating machine learning (ML) and Bayesian modelling to facilitate personalized prognostic predictions with application to oncology. Utilizing power priors, we construct 'patient-similarity embeddings' that identify localized patterns of prognosis. The methodology is applied to study the prognostic value of markers of anticancer immunity within the tumour microenvironment of nonsmall cell lung cancer while adjusting for established clinical characteristics. The method outperforms traditional regression and ML models, while accurately identifying subgroup patterns, thereby enhancing statistical inference and hypothesis testing.

本文介绍了一种集成机器学习（ML）和贝叶斯模型的新型统计方法，以促进个性化预后预测并应用于肿瘤学。利用权力先验，我们构建了“患者相似嵌入”来识别局部预后模式。该方法用于研究非小细胞肺癌肿瘤微环境中抗癌免疫标志物的预后价值，同时根据已建立的临床特征进行调整。该方法优于传统的回归模型和ML模型，同时准确地识别子群模式，从而增强了统计推断和假设检验。

引用次数: 0

Building absolute breast cancer risk prediction models for women treated with chest radiation for Hodgkin lymphoma. 建立霍奇金淋巴瘤胸部放射治疗女性乳腺癌绝对风险预测模型。

IF 1 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2025-01-03 eCollection Date: 2025-03-01 DOI: 10.1093/jrsssc/qlae063

Sander Roberti, Flora E van Leeuwen, Michael Hauptmann, Ruth M Pfeiffer

We built models to predict absolute breast cancer (BC) risk in women treated with radiotherapy for Hodgkin lymphoma (HL). We first estimated relative risks (RRs) for risk factors, including radiation dose to 10 breast segments to accommodate heterogeneity of treatment effects, using a case-control sample nested in an HL survivor cohort. To estimate RRs of case-control matching factors we developed novel weighting approaches. We then combined RRs with age-specific BC incidence and competing mortality rates from the HL survivor cohort and a population-based registry, accommodating differences between them. We compared the performance of models using segment-specific doses with using mean dose only.

我们建立了模型来预测接受放射治疗的霍奇金淋巴瘤（HL）女性患乳腺癌（BC）的绝对风险。我们首先估算了风险因素的相对风险（rr），包括对10个乳腺节段的辐射剂量，以适应治疗效果的异质性，使用了一个嵌套在HL幸存者队列中的病例对照样本。为了估计病例对照匹配因素的rr，我们开发了新的加权方法。然后，我们将rr与来自HL幸存者队列和基于人群的登记的年龄特异性BC发病率和竞争死亡率相结合，以适应它们之间的差异。我们比较了使用分段特定剂量和仅使用平均剂量的模型的性能。

引用次数: 0

Inferring bivariate associations with continuous data from studies using respondent-driven sampling. 利用调查对象驱动的抽样研究的连续数据推断双变量关联。

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2024-11-26 eCollection Date: 2025-03-01 DOI: 10.1093/jrsssc/qlae061

Samantha Malatesta, Karen R Jacobson, Tara Carney, Eric D Kolaczyk, Krista J Gile, Laura F White

Respondent-driven sampling (RDS) is a link-tracing sampling design that was developed to sample from hidden populations. Although associations between variables are of great interest in epidemiological research, there has been little statistical work on inference on relationships between variables collected through RDS. The link-tracing design, combined with homophily, the tendency for people to connect to others with whom they share characteristics, induces similarity between linked individuals. This dependence inflates the Type 1 error of conventional statistical methods (e.g. t-tests, regression, etc.). A semiparametric randomization test for bivariate association was developed to test for association between two categorical variables. We directly extend this work and propose a semiparametric randomization test for relationships between two variables, when one or both are continuous. We apply our method to variables that are important for understanding tuberculosis epidemiology among people who smoke illicit drugs in Worcester, South Africa.

被调查者驱动抽样（RDS）是一种链接追踪抽样设计，旨在从隐藏人群中进行抽样。尽管流行病学研究对变量之间的关联非常感兴趣，但通过RDS收集的变量之间的关系进行推断的统计工作很少。链接追踪设计与同质性相结合，即人们倾向于与有共同特征的人建立联系，从而在有联系的个体之间产生相似性。这种依赖性放大了传统统计方法（如t检验、回归等）的第一类误差。建立了双变量关联的半参数随机化检验来检验两个分类变量之间的关联。我们直接扩展了这项工作，并提出了两个变量之间关系的半参数随机化检验，当一个或两个变量是连续的。我们将我们的方法应用于变量，这些变量对于了解南非伍斯特的非法吸毒者之间的结核病流行病学很重要。

{"title":"Inferring bivariate associations with continuous data from studies using respondent-driven sampling.","authors":"Samantha Malatesta, Karen R Jacobson, Tara Carney, Eric D Kolaczyk, Krista J Gile, Laura F White","doi":"10.1093/jrsssc/qlae061","DOIUrl":"10.1093/jrsssc/qlae061","url":null,"abstract":"Respondent-driven sampling (RDS) is a link-tracing sampling design that was developed to sample from hidden populations. Although associations between variables are of great interest in epidemiological research, there has been little statistical work on inference on relationships between variables collected through RDS. The link-tracing design, combined with homophily, the tendency for people to connect to others with whom they share characteristics, induces similarity between linked individuals. This dependence inflates the Type 1 error of conventional statistical methods (e.g. t-tests, regression, etc.). A semiparametric randomization test for bivariate association was developed to test for association between two categorical variables. We directly extend this work and propose a semiparametric randomization test for relationships between two variables, when one or both are continuous. We apply our method to variables that are important for understanding tuberculosis epidemiology among people who smoke illicit drugs in Worcester, South Africa.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 2","pages":"429-446"},"PeriodicalIF":1.3,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905883/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143651715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multivariate Bayesian variable selection for multi-trait genetic fine mapping. 多性状遗传精细定位的多元贝叶斯变量选择。

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2024-10-28 eCollection Date: 2025-03-01 DOI: 10.1093/jrsssc/qlae055

Travis Canida, Hongjie Ke, Shuo Chen, Zhenyao Ye, Tianzhou Ma

Genome-wide association studies (GWAS) have identified thousands of single-nucleotide polymorphisms (SNPs) associated with complex traits, but determining the underlying causal variants remains challenging. Fine mapping aims to pinpoint the potentially causal variants from a large number of correlated SNPs possibly with group structure in GWAS-enriched genomic regions using variable selection approaches. In multi-trait fine mapping, we are interested in identifying the causal variants for multiple related traits. Existing multivariate variable selection methods for fine mapping select variables for all responses without considering the possible heterogeneity across different responses. Here, we develop a novel multivariate Bayesian variable selection method for multi-trait fine mapping to select causal variants from a large number of grouped SNPs that target at multiple correlated and possibly heterogeneous traits. Our new method is featured by its selection at multiple levels, incorporation of prior biological knowledge to guide selection and identification of best subset of traits the variants target at. We showed the advantage of our method over existing methods via comprehensive simulations that mimic typical fine-mapping settings and a real-world fine-mapping example in UK Biobank, where we identified critical causal variants potentially targeting at different subsets of addictive behaviours and risk factors.

全基因组关联研究（GWAS）已经确定了数千种与复杂性状相关的单核苷酸多态性（snp），但确定潜在的因果变异仍然具有挑战性。精细定位的目的是利用变量选择方法，从大量可能与gwas富集基因组区域的群体结构相关的snp中找出潜在的因果变异。在多性状精细映射中，我们感兴趣的是识别多个相关性状的因果变异。现有的多变量选择方法用于精细映射选择所有响应的变量，而没有考虑不同响应之间可能存在的异质性。在这里，我们开发了一种新的多元贝叶斯变量选择方法，用于多性状精细映射，从大量针对多个相关和可能异质性状的snp分组中选择因果变异。我们的新方法的特点是多层次的选择，结合先前的生物学知识来指导选择和识别变异所针对的最佳性状子集。我们通过全面模拟模拟典型的精细映射设置和英国生物银行的现实世界精细映射示例，展示了我们的方法优于现有方法的优势，在英国生物银行中，我们确定了潜在针对不同成瘾行为和风险因素子集的关键因果变异。

{"title":"Multivariate Bayesian variable selection for multi-trait genetic fine mapping.","authors":"Travis Canida, Hongjie Ke, Shuo Chen, Zhenyao Ye, Tianzhou Ma","doi":"10.1093/jrsssc/qlae055","DOIUrl":"10.1093/jrsssc/qlae055","url":null,"abstract":"Genome-wide association studies (GWAS) have identified thousands of single-nucleotide polymorphisms (SNPs) associated with complex traits, but determining the underlying causal variants remains challenging. Fine mapping aims to pinpoint the potentially causal variants from a large number of correlated SNPs possibly with group structure in GWAS-enriched genomic regions using variable selection approaches. In multi-trait fine mapping, we are interested in identifying the causal variants for multiple related traits. Existing multivariate variable selection methods for fine mapping select variables for all responses without considering the possible heterogeneity across different responses. Here, we develop a novel multivariate Bayesian variable selection method for multi-trait fine mapping to select causal variants from a large number of grouped SNPs that target at multiple correlated and possibly heterogeneous traits. Our new method is featured by its selection at multiple levels, incorporation of prior biological knowledge to guide selection and identification of best subset of traits the variants target at. We showed the advantage of our method over existing methods via comprehensive simulations that mimic typical fine-mapping settings and a real-world fine-mapping example in UK Biobank, where we identified critical causal variants potentially targeting at different subsets of addictive behaviours and risk factors.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 2","pages":"331-351"},"PeriodicalIF":1.3,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11905884/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143651720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

tdCoxSNN: Time-dependent Cox survival neural network for continuous-time dynamic prediction. tdCoxSNN：用于连续时间动态预测的时变Cox生存神经网络。

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2024-10-11 eCollection Date: 2025-01-01 DOI: 10.1093/jrsssc/qlae051

Lang Zeng, Jipeng Zhang, Wei Chen, Ying Ding

The aim of dynamic prediction is to provide individualized risk predictions over time, which are updated as new data become available. In pursuit of constructing a dynamic prediction model for a progressive eye disorder, age-related macular degeneration (AMD), we propose a time-dependent Cox survival neural network (tdCoxSNN) to predict its progression using longitudinal fundus images. tdCoxSNN builds upon the time-dependent Cox model by utilizing a neural network to capture the nonlinear effect of time-dependent covariates on the survival outcome. Moreover, by concurrently integrating a convolutional neural network with the survival network, tdCoxSNN can directly take longitudinal images as input. We evaluate and compare our proposed method with joint modelling and landmarking approaches through extensive simulations. We applied the proposed approach to two real datasets. One is a large AMD study, the Age-Related Eye Disease Study, in which more than 50,000 fundus images were captured over a period of 12 years for more than 4,000 participants. Another is a public dataset of the primary biliary cirrhosis disease, where multiple laboratory tests were longitudinally collected to predict the time-to-liver transplant. Our approach demonstrates commendable predictive performance in both simulation studies and the analysis of the two real datasets.

动态预测的目的是随着时间的推移提供个性化的风险预测，并随着新数据的出现而更新。为了构建进行性眼病——年龄相关性黄斑变性（AMD）的动态预测模型，我们提出了一种时间依赖的Cox生存神经网络（tdCoxSNN），利用眼底纵向图像预测其进展。tdCoxSNN建立在时间依赖的Cox模型上，利用神经网络捕捉时间依赖协变量对生存结果的非线性影响。此外，通过卷积神经网络与生存网络的并行集成，tdCoxSNN可以直接将纵向图像作为输入。我们通过广泛的模拟来评估和比较我们提出的方法与联合建模和地标方法。我们将提出的方法应用于两个真实的数据集。其中一个是一项大型的AMD研究，即与年龄相关的眼病研究，在12年的时间里，4000多名参与者拍摄了5万多张眼底图像。另一个是原发性胆汁性肝硬化疾病的公共数据集，其中纵向收集多项实验室测试以预测肝移植时间。我们的方法在模拟研究和两个真实数据集的分析中都显示出值得称赞的预测性能。

{"title":"tdCoxSNN: Time-dependent Cox survival neural network for continuous-time dynamic prediction.","authors":"Lang Zeng, Jipeng Zhang, Wei Chen, Ying Ding","doi":"10.1093/jrsssc/qlae051","DOIUrl":"10.1093/jrsssc/qlae051","url":null,"abstract":"The aim of dynamic prediction is to provide individualized risk predictions over time, which are updated as new data become available. In pursuit of constructing a dynamic prediction model for a progressive eye disorder, age-related macular degeneration (AMD), we propose a time-dependent Cox survival neural network (tdCoxSNN) to predict its progression using longitudinal fundus images. tdCoxSNN builds upon the time-dependent Cox model by utilizing a neural network to capture the nonlinear effect of time-dependent covariates on the survival outcome. Moreover, by concurrently integrating a convolutional neural network with the survival network, tdCoxSNN can directly take longitudinal images as input. We evaluate and compare our proposed method with joint modelling and landmarking approaches through extensive simulations. We applied the proposed approach to two real datasets. One is a large AMD study, the Age-Related Eye Disease Study, in which more than 50,000 fundus images were captured over a period of 12 years for more than 4,000 participants. Another is a public dataset of the primary biliary cirrhosis disease, where multiple laboratory tests were longitudinally collected to predict the time-to-liver transplant. Our approach demonstrates commendable predictive performance in both simulation studies and the analysis of the two real datasets.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 1","pages":"187-203"},"PeriodicalIF":1.3,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Measuring the impact of new risk factors within survival models. 在生存模型中测量新的风险因素的影响。

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2024-09-03 eCollection Date: 2025-01-01 DOI: 10.1093/jrsssc/qlae045

Glenn Heller, Sean M Devlin

Survival is poor for patients with metastatic cancer, and it is vital to examine new biomarkers that can improve patient prognostication and identify those who would benefit from more aggressive therapy. In metastatic prostate cancer, 2 new assays have become available: one that quantifies the number of cancer cells circulating in the peripheral blood, and the other a marker of the aggressiveness of the disease. It is critical to determine the magnitude of the effect of these biomarkers on the discrimination of a model-based risk score. To do so, most analysts frequently consider the discrimination of 2 separate survival models: one that includes both the new and standard factors and a second that includes the standard factors alone. However, this analysis is ultimately incorrect for many of the scale-transformation models ubiquitous in survival, as the reduced model is misspecified if the full model is specified correctly. To circumvent this issue, we developed a projection-based approach to estimate the impact of the 2 prostate cancer biomarkers. The results indicate that the new biomarkers can influence model discrimination and justify their inclusion in the risk model; however, the hunt remains for an applicable model to risk-stratify patients with metastatic prostate cancer.

转移性癌症患者的生存率很低，因此检查新的生物标志物以改善患者预后并确定哪些患者将从更积极的治疗中受益至关重要。在转移性前列腺癌中，有两种新的检测方法可用：一种是量化外周血中循环的癌细胞数量，另一种是疾病侵袭性的标志。确定这些生物标志物对基于模型的风险评分的区分的影响程度是至关重要的。为了做到这一点，大多数分析师经常考虑两种独立生存模型的区别：一种既包括新因素又包括标准因素，另一种只包括标准因素。然而，对于生存中普遍存在的许多尺度转换模型来说，这种分析最终是不正确的，因为如果正确地指定了完整模型，则错误地指定了简化模型。为了避免这个问题，我们开发了一种基于预测的方法来估计两种前列腺癌生物标志物的影响。结果表明，新的生物标志物可以影响模型判别，并证明将其纳入风险模型是合理的；然而，对转移性前列腺癌患者进行风险分层的适用模型仍有待研究。

{"title":"Measuring the impact of new risk factors within survival models.","authors":"Glenn Heller, Sean M Devlin","doi":"10.1093/jrsssc/qlae045","DOIUrl":"10.1093/jrsssc/qlae045","url":null,"abstract":"Survival is poor for patients with metastatic cancer, and it is vital to examine new biomarkers that can improve patient prognostication and identify those who would benefit from more aggressive therapy. In metastatic prostate cancer, 2 new assays have become available: one that quantifies the number of cancer cells circulating in the peripheral blood, and the other a marker of the aggressiveness of the disease. It is critical to determine the magnitude of the effect of these biomarkers on the discrimination of a model-based risk score. To do so, most analysts frequently consider the discrimination of 2 separate survival models: one that includes both the new and standard factors and a second that includes the standard factors alone. However, this analysis is ultimately incorrect for many of the scale-transformation models ubiquitous in survival, as the reduced model is misspecified if the full model is specified correctly. To circumvent this issue, we developed a projection-based approach to estimate the impact of the 2 prostate cancer biomarkers. The results indicate that the new biomarkers can influence model discrimination and justify their inclusion in the risk model; however, the hunt remains for an applicable model to risk-stratify patients with metastatic prostate cancer.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 1","pages":"83-99"},"PeriodicalIF":1.3,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725343/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Non-parametric Bayesian approach to multiple treatment comparisons in network meta-analysis with application to comparisons of anti-depressants. 网络荟萃分析中多重治疗比较的非参数贝叶斯方法，应用于抗抑郁药的比较。

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2024-09-02 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae038

Andrés F Barrientos, Garritt L Page, Lifeng Lin

Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issues, and rarely permit possible ties of treatments with similar performance. These issues make interpreting rankings problematic as they are often treated as absolute metrics. To address these shortcomings, we formulate a ranking strategy that adapts to scenarios with high-order uncertainty by producing more conservative results. This improves the interpretability while simultaneously accounting for multiple comparisons. To admit ties between treatment effects in cases where differences between treatment effects are negligible, we also develop a Bayesian non-parametric approach for network meta-analysis. The approach capitalizes on the induced clustering mechanism of Bayesian non-parametric methods, producing a positive probability that two treatment effects are equal. We demonstrate the utility of the procedure through numerical experiments and a network meta-analysis designed to study antidepressant treatments.

网络荟萃分析是一种强大的工具，可以综合来自独立研究的证据，同时比较多种治疗方法。进行网络荟萃分析的一项关键任务是提供针对特定疾病结果的所有可用治疗方案的排名。通常情况下，估算出的治疗排名存在很大的不确定性，存在多重性问题，而且很少允许性能相似的治疗方案并列。这些问题使得排名的解释成为问题，因为它们通常被视为绝对指标。为了解决这些缺陷，我们制定了一种排名策略，通过产生更保守的结果来适应具有高阶不确定性的情景。这在提高可解释性的同时，也考虑到了多重比较。为了在治疗效果之间的差异可以忽略不计的情况下承认治疗效果之间的联系，我们还开发了一种用于网络荟萃分析的贝叶斯非参数方法。该方法利用了贝叶斯非参数方法的诱导聚类机制，产生了两个治疗效果相等的正概率。我们通过数值实验和一项旨在研究抗抑郁治疗的网络荟萃分析，展示了该方法的实用性。

{"title":"Non-parametric Bayesian approach to multiple treatment comparisons in network meta-analysis with application to comparisons of anti-depressants.","authors":"Andrés F Barrientos, Garritt L Page, Lifeng Lin","doi":"10.1093/jrsssc/qlae038","DOIUrl":"10.1093/jrsssc/qlae038","url":null,"abstract":"Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issues, and rarely permit possible ties of treatments with similar performance. These issues make interpreting rankings problematic as they are often treated as absolute metrics. To address these shortcomings, we formulate a ranking strategy that adapts to scenarios with high-order uncertainty by producing more conservative results. This improves the interpretability while simultaneously accounting for multiple comparisons. To admit ties between treatment effects in cases where differences between treatment effects are negligible, we also develop a Bayesian non-parametric approach for network meta-analysis. The approach capitalizes on the induced clustering mechanism of Bayesian non-parametric methods, producing a positive probability that two treatment effects are equal. We demonstrate the utility of the procedure through numerical experiments and a network meta-analysis designed to study antidepressant treatments.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1333-1354"},"PeriodicalIF":1.3,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Joint modelling of survival and backwards recurrence outcomes: an analysis of factors associated with fertility treatment in the U.S. 存活率和逆向复发结果的联合建模：美国生育治疗相关因素分析

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2024-08-19 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae039

Siyuan Guo, Jiajia Zhang, Alexander C McLain

The motivation for this paper is to determine factors associated with time-to-fertility treatment (TTFT) among women currently attempting pregnancy in a cross-sectional sample. Challenges arise due to dependence between time-to-pregnancy (TTP) and TTFT. We propose appending a marginal accelerated failure time model to identify risk factors of TTFT with a model for TTP where fertility treatment is included as a time-varying treatment to account for their dependence. The latter requires extending backwards recurrence survival methods to incorporate time-varying covariates with time-varying coefficients. Since backwards recurrence survival methods are a function of mean survival, computational difficulties arise in formulating mean survival when fertility treatment is unobserved, i.e. when TTFT is censored. We address these challenges by developing computationally friendly forms for the double expectation of TTP and TTFT. The performance is validated via comprehensive simulation studies. We apply our approach to the National Survey of Family Growth and explore factors related to prolonged TTFT in the U.S.

本文的目的是通过横断面样本，确定目前试图怀孕的妇女中与不孕症治疗时间（TTFT）相关的因素。由于怀孕时间（TTP）与 TTFT 之间存在依赖关系，因此存在挑战。我们建议采用边际加速失败时间模型来识别 TTFT 的风险因素，同时采用 TTP 模型，将生育治疗作为时变治疗纳入其中，以考虑两者的依赖性。后者需要扩展后向复现生存法，以纳入具有时变系数的时变协变量。由于后向递推生存率方法是平均生存率的函数，当生育治疗是非观测变量时，即 TTFT 是有删减的，在计算平均生存率时就会遇到困难。我们为 TTP 和 TTFT 的双重期望开发了便于计算的形式，从而解决了这些难题。我们通过全面的模拟研究对其性能进行了验证。我们将这一方法应用于全国家庭成长调查，并探讨了与美国 TTFT 延长相关的因素。

{"title":"Joint modelling of survival and backwards recurrence outcomes: an analysis of factors associated with fertility treatment in the U.S.","authors":"Siyuan Guo, Jiajia Zhang, Alexander C McLain","doi":"10.1093/jrsssc/qlae039","DOIUrl":"10.1093/jrsssc/qlae039","url":null,"abstract":"The motivation for this paper is to determine factors associated with time-to-fertility treatment (TTFT) among women currently attempting pregnancy in a cross-sectional sample. Challenges arise due to dependence between time-to-pregnancy (TTP) and TTFT. We propose appending a marginal accelerated failure time model to identify risk factors of TTFT with a model for TTP where fertility treatment is included as a time-varying treatment to account for their dependence. The latter requires extending backwards recurrence survival methods to incorporate time-varying covariates with time-varying coefficients. Since backwards recurrence survival methods are a function of mean survival, computational difficulties arise in formulating mean survival when fertility treatment is unobserved, i.e. when TTFT is censored. We address these challenges by developing computationally friendly forms for the double expectation of TTP and TTFT. The performance is validated via comprehensive simulation studies. We apply our approach to the National Survey of Family Growth and explore factors related to prolonged TTFT in the U.S.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1355-1369"},"PeriodicalIF":1.3,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561729/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Walking fingerprinting. 行走指纹识别

IF 1.3 4区数学 Q3 STATISTICS & PROBABILITY

Journal of the Royal Statistical Society Series C-Applied Statistics

Pub Date : 2024-07-29 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae033

Lily Koffman, Ciprian Crainiceanu, Andrew Leroux

We consider the problem of predicting an individual's identity from accelerometry data collected during walking. In a previous paper, we transformed the accelerometry time series into an image by constructing the joint distribution of the acceleration and lagged acceleration for a vector of lags. Predictors derived by partitioning this image into grid cells were used in logistic regression to predict individuals. Here, we (a) implement machine learning methods for prediction using the grid cell-derived predictors; (b) derive inferential methods to screen for the most predictive grid cells while adjusting for correlation and multiple comparisons; and (c) develop a novel multivariate functional regression model that avoids partitioning the predictor space. Prediction methods are compared on two open source acceleometry data sets collected from: (a) 32 individuals walking on a $1.06$ km path; and (b) six repetitions of walking on a 20 m path on two occasions at least 1 week apart for 153 study participants. In the 32-individual study, all methods achieve at least 95% rank-1 accuracy, while in the 153-individual study, accuracy varies from 41% to 98%, depending on the method and prediction task. Methods provide insights into why some individuals are easier to predict than others.

我们考虑的问题是从步行过程中收集的加速度数据预测个人身份。在之前的一篇论文中，我们通过构建加速度和滞后加速度向量的联合分布，将加速度时间序列转换为图像。通过将该图像划分为网格单元得出的预测因子被用于逻辑回归来预测个体。在这里，我们（a）使用网格单元衍生的预测因子实施机器学习方法进行预测；（b）推导出推论方法来筛选最具预测性的网格单元，同时调整相关性和多重比较；以及（c）开发一种新型多元函数回归模型，避免对预测因子空间进行分割。预测方法在两个开放源码的踝关节测量数据集上进行了比较，这些数据集收集自：(a) 32 人在 1.06 千米的路径上行走；(b) 153 名研究参与者在 20 米的路径上重复行走 6 次，两次行走至少相隔一周。在 32 人的研究中，所有方法都达到了至少 95% 的秩-1 准确率，而在 153 人的研究中，根据方法和预测任务的不同，准确率从 41% 到 98% 不等。这些方法让我们了解到为什么有些人比其他人更容易预测。

{"title":"Walking fingerprinting.","authors":"Lily Koffman, Ciprian Crainiceanu, Andrew Leroux","doi":"10.1093/jrsssc/qlae033","DOIUrl":"10.1093/jrsssc/qlae033","url":null,"abstract":"We consider the problem of predicting an individual's identity from accelerometry data collected during walking. In a previous paper, we transformed the accelerometry time series into an image by constructing the joint distribution of the acceleration and lagged acceleration for a vector of lags. Predictors derived by partitioning this image into grid cells were used in logistic regression to predict individuals. Here, we (a) implement machine learning methods for prediction using the grid cell-derived predictors; (b) derive inferential methods to screen for the most predictive grid cells while adjusting for correlation and multiple comparisons; and (c) develop a novel multivariate functional regression model that avoids partitioning the predictor space. Prediction methods are compared on two open source acceleometry data sets collected from: (a) 32 individuals walking on a <math><mn>1.06</mn></math> km path; and (b) six repetitions of walking on a 20 m path on two occasions at least 1 week apart for 153 study participants. In the 32-individual study, all methods achieve at least 95% rank-1 accuracy, while in the 153-individual study, accuracy varies from 41% to 98%, depending on the method and prediction task. Methods provide insights into why some individuals are easier to predict than others.","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1221-1241"},"PeriodicalIF":1.3,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561731/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142650551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0