首页 > 最新文献

Journal of the Royal Statistical Society Series C-Applied Statistics最新文献

英文 中文
tdCoxSNN: Time-dependent Cox survival neural network for continuous-time dynamic prediction.
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-10-11 eCollection Date: 2025-01-01 DOI: 10.1093/jrsssc/qlae051
Lang Zeng, Jipeng Zhang, Wei Chen, Ying Ding

The aim of dynamic prediction is to provide individualized risk predictions over time, which are updated as new data become available. In pursuit of constructing a dynamic prediction model for a progressive eye disorder, age-related macular degeneration (AMD), we propose a time-dependent Cox survival neural network (tdCoxSNN) to predict its progression using longitudinal fundus images. tdCoxSNN builds upon the time-dependent Cox model by utilizing a neural network to capture the nonlinear effect of time-dependent covariates on the survival outcome. Moreover, by concurrently integrating a convolutional neural network with the survival network, tdCoxSNN can directly take longitudinal images as input. We evaluate and compare our proposed method with joint modelling and landmarking approaches through extensive simulations. We applied the proposed approach to two real datasets. One is a large AMD study, the Age-Related Eye Disease Study, in which more than 50,000 fundus images were captured over a period of 12 years for more than 4,000 participants. Another is a public dataset of the primary biliary cirrhosis disease, where multiple laboratory tests were longitudinally collected to predict the time-to-liver transplant. Our approach demonstrates commendable predictive performance in both simulation studies and the analysis of the two real datasets.

{"title":"tdCoxSNN: Time-dependent Cox survival neural network for continuous-time dynamic prediction.","authors":"Lang Zeng, Jipeng Zhang, Wei Chen, Ying Ding","doi":"10.1093/jrsssc/qlae051","DOIUrl":"10.1093/jrsssc/qlae051","url":null,"abstract":"<p><p>The aim of dynamic prediction is to provide individualized risk predictions over time, which are updated as new data become available. In pursuit of constructing a dynamic prediction model for a progressive eye disorder, age-related macular degeneration (AMD), we propose a time-dependent Cox survival neural network (tdCoxSNN) to predict its progression using longitudinal fundus images. tdCoxSNN builds upon the time-dependent Cox model by utilizing a neural network to capture the nonlinear effect of time-dependent covariates on the survival outcome. Moreover, by concurrently integrating a convolutional neural network with the survival network, tdCoxSNN can directly take longitudinal images as input. We evaluate and compare our proposed method with joint modelling and landmarking approaches through extensive simulations. We applied the proposed approach to two real datasets. One is a large AMD study, the Age-Related Eye Disease Study, in which more than 50,000 fundus images were captured over a period of 12 years for more than 4,000 participants. Another is a public dataset of the primary biliary cirrhosis disease, where multiple laboratory tests were longitudinally collected to predict the time-to-liver transplant. Our approach demonstrates commendable predictive performance in both simulation studies and the analysis of the two real datasets.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 1","pages":"187-203"},"PeriodicalIF":1.0,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring the impact of new risk factors within survival models.
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-09-03 eCollection Date: 2025-01-01 DOI: 10.1093/jrsssc/qlae045
Glenn Heller, Sean M Devlin

Survival is poor for patients with metastatic cancer, and it is vital to examine new biomarkers that can improve patient prognostication and identify those who would benefit from more aggressive therapy. In metastatic prostate cancer, 2 new assays have become available: one that quantifies the number of cancer cells circulating in the peripheral blood, and the other a marker of the aggressiveness of the disease. It is critical to determine the magnitude of the effect of these biomarkers on the discrimination of a model-based risk score. To do so, most analysts frequently consider the discrimination of 2 separate survival models: one that includes both the new and standard factors and a second that includes the standard factors alone. However, this analysis is ultimately incorrect for many of the scale-transformation models ubiquitous in survival, as the reduced model is misspecified if the full model is specified correctly. To circumvent this issue, we developed a projection-based approach to estimate the impact of the 2 prostate cancer biomarkers. The results indicate that the new biomarkers can influence model discrimination and justify their inclusion in the risk model; however, the hunt remains for an applicable model to risk-stratify patients with metastatic prostate cancer.

{"title":"Measuring the impact of new risk factors within survival models.","authors":"Glenn Heller, Sean M Devlin","doi":"10.1093/jrsssc/qlae045","DOIUrl":"10.1093/jrsssc/qlae045","url":null,"abstract":"<p><p>Survival is poor for patients with metastatic cancer, and it is vital to examine new biomarkers that can improve patient prognostication and identify those who would benefit from more aggressive therapy. In metastatic prostate cancer, 2 new assays have become available: one that quantifies the number of cancer cells circulating in the peripheral blood, and the other a marker of the aggressiveness of the disease. It is critical to determine the magnitude of the effect of these biomarkers on the discrimination of a model-based risk score. To do so, most analysts frequently consider the discrimination of 2 separate survival models: one that includes both the new and standard factors and a second that includes the standard factors alone. However, this analysis is ultimately incorrect for many of the scale-transformation models ubiquitous in survival, as the reduced model is misspecified if the full model is specified correctly. To circumvent this issue, we developed a projection-based approach to estimate the impact of the 2 prostate cancer biomarkers. The results indicate that the new biomarkers can influence model discrimination and justify their inclusion in the risk model; however, the hunt remains for an applicable model to risk-stratify patients with metastatic prostate cancer.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"74 1","pages":"83-99"},"PeriodicalIF":1.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11725343/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142980653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Non-parametric Bayesian approach to multiple treatment comparisons in network meta-analysis with application to comparisons of anti-depressants. 网络荟萃分析中多重治疗比较的非参数贝叶斯方法,应用于抗抑郁药的比较。
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-09-02 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae038
Andrés F Barrientos, Garritt L Page, Lifeng Lin

Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issues, and rarely permit possible ties of treatments with similar performance. These issues make interpreting rankings problematic as they are often treated as absolute metrics. To address these shortcomings, we formulate a ranking strategy that adapts to scenarios with high-order uncertainty by producing more conservative results. This improves the interpretability while simultaneously accounting for multiple comparisons. To admit ties between treatment effects in cases where differences between treatment effects are negligible, we also develop a Bayesian non-parametric approach for network meta-analysis. The approach capitalizes on the induced clustering mechanism of Bayesian non-parametric methods, producing a positive probability that two treatment effects are equal. We demonstrate the utility of the procedure through numerical experiments and a network meta-analysis designed to study antidepressant treatments.

网络荟萃分析是一种强大的工具,可以综合来自独立研究的证据,同时比较多种治疗方法。进行网络荟萃分析的一项关键任务是提供针对特定疾病结果的所有可用治疗方案的排名。通常情况下,估算出的治疗排名存在很大的不确定性,存在多重性问题,而且很少允许性能相似的治疗方案并列。这些问题使得排名的解释成为问题,因为它们通常被视为绝对指标。为了解决这些缺陷,我们制定了一种排名策略,通过产生更保守的结果来适应具有高阶不确定性的情景。这在提高可解释性的同时,也考虑到了多重比较。为了在治疗效果之间的差异可以忽略不计的情况下承认治疗效果之间的联系,我们还开发了一种用于网络荟萃分析的贝叶斯非参数方法。该方法利用了贝叶斯非参数方法的诱导聚类机制,产生了两个治疗效果相等的正概率。我们通过数值实验和一项旨在研究抗抑郁治疗的网络荟萃分析,展示了该方法的实用性。
{"title":"Non-parametric Bayesian approach to multiple treatment comparisons in network meta-analysis with application to comparisons of anti-depressants.","authors":"Andrés F Barrientos, Garritt L Page, Lifeng Lin","doi":"10.1093/jrsssc/qlae038","DOIUrl":"10.1093/jrsssc/qlae038","url":null,"abstract":"<p><p>Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issues, and rarely permit possible ties of treatments with similar performance. These issues make interpreting rankings problematic as they are often treated as absolute metrics. To address these shortcomings, we formulate a ranking strategy that adapts to scenarios with high-order uncertainty by producing more conservative results. This improves the interpretability while simultaneously accounting for multiple comparisons. To admit ties between treatment effects in cases where differences between treatment effects are negligible, we also develop a Bayesian non-parametric approach for network meta-analysis. The approach capitalizes on the induced clustering mechanism of Bayesian non-parametric methods, producing a positive probability that two treatment effects are equal. We demonstrate the utility of the procedure through numerical experiments and a network meta-analysis designed to study antidepressant treatments.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1333-1354"},"PeriodicalIF":1.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint modelling of survival and backwards recurrence outcomes: an analysis of factors associated with fertility treatment in the U.S. 存活率和逆向复发结果的联合建模:美国生育治疗相关因素分析
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-08-19 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae039
Siyuan Guo, Jiajia Zhang, Alexander C McLain

The motivation for this paper is to determine factors associated with time-to-fertility treatment (TTFT) among women currently attempting pregnancy in a cross-sectional sample. Challenges arise due to dependence between time-to-pregnancy (TTP) and TTFT. We propose appending a marginal accelerated failure time model to identify risk factors of TTFT with a model for TTP where fertility treatment is included as a time-varying treatment to account for their dependence. The latter requires extending backwards recurrence survival methods to incorporate time-varying covariates with time-varying coefficients. Since backwards recurrence survival methods are a function of mean survival, computational difficulties arise in formulating mean survival when fertility treatment is unobserved, i.e. when TTFT is censored. We address these challenges by developing computationally friendly forms for the double expectation of TTP and TTFT. The performance is validated via comprehensive simulation studies. We apply our approach to the National Survey of Family Growth and explore factors related to prolonged TTFT in the U.S.

本文的目的是通过横断面样本,确定目前试图怀孕的妇女中与不孕症治疗时间(TTFT)相关的因素。由于怀孕时间(TTP)与 TTFT 之间存在依赖关系,因此存在挑战。我们建议采用边际加速失败时间模型来识别 TTFT 的风险因素,同时采用 TTP 模型,将生育治疗作为时变治疗纳入其中,以考虑两者的依赖性。后者需要扩展后向复现生存法,以纳入具有时变系数的时变协变量。由于后向递推生存率方法是平均生存率的函数,当生育治疗是非观测变量时,即 TTFT 是有删减的,在计算平均生存率时就会遇到困难。我们为 TTP 和 TTFT 的双重期望开发了便于计算的形式,从而解决了这些难题。我们通过全面的模拟研究对其性能进行了验证。我们将这一方法应用于全国家庭成长调查,并探讨了与美国 TTFT 延长相关的因素。
{"title":"Joint modelling of survival and backwards recurrence outcomes: an analysis of factors associated with fertility treatment in the U.S.","authors":"Siyuan Guo, Jiajia Zhang, Alexander C McLain","doi":"10.1093/jrsssc/qlae039","DOIUrl":"10.1093/jrsssc/qlae039","url":null,"abstract":"<p><p>The motivation for this paper is to determine factors associated with time-to-fertility treatment (TTFT) among women currently attempting pregnancy in a cross-sectional sample. Challenges arise due to dependence between time-to-pregnancy (TTP) and TTFT. We propose appending a marginal accelerated failure time model to identify risk factors of TTFT with a model for TTP where fertility treatment is included as a time-varying treatment to account for their dependence. The latter requires extending backwards recurrence survival methods to incorporate time-varying covariates with time-varying coefficients. Since backwards recurrence survival methods are a function of mean survival, computational difficulties arise in formulating mean survival when fertility treatment is unobserved, i.e. when TTFT is censored. We address these challenges by developing computationally friendly forms for the double expectation of TTP and TTFT. The performance is validated via comprehensive simulation studies. We apply our approach to the National Survey of Family Growth and explore factors related to prolonged TTFT in the U.S.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1355-1369"},"PeriodicalIF":1.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561729/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Walking fingerprinting. 行走指纹识别
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-29 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae033
Lily Koffman, Ciprian Crainiceanu, Andrew Leroux

We consider the problem of predicting an individual's identity from accelerometry data collected during walking. In a previous paper, we transformed the accelerometry time series into an image by constructing the joint distribution of the acceleration and lagged acceleration for a vector of lags. Predictors derived by partitioning this image into grid cells were used in logistic regression to predict individuals. Here, we (a) implement machine learning methods for prediction using the grid cell-derived predictors; (b) derive inferential methods to screen for the most predictive grid cells while adjusting for correlation and multiple comparisons; and (c) develop a novel multivariate functional regression model that avoids partitioning the predictor space. Prediction methods are compared on two open source acceleometry data sets collected from: (a) 32 individuals walking on a 1.06 km path; and (b) six repetitions of walking on a 20 m path on two occasions at least 1 week apart for 153 study participants. In the 32-individual study, all methods achieve at least 95% rank-1 accuracy, while in the 153-individual study, accuracy varies from 41% to 98%, depending on the method and prediction task. Methods provide insights into why some individuals are easier to predict than others.

我们考虑的问题是从步行过程中收集的加速度数据预测个人身份。在之前的一篇论文中,我们通过构建加速度和滞后加速度向量的联合分布,将加速度时间序列转换为图像。通过将该图像划分为网格单元得出的预测因子被用于逻辑回归来预测个体。在这里,我们(a)使用网格单元衍生的预测因子实施机器学习方法进行预测;(b)推导出推论方法来筛选最具预测性的网格单元,同时调整相关性和多重比较;以及(c)开发一种新型多元函数回归模型,避免对预测因子空间进行分割。预测方法在两个开放源码的踝关节测量数据集上进行了比较,这些数据集收集自:(a) 32 人在 1.06 千米的路径上行走;(b) 153 名研究参与者在 20 米的路径上重复行走 6 次,两次行走至少相隔一周。在 32 人的研究中,所有方法都达到了至少 95% 的秩-1 准确率,而在 153 人的研究中,根据方法和预测任务的不同,准确率从 41% 到 98% 不等。这些方法让我们了解到为什么有些人比其他人更容易预测。
{"title":"Walking fingerprinting.","authors":"Lily Koffman, Ciprian Crainiceanu, Andrew Leroux","doi":"10.1093/jrsssc/qlae033","DOIUrl":"10.1093/jrsssc/qlae033","url":null,"abstract":"<p><p>We consider the problem of predicting an individual's identity from accelerometry data collected during walking. In a previous paper, we transformed the accelerometry time series into an image by constructing the joint distribution of the acceleration and lagged acceleration for a vector of lags. Predictors derived by partitioning this image into grid cells were used in logistic regression to predict individuals. Here, we (a) implement machine learning methods for prediction using the grid cell-derived predictors; (b) derive inferential methods to screen for the most predictive grid cells while adjusting for correlation and multiple comparisons; and (c) develop a novel multivariate functional regression model that avoids partitioning the predictor space. Prediction methods are compared on two open source acceleometry data sets collected from: (a) 32 individuals walking on a <math><mn>1.06</mn></math> km path; and (b) six repetitions of walking on a 20 m path on two occasions at least 1 week apart for 153 study participants. In the 32-individual study, all methods achieve at least 95% rank-1 accuracy, while in the 153-individual study, accuracy varies from 41% to 98%, depending on the method and prediction task. Methods provide insights into why some individuals are easier to predict than others.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1221-1241"},"PeriodicalIF":1.0,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561731/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142650551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating spatially varying health effects of wildland fire smoke using mobile health data. 利用移动健康数据估算野外火灾烟雾对健康的空间影响。
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-07-16 eCollection Date: 2024-11-01 DOI: 10.1093/jrsssc/qlae034
Lili Wu, Chenyin Gao, Shu Yang, Brian J Reich, Ana G Rappold

Wildland fire smoke exposures are an increasing threat to public health, highlighting the need for studying the effects of protective behaviours on reducing health outcomes. Emerging smartphone applications provide unprecedented opportunities to deliver health risk communication messages to a large number of individuals in real-time and subsequently study the effectiveness, but also pose methodological challenges. Smoke Sense, a citizen science project, provides an interactive smartphone app platform for participants to engage with information about air quality, and ways to record their own health symptoms and actions taken to reduce smoke exposure. We propose a doubly robust estimator of the structural nested mean model that accounts for spatially and time-varying effects via a local estimating equation approach with geographical kernel weighting. Moreover, our analytical framework also handles informative missingness by inverse probability weighting of estimating functions. We evaluate the method using extensive simulation studies and apply it to Smoke Sense data to increase the knowledge base about the relationship between health preventive measures and health-related outcomes. Our results show that the protective behaviours' effects vary over space and time and find that protective behaviours have more significant effects on reducing health symptoms in the Southwest than the Northwest region of the U.S.

野外火灾烟雾暴露对公众健康的威胁与日俱增,这凸显了研究防护行为对减少健康后果影响的必要性。新兴的智能手机应用提供了前所未有的机会,可以实时向大量个人传递健康风险交流信息,并随后研究其效果,但同时也带来了方法上的挑战。Smoke Sense 是一个公民科学项目,它为参与者提供了一个交互式智能手机应用平台,让他们了解空气质量信息,并记录自己的健康症状和为减少烟雾暴露而采取的行动。我们提出了结构嵌套均值模型的双重稳健估计方法,该方法通过具有地理核加权的局部估计方程方法考虑了空间和时间变化效应。此外,我们的分析框架还通过对估计函数进行反概率加权来处理信息缺失问题。我们通过大量模拟研究对该方法进行了评估,并将其应用于 Smoke Sense 数据,以增加有关健康预防措施与健康相关结果之间关系的知识库。我们的结果表明,保护性行为的效果随时间和空间而变化,并发现保护性行为对减少美国西南部地区的健康症状的效果比西北部地区更显著。
{"title":"Estimating spatially varying health effects of wildland fire smoke using mobile health data.","authors":"Lili Wu, Chenyin Gao, Shu Yang, Brian J Reich, Ana G Rappold","doi":"10.1093/jrsssc/qlae034","DOIUrl":"10.1093/jrsssc/qlae034","url":null,"abstract":"<p><p>Wildland fire smoke exposures are an increasing threat to public health, highlighting the need for studying the effects of protective behaviours on reducing health outcomes. Emerging smartphone applications provide unprecedented opportunities to deliver health risk communication messages to a large number of individuals in real-time and subsequently study the effectiveness, but also pose methodological challenges. Smoke Sense, a citizen science project, provides an interactive smartphone app platform for participants to engage with information about air quality, and ways to record their own health symptoms and actions taken to reduce smoke exposure. We propose a doubly robust estimator of the structural nested mean model that accounts for spatially and time-varying effects via a local estimating equation approach with geographical kernel weighting. Moreover, our analytical framework also handles informative missingness by inverse probability weighting of estimating functions. We evaluate the method using extensive simulation studies and apply it to Smoke Sense data to increase the knowledge base about the relationship between health preventive measures and health-related outcomes. Our results show that the protective behaviours' effects vary over space and time and find that protective behaviours have more significant effects on reducing health symptoms in the Southwest than the Northwest region of the U.S.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 5","pages":"1242-1261"},"PeriodicalIF":1.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561730/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inverse set estimation and inversion of simultaneous confidence intervals. 反集估计和同时置信区间反演。
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-05-31 eCollection Date: 2024-08-01 DOI: 10.1093/jrsssc/qlae027
Junting Ren, Fabian J E Telschow, Armin Schwartzman

Motivated by the questions of risk assessment in climatology (temperature change in North America) and medicine (impact of statin usage and coronavirus disease 2019 on hospitalized patients), we address the problem of estimating the set in the domain of a function whose image equals a predefined subset of the real line. Existing methods require strict assumptions. We generalize the estimation of such sets to dense and nondense domains with protection against inflated Type I error in exploratory data analysis. This is achieved by proving that confidence sets of multiple upper, lower, or interval sets can be simultaneously constructed with the desired confidence nonasymptotically through inverting simultaneous confidence intervals. Nonparametric bootstrap algorithm and code are provided.

受气候学(北美气温变化)和医学(他汀类药物的使用和 2019 年冠状病毒疾病对住院病人的影响)中风险评估问题的启发,我们解决了估计函数域中的集合的问题,该函数的图像等于实线的预定义子集。现有方法需要严格的假设条件。我们将此类集合的估计方法推广到稠密域和非稠密域,并在探索性数据分析中防止 I 类错误的扩大。为此,我们证明了多个上集、下集或区间集的置信度集可以通过同时倒置置信区间,以非渐近的方式同时构建出所需的置信度。提供了非参数引导算法和代码。
{"title":"Inverse set estimation and inversion of simultaneous confidence intervals.","authors":"Junting Ren, Fabian J E Telschow, Armin Schwartzman","doi":"10.1093/jrsssc/qlae027","DOIUrl":"10.1093/jrsssc/qlae027","url":null,"abstract":"<p><p>Motivated by the questions of risk assessment in climatology (temperature change in North America) and medicine (impact of statin usage and coronavirus disease 2019 on hospitalized patients), we address the problem of estimating the set in the domain of a function whose image equals a predefined subset of the real line. Existing methods require strict assumptions. We generalize the estimation of such sets to dense and nondense domains with protection against inflated Type I error in exploratory data analysis. This is achieved by proving that confidence sets of multiple upper, lower, or interval sets can be simultaneously constructed with the desired confidence nonasymptotically through inverting simultaneous confidence intervals. Nonparametric bootstrap algorithm and code are provided.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 4","pages":"1082-1109"},"PeriodicalIF":1.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11321826/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141983698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Population-level task-evoked functional connectivity via Fourier analysis. 通过傅立叶分析实现群体级任务诱发功能连接。
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-03-14 eCollection Date: 2024-08-01 DOI: 10.1093/jrsssc/qlae015
Kun Meng, Ani Eloyan

Functional magnetic resonance imaging (fMRI) is a noninvasive and in-vivo imaging technique essential for measuring brain activity. Functional connectivity is used to study associations between brain regions, either while study subjects perform tasks or during periods of rest. In this paper, we propose a rigorous definition of task-evoked functional connectivity at the population level (ptFC). Importantly, our proposed ptFC is interpretable in the context of task-fMRI studies. An algorithm for estimating the ptFC is provided. We present the performance of the proposed algorithm compared to existing functional connectivity frameworks using simulations. Lastly, we apply the proposed algorithm to estimate the ptFC in a motor-task study from the Human Connectome Project.

功能磁共振成像(fMRI)是一种无创的体内成像技术,对测量大脑活动至关重要。功能连通性可用于研究大脑区域之间的关联,研究对象可在执行任务时或休息时进行研究。在本文中,我们提出了任务诱发的群体水平功能连通性(ptFC)的严格定义。重要的是,我们提出的ptFC可以在任务-MRI研究中进行解释。我们还提供了一种估算 ptFC 的算法。我们通过模拟展示了所提算法与现有功能连通性框架的性能比较。最后,我们在人类连接组计划的一项运动任务研究中应用了所提出的算法来估计ptFC。
{"title":"Population-level task-evoked functional connectivity via Fourier analysis.","authors":"Kun Meng, Ani Eloyan","doi":"10.1093/jrsssc/qlae015","DOIUrl":"10.1093/jrsssc/qlae015","url":null,"abstract":"<p><p>Functional magnetic resonance imaging (fMRI) is a noninvasive and in-vivo imaging technique essential for measuring brain activity. Functional connectivity is used to study associations between brain regions, either while study subjects perform tasks or during periods of rest. In this paper, we propose a rigorous definition of task-evoked functional connectivity at the population level (ptFC). Importantly, our proposed ptFC is interpretable in the context of task-fMRI studies. An algorithm for estimating the ptFC is provided. We present the performance of the proposed algorithm compared to existing functional connectivity frameworks using simulations. Lastly, we apply the proposed algorithm to estimate the ptFC in a motor-task study from the Human Connectome Project.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 4","pages":"857-879"},"PeriodicalIF":1.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11321825/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141983699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing unit root non-stationarity in the presence of missing data in univariate time series of mobile health studies. 测试移动健康研究单变量时间序列中存在缺失数据时的单位根非平稳性。
IF 1.6 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-02-29 eCollection Date: 2024-06-01 DOI: 10.1093/jrsssc/qlae010
Charlotte Fowler, Xiaoxuan Cai, Justin T Baker, Jukka-Pekka Onnela, Linda Valeri

The use of digital devices to collect data in mobile health studies introduces a novel application of time series methods, with the constraint of potential data missing at random or missing not at random (MNAR). In time-series analysis, testing for stationarity is an important preliminary step to inform appropriate subsequent analyses. The Dickey-Fuller test evaluates the null hypothesis of unit root non-stationarity, under no missing data. Beyond recommendations under data missing completely at random for complete case analysis or last observation carry forward imputation, researchers have not extended unit root non-stationarity testing to more complex missing data mechanisms. Multiple imputation with chained equations, Kalman smoothing imputation, and linear interpolation have also been used for time-series data, however such methods impose constraints on the autocorrelation structure and impact unit root testing. We propose maximum likelihood estimation and multiple imputation using state space model approaches to adapt the augmented Dickey-Fuller test to a context with missing data. We further develop sensitivity analyses to examine the impact of MNAR data. We evaluate the performance of existing and proposed methods across missing mechanisms in extensive simulations and in their application to a multi-year smartphone study of bipolar patients.

在移动健康研究中,使用数字设备收集数据为时间序列方法带来了一种新的应用,即潜在的随机或非随机数据缺失(MNAR)。在时间序列分析中,静态检验是一个重要的初步步骤,可为适当的后续分析提供依据。Dickey-Fuller 检验是在无缺失数据的情况下,对单位根非平稳性的零假设进行评估。除了针对完整病例分析或最后观察结转估算的完全随机缺失数据提出建议外,研究人员还没有将单位根非平稳性检验扩展到更复杂的缺失数据机制。链式方程多重归因、卡尔曼平滑归因和线性插值也被用于时间序列数据,但这些方法对自相关结构施加了限制,影响了单位根检验。我们提出了使用状态空间模型方法进行最大似然估计和多重估算的方法,以将增强的 Dickey-Fuller 检验调整到有缺失数据的情况下。我们进一步开发了敏感性分析,以检验 MNAR 数据的影响。我们通过大量模拟,并将其应用于一项针对双相情感障碍患者的多年期智能手机研究中,评估了现有方法和拟议方法在不同缺失机制下的性能。
{"title":"Testing unit root non-stationarity in the presence of missing data in univariate time series of mobile health studies.","authors":"Charlotte Fowler, Xiaoxuan Cai, Justin T Baker, Jukka-Pekka Onnela, Linda Valeri","doi":"10.1093/jrsssc/qlae010","DOIUrl":"10.1093/jrsssc/qlae010","url":null,"abstract":"<p><p>The use of digital devices to collect data in mobile health studies introduces a novel application of time series methods, with the constraint of potential data missing at random or missing not at random (MNAR). In time-series analysis, testing for stationarity is an important preliminary step to inform appropriate subsequent analyses. The Dickey-Fuller test evaluates the null hypothesis of unit root non-stationarity, under no missing data. Beyond recommendations under data missing completely at random for complete case analysis or last observation carry forward imputation, researchers have not extended unit root non-stationarity testing to more complex missing data mechanisms. Multiple imputation with chained equations, Kalman smoothing imputation, and linear interpolation have also been used for time-series data, however such methods impose constraints on the autocorrelation structure and impact unit root testing. We propose maximum likelihood estimation and multiple imputation using state space model approaches to adapt the augmented Dickey-Fuller test to a context with missing data. We further develop sensitivity analyses to examine the impact of MNAR data. We evaluate the performance of existing and proposed methods across missing mechanisms in extensive simulations and in their application to a multi-year smartphone study of bipolar patients.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 3","pages":"755-773"},"PeriodicalIF":1.6,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11175825/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141332377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revisiting the effects of maternal education on adolescents' academic performance: Doubly robust estimation in a network-based observational study. 重新审视母亲教育对青少年学习成绩的影响:基于网络的观察研究中的双稳健估计。
IF 1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2024-02-13 eCollection Date: 2024-06-01 DOI: 10.1093/jrsssc/qlae008
Vanessa McNealis, Erica E M Moodie, Nema Dean

In many contexts, particularly when study subjects are adolescents, peer effects can invalidate typical statistical requirements in the data. For instance, it is plausible that a student's academic performance is influenced both by their own mother's educational level as well as that of their peers. Since the underlying social network is measured, the Add Health study provides a unique opportunity to examine the impact of maternal college education on adolescent school performance, both direct and indirect. However, causal inference on populations embedded in social networks poses technical challenges, since the typical no interference assumption no longer holds. While inverse probability-of-treatment weighted (IPW) estimators have been developed for this setting, they are often highly unstable. Motivated by the question of maternal education, we propose doubly robust (DR) estimators combining models for treatment and outcome that are consistent and asymptotically normal if either model is correctly specified. We present empirical results that illustrate the DR property and the efficiency gain of DR over IPW estimators even when the treatment model is misspecified. Contrary to previous studies, our robust analysis does not provide evidence of an indirect effect of maternal education on academic performance within adolescents' social circles in Add Health.

在很多情况下,特别是当研究对象是青少年时,同伴效应会使数据中典型的统计要求失效。例如,学生的学业成绩可能既受其母亲教育水平的影响,也受其同伴教育水平的影响。由于对基本社会网络进行了测量,"Add Health "研究提供了一个独特的机会来研究母亲的大学教育对青少年学习成绩的直接和间接影响。然而,由于典型的无干扰假设不再成立,因此对嵌入社会网络的人群进行因果推断面临技术挑战。虽然针对这种情况已经开发出了治疗概率反向加权(IPW)估算器,但这些估算器往往非常不稳定。受孕产妇教育问题的启发,我们提出了结合治疗模型和结果模型的双重稳健(DR)估计器,如果其中任何一个模型指定正确,这些估计器都是一致和渐近正常的。我们提出的实证结果表明了 DR 特性以及 DR 相对于 IPW 估计器的效率增益,即使在处理模型被错误指定的情况下也是如此。与以往的研究相反,我们的稳健分析没有提供证据表明,在 Add Health 的青少年社交圈中,母亲教育对学习成绩有间接影响。
{"title":"Revisiting the effects of maternal education on adolescents' academic performance: Doubly robust estimation in a network-based observational study.","authors":"Vanessa McNealis, Erica E M Moodie, Nema Dean","doi":"10.1093/jrsssc/qlae008","DOIUrl":"10.1093/jrsssc/qlae008","url":null,"abstract":"<p><p>In many contexts, particularly when study subjects are adolescents, peer effects can invalidate typical statistical requirements in the data. For instance, it is plausible that a student's academic performance is influenced both by their own mother's educational level as well as that of their peers. Since the underlying social network is measured, the Add Health study provides a unique opportunity to examine the impact of maternal college education on adolescent school performance, both direct and indirect. However, causal inference on populations embedded in social networks poses technical challenges, since the typical no interference assumption no longer holds. While inverse probability-of-treatment weighted (IPW) estimators have been developed for this setting, they are often highly unstable. Motivated by the question of maternal education, we propose doubly robust (DR) estimators combining models for treatment and outcome that are consistent and asymptotically normal if either model is correctly specified. We present empirical results that illustrate the DR property and the efficiency gain of DR over IPW estimators even when the treatment model is misspecified. Contrary to previous studies, our robust analysis does not provide evidence of an indirect effect of maternal education on academic performance within adolescents' social circles in Add Health.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"73 3","pages":"715-734"},"PeriodicalIF":1.0,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11175826/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141332376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Royal Statistical Society Series C-Applied Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1