首页 > 最新文献

Annals of Applied Statistics最新文献

英文 中文
CONTRASTIVE LINEAR REGRESSION. 对比线性回归。
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-01 Epub Date: 2025-08-28 DOI: 10.1214/24-aoas1977
Boyang Zhang, Sarah Nyquist, Andrew Jones, Barbara E Engelhardt, Didong Li

Contrastive dimension reduction methods have been developed for case-control study data to identify variation that is enriched in the foreground (case) data X relative to the background (control) data Y . Here we develop contrastive regression for the setting where there is a response variable r associated with each foreground observation. This situation occurs frequently when, for example, the unaffected controls do not have a disease grade or intervention dosage, but the affected cases have a disease grade or intervention dosage, as in autism severity, solid tumors stages, polyp sizes, or warfarin dosages. Our contrastive regression model captures shared low-dimensional variation between the predictors in the case and control groups and then explains the case-specific response variables through the variance that remains in the predictors after shared variation is removed. We show that, in one single-cell RNA sequencing dataset on cellular differentiation in chronic rhinosinusitis with and without nasal polyps and in another single-nucleus RNA sequencing dataset on autism severity in postmortem brain samples from donors with and without autism, our contrastive linear regression performs feature ranking and identifies biologically-informative predictors associated with response that cannot be identified using other approaches.

针对病例对照研究数据,已经开发了对比降维方法,以识别前景(病例)数据X相对于背景(对照)数据Y中丰富的变化。在这里,我们开发了对比回归的设置,其中有一个响应变量r与每个前景观测相关联。这种情况经常发生,例如,未受影响的对照组没有疾病等级或干预剂量,但受影响的病例有疾病等级或干预剂量,如自闭症严重程度、实体瘤分期、息肉大小或华法林剂量。我们的对比回归模型捕获了病例组和对照组中预测因子之间共有的低维变异,然后通过去除共有变异后预测因子中保留的变异来解释特定病例的响应变量。我们的研究表明,在一个单细胞RNA测序数据集中,慢性鼻窦炎伴鼻息肉和不伴鼻息肉的细胞分化,以及在另一个单细胞RNA测序数据集中,来自有或没有自闭症的捐赠者的死后脑样本中自闭症严重程度,我们的对比线性回归进行了特征排序,并确定了与反应相关的生物学信息预测因子,这些预测因子无法用其他方法识别。
{"title":"CONTRASTIVE LINEAR REGRESSION.","authors":"Boyang Zhang, Sarah Nyquist, Andrew Jones, Barbara E Engelhardt, Didong Li","doi":"10.1214/24-aoas1977","DOIUrl":"10.1214/24-aoas1977","url":null,"abstract":"<p><p>Contrastive dimension reduction methods have been developed for case-control study data to identify variation that is enriched in the foreground (case) data <math><mi>X</mi></math> relative to the background (control) data <math><mi>Y</mi></math> . Here we develop contrastive regression for the setting where there is a response variable <math><mi>r</mi></math> associated with each foreground observation. This situation occurs frequently when, for example, the unaffected controls do not have a disease grade or intervention dosage, but the affected cases have a disease grade or intervention dosage, as in autism severity, solid tumors stages, polyp sizes, or warfarin dosages. Our contrastive regression model captures shared low-dimensional variation between the predictors in the case and control groups and then explains the case-specific response variables through the variance that remains in the predictors after shared variation is removed. We show that, in one single-cell RNA sequencing dataset on cellular differentiation in chronic rhinosinusitis with and without nasal polyps and in another single-nucleus RNA sequencing dataset on autism severity in postmortem brain samples from donors with and without autism, our contrastive linear regression performs feature ranking and identifies biologically-informative predictors associated with response that cannot be identified using other approaches.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 3","pages":"1868-1883"},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12692120/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NETWORK-BASED MODELING OF EMOTIONAL EXPRESSIONS FOR MULTIPLE CANCERS VIA A LINGUISTIC ANALYSIS OF AN ONLINE HEALTH COMMUNITY. 通过对在线健康社区的语言分析,对多种癌症的情感表达进行基于网络的建模。
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-01 Epub Date: 2025-08-28 DOI: 10.1214/25-aoas2047
Xinyan Fan, Mengque Liu, Shuangge Ma

The diagnosis and treatment of cancer can evoke a variety of adverse emotions. Online health communities (OHCs) provide a safe platform for cancer patients and those closely related to express emotions without fear of judgement or stigma. In the literature, linguistic analysis of OHCs is usually limited to a single disease and based on methods with various technical limitations. In this article we analyze posts from September 2003 to September 2022 on eight cancers that are publicly available at the American Cancer Society's Cancer Survivors Network (CSN). We propose a novel network analysis technique based on low-rank matrices. The proposed approach decomposes the emotional expression semantic networks into an across-cancer time-independent component (which describes the "baseline" that is shared by multiple cancers), a cancer-specific time-independent component (which describes cancer-specific properties), and an across-cancer time-dependent component (which accommodates temporal effects on multiple cancer communities). For the second and third components, respectively, we consider a novel clustering structure and a change point structure. A penalization approach is developed, and its theoretical and computational properties are carefully established. The analysis of the CSN data leads to sensible networks and deeper insights into emotions for cancer overall and specific cancer types.

癌症的诊断和治疗可引起各种不良情绪。在线卫生社区(OHCs)为癌症患者和那些与表达情绪密切相关的人提供了一个安全的平台,而不必担心被评判或污名化。在文献中,OHCs的语言分析通常仅限于单一疾病,并且基于具有各种技术限制的方法。在这篇文章中,我们分析了从2003年9月到2022年9月在美国癌症协会癌症幸存者网络(CSN)上公开的八种癌症的帖子。提出了一种基于低秩矩阵的网络分析方法。提出的方法将情感表达语义网络分解为跨癌症时间独立组件(描述多种癌症共享的“基线”),癌症特异性时间独立组件(描述癌症特异性属性)和跨癌症时间依赖组件(适应对多种癌症社区的时间影响)。对于第二部分和第三部分,我们分别考虑了一种新的聚类结构和变化点结构。提出了一种惩罚方法,并详细建立了其理论和计算性质。通过对CSN数据的分析,我们可以建立合理的网络,并更深入地了解癌症整体和特定癌症类型的情绪。
{"title":"NETWORK-BASED MODELING OF EMOTIONAL EXPRESSIONS FOR MULTIPLE CANCERS VIA A LINGUISTIC ANALYSIS OF AN ONLINE HEALTH COMMUNITY.","authors":"Xinyan Fan, Mengque Liu, Shuangge Ma","doi":"10.1214/25-aoas2047","DOIUrl":"10.1214/25-aoas2047","url":null,"abstract":"<p><p>The diagnosis and treatment of cancer can evoke a variety of adverse emotions. Online health communities (OHCs) provide a safe platform for cancer patients and those closely related to express emotions without fear of judgement or stigma. In the literature, linguistic analysis of OHCs is usually limited to a single disease and based on methods with various technical limitations. In this article we analyze posts from September 2003 to September 2022 on eight cancers that are publicly available at the American Cancer Society's Cancer Survivors Network (CSN). We propose a novel network analysis technique based on low-rank matrices. The proposed approach decomposes the emotional expression semantic networks into an across-cancer time-independent component (which describes the \"baseline\" that is shared by multiple cancers), a cancer-specific time-independent component (which describes cancer-specific properties), and an across-cancer time-dependent component (which accommodates temporal effects on multiple cancer communities). For the second and third components, respectively, we consider a novel clustering structure and a change point structure. A penalization approach is developed, and its theoretical and computational properties are carefully established. The analysis of the CSN data leads to sensible networks and deeper insights into emotions for cancer overall and specific cancer types.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 3","pages":"2218-2236"},"PeriodicalIF":1.4,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12525517/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145309914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KULLBACK-LEIBLER-BASED DISCRETE FAILURE TIME MODELS FOR INTEGRATION OF PUBLISHED PREDICTION MODELS WITH NEW TIME-TO-EVENT DATASET. 基于kullback - leibler的离散故障时间模型,用于集成已发布的预测模型和新的时间到事件数据集。
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-06-01 Epub Date: 2025-05-28 DOI: 10.1214/24-aoas1955
Di Wang, Wen Ye, Randall Sung, Hui Jiang, Jeremy M G Taylor, Lisa Ly, Kevin He

Prediction of time-to-event data often suffers from rare event rates, small sample sizes, high dimensionality, and low signal-to-noise ratios. Incorporating published prediction models from external large-scale studies is expected to improve the performance of prognosis prediction from internal individual-level data. However, existing integration approaches typically assume that the underlying distributions of the external and internal data sources are similar, which is often invalid. To account for challenges, including heterogeneity, data sharing, and privacy constraints, we propose a failure time integration procedure, which utilizes a discrete hazard-based Kullback-Leibler discriminatory information measuring the discrepancy between the external models and the internal dataset. The asymptotic properties and simulation results show the advantage of the proposed method compared to those solely based on internal data. We apply the proposed method to improve prediction performance on a kidney transplant dataset from a local hospital by integrating this small-sized dataset with a published survival model obtained from the national transplant registry.

时间到事件数据的预测通常受到罕见事件率、小样本量、高维和低信噪比的影响。纳入来自外部大规模研究的已发表的预测模型有望提高来自内部个人水平数据的预后预测的性能。但是,现有的集成方法通常假设外部和内部数据源的底层分布相似,这通常是无效的。为了解决包括异质性、数据共享和隐私约束在内的挑战,我们提出了一种故障时间集成程序,该程序利用基于离散风险的Kullback-Leibler歧视性信息来测量外部模型与内部数据集之间的差异。渐近特性和仿真结果表明了该方法相对于仅基于内部数据的方法的优越性。我们将该方法应用于来自当地医院的肾移植数据集,通过将该小型数据集与从国家移植登记处获得的已发布的生存模型集成在一起,提高了该数据集的预测性能。
{"title":"KULLBACK-LEIBLER-BASED DISCRETE FAILURE TIME MODELS FOR INTEGRATION OF PUBLISHED PREDICTION MODELS WITH NEW TIME-TO-EVENT DATASET.","authors":"Di Wang, Wen Ye, Randall Sung, Hui Jiang, Jeremy M G Taylor, Lisa Ly, Kevin He","doi":"10.1214/24-aoas1955","DOIUrl":"10.1214/24-aoas1955","url":null,"abstract":"<p><p>Prediction of time-to-event data often suffers from rare event rates, small sample sizes, high dimensionality, and low signal-to-noise ratios. Incorporating published prediction models from external large-scale studies is expected to improve the performance of prognosis prediction from internal individual-level data. However, existing integration approaches typically assume that the underlying distributions of the external and internal data sources are similar, which is often invalid. To account for challenges, including heterogeneity, data sharing, and privacy constraints, we propose a failure time integration procedure, which utilizes a discrete hazard-based Kullback-Leibler discriminatory information measuring the discrepancy between the external models and the internal dataset. The asymptotic properties and simulation results show the advantage of the proposed method compared to those solely based on internal data. We apply the proposed method to improve prediction performance on a kidney transplant dataset from a local hospital by integrating this small-sized dataset with a published survival model obtained from the national transplant registry.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 2","pages":"1167-1189"},"PeriodicalIF":1.4,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12797872/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A DEEP NEURAL NETWORK TWO-PART MODEL AND FEATURE IMPORTANCE TEST FOR SEMI-CONTINUOUS DATA. 半连续数据的深度神经网络两部分模型及特征重要性检验。
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-06-01 Epub Date: 2025-05-28 DOI: 10.1214/25-aoas2013
Baiming Zou, Xinlei Mi, Shiyu Wan, Di Wu, James G Xenakis, Jianhua Hu, Fei Zou

Semi-continuous data frequently arise in clinical practice. For example, while many surgical patients still suffer from varying degrees of acute postoperative pain (POP) sometime after surgery (i.e., POP score > 0), others experience none (i.e., POP score = 0), indicating the existence of two distinct data processes at play. Existing parametric or semi-parametric two-part modeling methods for this type of semi-continuous data can fail to appropriately model the two underlying data processes as such methods rely heavily on (generalized) linear additive assumptions. However, many factors may interact to jointly influence the experience of POP non-additively and non-linearly. Motivated by this challenge and inspired by the flexibility of deep neural networks (DNN) to accurately approximate complex functions universally, we derive a DNN-based two-part model by adapting the conventional DNN methods with two additional components: a bootstrapping procedure along with a filtering algorithm to boost the stability of the conventional DNN, an approach we denote as sDNN. To improve the interpretability and transparency of sDNN, we further derive a feature importance testing procedure to identify important features associated with the outcome measurements of the two data processes, denoting this approach fsDNN. We show that fsDNN not only offers a statistical inference procedure for each feature under complex association but also that using the identified features can further improve the predictive performance of sDNN. The proposed sDNN- and fsDNN-based two-part models are applied to the analysis of real data from a POP study, in which application they clearly demonstrate advantages over the existing parametric and semi-parametric two-part models. Further, we conduct extensive numerical studies and draw comparisons with other machine learning methods to demonstrate that sDNN and fsDNN consistently outperform the existing two-part models and frequently used machine learning methods regardless of the data complexity. An R package implementing the proposed methods has been developed and is available in the Supplementary Material (Zou et al, 2025) and is also deposited on GitHub (https://github.com/BZou-lab/fsDNN).

临床实践中经常出现半连续数据。例如,虽然许多手术患者在手术后一段时间仍然遭受不同程度的急性术后疼痛(POP)(即POP评分> 0),但其他人则没有(即POP评分= 0),这表明存在两种不同的数据过程在起作用。对于这类半连续数据,现有的参数或半参数两部分建模方法可能无法适当地对两个潜在的数据过程进行建模,因为这些方法严重依赖于(广义的)线性可加性假设。然而,许多因素可能相互作用,共同影响POP体验的非加性和非线性。受到这一挑战的激励,并受到深度神经网络(DNN)精确近似复杂函数的灵活性的启发,我们通过将传统的DNN方法与两个额外组件相适应,推导出基于DNN的两部分模型:一个自举过程和一个滤波算法,以提高传统DNN的稳定性,我们将这种方法称为sDNN。为了提高sDNN的可解释性和透明度,我们进一步推导了一个特征重要性测试程序,以识别与两个数据处理的结果测量相关的重要特征,将该方法称为fsDNN。研究表明,fsDNN不仅为复杂关联下的每个特征提供了统计推理过程,而且利用识别出的特征可以进一步提高sDNN的预测性能。提出的基于sdn和fsdn的两部分模型应用于POP研究的实际数据分析,在应用中,它们明显优于现有的参数和半参数两部分模型。此外,我们进行了广泛的数值研究,并与其他机器学习方法进行了比较,以证明无论数据复杂性如何,sDNN和fsDNN始终优于现有的两部分模型和常用的机器学习方法。已经开发了实现所提出方法的R包,可在补充材料(Zou et al, 2025)中获得,也存放在GitHub (https://github.com/BZou-lab/fsDNN)上。
{"title":"A DEEP NEURAL NETWORK TWO-PART MODEL AND FEATURE IMPORTANCE TEST FOR SEMI-CONTINUOUS DATA.","authors":"Baiming Zou, Xinlei Mi, Shiyu Wan, Di Wu, James G Xenakis, Jianhua Hu, Fei Zou","doi":"10.1214/25-aoas2013","DOIUrl":"10.1214/25-aoas2013","url":null,"abstract":"<p><p>Semi-continuous data frequently arise in clinical practice. For example, while many surgical patients still suffer from varying degrees of acute postoperative pain (POP) sometime after surgery (i.e., POP score > 0), others experience none (i.e., POP score = 0), indicating the existence of two distinct data processes at play. Existing parametric or semi-parametric two-part modeling methods for this type of semi-continuous data can fail to appropriately model the two underlying data processes as such methods rely heavily on (generalized) linear additive assumptions. However, many factors may interact to jointly influence the experience of POP non-additively and non-linearly. Motivated by this challenge and inspired by the flexibility of deep neural networks (DNN) to accurately approximate complex functions universally, we derive a DNN-based two-part model by adapting the conventional DNN methods with two additional components: a bootstrapping procedure along with a filtering algorithm to boost the stability of the conventional DNN, an approach we denote as sDNN. To improve the interpretability and transparency of sDNN, we further derive a feature importance testing procedure to identify important features associated with the outcome measurements of the two data processes, denoting this approach fsDNN. We show that fsDNN not only offers a statistical inference procedure for each feature under complex association but also that using the identified features can further improve the predictive performance of sDNN. The proposed sDNN- and fsDNN-based two-part models are applied to the analysis of real data from a POP study, in which application they clearly demonstrate advantages over the existing parametric and semi-parametric two-part models. Further, we conduct extensive numerical studies and draw comparisons with other machine learning methods to demonstrate that sDNN and fsDNN consistently outperform the existing two-part models and frequently used machine learning methods regardless of the data complexity. An R package implementing the proposed methods has been developed and is available in the Supplementary Material (Zou et al, 2025) and is also deposited on GitHub (https://github.com/BZou-lab/fsDNN).</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 2","pages":"1314-1331"},"PeriodicalIF":1.3,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12263096/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144644080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BAYESIAN DATA AUGMENTATION FOR RECURRENT EVENTS UNDER INTERMITTENT ASSESSMENT IN OVERLAPPING INTERVALS WITH APPLICATIONS TO EMR DATA. 重复事件在重叠区间间歇评估下的贝叶斯数据增强与emr数据的应用。
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-06-01 Epub Date: 2025-05-28 DOI: 10.1214/24-aoas2007
Xin Liu, Patrick M Schnell

Electronic medical records (EMR) data contain rich information that can facilitate health-related studies but is collected primarily for purposes other than research. For recurrent events, EMR data often do not record event times or counts but only contain intermittently assessed and censored observations (i.e. upper and/or lower bounds for counts in a time interval) at uncontrolled times. This can result in non-contiguous or overlapping assessment intervals with censored event counts. Existing methods for analyzing intermittently assessed recurrent events assume disjoint assessment intervals with known counts (interval count data) due to a focus on prospective studies with controlled assessment times. We propose a Bayesian data augmentation method to analyze the complicated assessments in EMR data for recurrent events. Within a Gibbs sampler, event times are imputed by generating sets of event times from non-homogeneous Poisson processes and rejecting proposed sets that are incompatible with constraints imposed by assessment data. Based on the independent increments property of Poisson processes, we implement three techniques to speed up this rejection sampling imputation method for large EMR datasets: independent sampling by partitioning, truncated generation, and sequential sampling. In a simulation study we show our method accurately estimates parameters of log-linear Poisson process intensities. Although the proposed method can be applied generally to EMR data of recurrent events, our study is specifically motivated by identifying risk factors for falls due to cancer treatment and its supportive medications. We used the proposed method to analyze an EMR dataset comprising 5501 patients treated for breast cancer. Our analysis provides evidence supporting associations between certain risk factors (including classes of medications) and risk of falls.

电子医疗记录(EMR)数据包含丰富的信息,可以促进与健康有关的研究,但主要用于研究以外的目的。对于复发性事件,EMR数据通常不记录事件时间或计数,而只包含在不受控制的时间内间歇性评估和审查的观察结果(即时间间隔内计数的上限和/或下限)。这可能导致不连续或重叠的评估间隔与审查的事件计数。现有的分析间歇性评估的复发事件的方法,由于侧重于评估时间可控的前瞻性研究,假设具有已知计数(间隔计数数据)的不相交评估间隔。我们提出了一种贝叶斯数据增强方法来分析EMR数据中对复发事件的复杂评估。在吉布斯采样器中,通过从非齐次泊松过程中生成事件时间集并拒绝与评估数据施加的约束不兼容的建议集来估算事件时间。基于泊松过程的独立增量特性,我们实现了三种技术来加速这种大型EMR数据集的拒绝采样插入方法:分区独立采样、截断生成和顺序采样。仿真研究表明,该方法能准确地估计对数线性泊松过程强度的参数。虽然所提出的方法可以普遍应用于复发事件的EMR数据,但我们的研究是为了确定癌症治疗及其支持药物导致跌倒的危险因素。我们使用提出的方法分析了包含5501名乳腺癌治疗患者的EMR数据集。我们的分析提供了支持某些风险因素(包括药物类别)与跌倒风险之间关联的证据。
{"title":"BAYESIAN DATA AUGMENTATION FOR RECURRENT EVENTS UNDER INTERMITTENT ASSESSMENT IN OVERLAPPING INTERVALS WITH APPLICATIONS TO EMR DATA.","authors":"Xin Liu, Patrick M Schnell","doi":"10.1214/24-aoas2007","DOIUrl":"10.1214/24-aoas2007","url":null,"abstract":"<p><p>Electronic medical records (EMR) data contain rich information that can facilitate health-related studies but is collected primarily for purposes other than research. For recurrent events, EMR data often do not record event times or counts but only contain intermittently assessed and censored observations (i.e. upper and/or lower bounds for counts in a time interval) at uncontrolled times. This can result in non-contiguous or overlapping assessment intervals with censored event counts. Existing methods for analyzing intermittently assessed recurrent events assume disjoint assessment intervals with known counts (interval count data) due to a focus on prospective studies with controlled assessment times. We propose a Bayesian data augmentation method to analyze the complicated assessments in EMR data for recurrent events. Within a Gibbs sampler, event times are imputed by generating sets of event times from non-homogeneous Poisson processes and rejecting proposed sets that are incompatible with constraints imposed by assessment data. Based on the independent increments property of Poisson processes, we implement three techniques to speed up this rejection sampling imputation method for large EMR datasets: independent sampling by partitioning, truncated generation, and sequential sampling. In a simulation study we show our method accurately estimates parameters of log-linear Poisson process intensities. Although the proposed method can be applied generally to EMR data of recurrent events, our study is specifically motivated by identifying risk factors for falls due to cancer treatment and its supportive medications. We used the proposed method to analyze an EMR dataset comprising 5501 patients treated for breast cancer. Our analysis provides evidence supporting associations between certain risk factors (including classes of medications) and risk of falls.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 2","pages":"1332-1361"},"PeriodicalIF":1.4,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12393837/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144976823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DYNAMIC PREDICTION WITH MULTIVARIATE LONGITUDINAL OUTCOMES AND LONGITUDINAL MAGNETIC RESONANCE IMAGING DATA. 动态预测与多元纵向结果和纵向磁共振成像数据。
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-01 Epub Date: 2025-03-17 DOI: 10.1214/24-aoas1970
Haotian Zou, Luo Xiao, Donglin Zeng, Sheng Luo

Alzheimer's Disease (AD) is a common neurodegenerative disorder impairing multiple domains. Recent AD studies, for example, the Alzheimer's Disease Neuroimaging Initiative (ADNI) study, collect multimodal data to better understand AD severity and progression. To facilitate precision medicine for high-risk individuals, it is essential to develop an AD predictive model that leverages multimodal data and provides accurate personalized predictions of dementia occurrences. In this article we propose a multivariate functional mixed model with longitudinal magnetic resonance imaging data (MFMM-LMRI) that jointly models longitudinal neurological scores, longitudinal voxelwise MRI data, and the survival outcome as dementia onset. We model longitudinal MRI data using the joint and individual variation explained (JIVE) approach. We investigate two functional forms linking the longitudinal and survival processes. We adopt the Markov chain Monte Carlo (MCMC) method to obtain posterior samples. We establish a dynamic prediction framework that predicts longitudinal trajectories and the probability of dementia occurrence. The simulation study with various sample sizes and event rates supports the validity of the method. We apply the MFMM-LMRI to the motivating ADNI study and conclude that additional ApoE-ϵ4 alleles and a higher latent disease profile are associated with a higher risk of dementia onset. We detect a significant association between the longitudinal MRI data and the survival outcome. The instantaneous model with longitudinal MRI data has the best fitting and predictive performance.

阿尔茨海默病(AD)是一种常见的神经退行性疾病。最近的阿尔茨海默病研究,例如,阿尔茨海默病神经影像学倡议(ADNI)研究,收集多模式数据,以更好地了解阿尔茨海默病的严重程度和进展。为了促进对高危人群的精准医疗,开发一种利用多模态数据的阿尔茨海默病预测模型并提供对痴呆症发生的准确个性化预测至关重要。在本文中,我们提出了一个具有纵向磁共振成像数据(MFMM-LMRI)的多元功能混合模型,该模型联合模拟纵向神经学评分、纵向体向MRI数据和痴呆发病时的生存结果。我们使用关节和个体变异解释(JIVE)方法对纵向MRI数据进行建模。我们研究了两种连接纵向和生存过程的功能形式。我们采用马尔科夫链蒙特卡罗(MCMC)方法获得后验样本。我们建立了一个动态预测框架,预测纵向轨迹和痴呆发生的概率。不同样本量和事件率的仿真研究证明了该方法的有效性。我们将MFMM-LMRI应用于ADNI研究,并得出结论,额外的ApoE-ϵ4等位基因和更高的潜伏性疾病特征与更高的痴呆发病风险相关。我们发现纵向MRI数据与生存结果之间存在显著关联。纵向MRI数据的瞬时模型具有最佳的拟合和预测性能。
{"title":"DYNAMIC PREDICTION WITH MULTIVARIATE LONGITUDINAL OUTCOMES AND LONGITUDINAL MAGNETIC RESONANCE IMAGING DATA.","authors":"Haotian Zou, Luo Xiao, Donglin Zeng, Sheng Luo","doi":"10.1214/24-aoas1970","DOIUrl":"10.1214/24-aoas1970","url":null,"abstract":"<p><p>Alzheimer's Disease (AD) is a common neurodegenerative disorder impairing multiple domains. Recent AD studies, for example, the Alzheimer's Disease Neuroimaging Initiative (ADNI) study, collect multimodal data to better understand AD severity and progression. To facilitate precision medicine for high-risk individuals, it is essential to develop an AD predictive model that leverages multimodal data and provides accurate personalized predictions of dementia occurrences. In this article we propose a multivariate functional mixed model with longitudinal magnetic resonance imaging data (MFMM-LMRI) that jointly models longitudinal neurological scores, longitudinal voxelwise MRI data, and the survival outcome as dementia onset. We model longitudinal MRI data using the joint and individual variation explained (JIVE) approach. We investigate two functional forms linking the longitudinal and survival processes. We adopt the Markov chain Monte Carlo (MCMC) method to obtain posterior samples. We establish a dynamic prediction framework that predicts longitudinal trajectories and the probability of dementia occurrence. The simulation study with various sample sizes and event rates supports the validity of the method. We apply the MFMM-LMRI to the motivating ADNI study and conclude that additional ApoE-<i>ϵ</i>4 alleles and a higher latent disease profile are associated with a higher risk of dementia onset. We detect a significant association between the longitudinal MRI data and the survival outcome. The instantaneous model with longitudinal MRI data has the best fitting and predictive performance.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 1","pages":"505-528"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12206078/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LOW-RANK LONGITUDINAL FACTOR REGRESSION WITH APPLICATION TO CHEMICAL MIXTURES. 低秩纵向因子回归及其在化学混合物中的应用。
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-01 Epub Date: 2025-03-17 DOI: 10.1214/24-aoas1988
Glenn Palmer, Amy H Herring, David B Dunson

Developmental epidemiology commonly focuses on assessing the association between multiple early life exposures and childhood health. Statistical analyses of data from such studies focus on inferring the contributions of individual exposures, while also characterizing time-varying and interacting effects. Such inferences are made more challenging by correlations among exposures, nonlinearity, and the curse of dimensionality. Motivated by studying the effects of prenatal bisphenol A (BPA) and phthalate exposures on glucose metabolism in adolescence using data from the ELEMENT study, we propose a low-rank longitudinal factor regression (LowFR) model for tractable inference on flexible longitudinal exposure effects. LowFR handles highly-correlated exposures using a Bayesian dynamic factor model, which is fit jointly with a health outcome via a novel factor regression approach. The model collapses on simpler and intuitive submodels when appropriate, while expanding to allow considerable flexibility in time-varying and interaction effects when supported by the data. After demonstrating LowFR's effectiveness in simulations, we use it to analyze the ELEMENT data and find that diethyl and dibutyl phthalate metabolite levels in trimesters 1 and 2 are associated with altered glucose metabolism in adolescence.

发育流行病学通常侧重于评估多次早期生活暴露与儿童健康之间的关系。这些研究数据的统计分析侧重于推断个人暴露的贡献,同时也描述了时变和相互作用的影响。这样的推断是更具挑战性的相关性暴露,非线性,和诅咒的维度。在研究产前双酚A (BPA)和邻苯二甲酸盐暴露对青春期葡萄糖代谢的影响的基础上,我们提出了一个低秩纵向因素回归(LowFR)模型,用于柔性纵向暴露效应的易于推断。LowFR使用贝叶斯动态因子模型处理高相关暴露,该模型通过新颖的因子回归方法与健康结果联合拟合。在适当的时候,模型在更简单和直观的子模型上崩溃,同时在数据支持的情况下,扩展到允许在时变和交互效果方面具有相当大的灵活性。在模拟中证明了LowFR的有效性后,我们用它来分析ELEMENT数据,发现妊娠1和2个月的邻苯二甲酸二乙酯和邻苯二甲酸二丁酯代谢物水平与青春期葡萄糖代谢的改变有关。
{"title":"LOW-RANK LONGITUDINAL FACTOR REGRESSION WITH APPLICATION TO CHEMICAL MIXTURES.","authors":"Glenn Palmer, Amy H Herring, David B Dunson","doi":"10.1214/24-aoas1988","DOIUrl":"https://doi.org/10.1214/24-aoas1988","url":null,"abstract":"<p><p>Developmental epidemiology commonly focuses on assessing the association between multiple early life exposures and childhood health. Statistical analyses of data from such studies focus on inferring the contributions of individual exposures, while also characterizing time-varying and interacting effects. Such inferences are made more challenging by correlations among exposures, nonlinearity, and the curse of dimensionality. Motivated by studying the effects of prenatal bisphenol A (BPA) and phthalate exposures on glucose metabolism in adolescence using data from the ELEMENT study, we propose a low-rank longitudinal factor regression (LowFR) model for tractable inference on flexible longitudinal exposure effects. LowFR handles highly-correlated exposures using a Bayesian dynamic factor model, which is fit jointly with a health outcome via a novel factor regression approach. The model collapses on simpler and intuitive submodels when appropriate, while expanding to allow considerable flexibility in time-varying and interaction effects when supported by the data. After demonstrating LowFR's effectiveness in simulations, we use it to analyze the ELEMENT data and find that diethyl and dibutyl phthalate metabolite levels in trimesters 1 and 2 are associated with altered glucose metabolism in adolescence.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 1","pages":"769-797"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12013532/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144057647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
INFERRING SYNERGISTIC AND ANTAGONISTIC INTERACTIONS IN MIXTURES OF EXPOSURES. 推断暴露混合物中的协同和拮抗相互作用。
IF 1.4 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-01 Epub Date: 2025-03-17 DOI: 10.1214/24-aoas1948
Shounak Chattopadhyay, Stephanie M Engel, David Dunson

There is abundant interest in assessing the joint effects of multiple exposures on human health. This is often referred to as the mixtures problem in environmental epidemiology and toxicology. Classically, studies have examined the adverse health effects of different chemicals one at a time, but there is concern that certain chemicals may act together to amplify each other's effects. Such amplification is referred to as synergistic interaction, while chemicals that inhibit each other's effects have antagonistic interactions. Current approaches for assessing the health effects of chemical mixtures do not explicitly consider synergy or antagonism in the modeling, instead focusing on either parametric or unconstrained nonparametric dose response surface modeling. The parametric case can be too inflexible, while nonparametric methods face a curse of dimensionality that leads to overly wiggly and uninterpretable surface estimates. We propose a Bayesian approach that decomposes the response surface into additive main effects and pairwise interaction effects and then detects synergistic and antagonistic interactions. Variable selection decisions for each interaction component are also provided. This Synergistic Antagonistic Interaction Detection (SAID) framework is evaluated relative to existing approaches using simulation experiments and an application to data from NHANES.

人们对评估多重接触对人类健康的共同影响非常感兴趣。这通常被称为环境流行病学和毒理学中的混合物问题。传统上,研究一次只检查一种不同化学物质对健康的不利影响,但人们担心某些化学物质可能会共同作用,放大彼此的影响。这种放大被称为协同作用,而相互抑制作用的化学物质具有拮抗作用。目前评估化学混合物对健康影响的方法在建模中没有明确考虑协同作用或拮抗作用,而是侧重于参数或无约束的非参数剂量反应面建模。参数化的情况可能太不灵活,而非参数化的方法面临维度的诅咒,导致过度扭曲和不可解释的表面估计。我们提出了一种贝叶斯方法,该方法将响应面分解为可加性主效应和成对相互作用效应,然后检测协同和拮抗相互作用。还提供了每个交互组件的可变选择决策。该协同拮抗相互作用检测(SAID)框架通过模拟实验和NHANES数据应用,相对于现有方法进行了评估。
{"title":"INFERRING SYNERGISTIC AND ANTAGONISTIC INTERACTIONS IN MIXTURES OF EXPOSURES.","authors":"Shounak Chattopadhyay, Stephanie M Engel, David Dunson","doi":"10.1214/24-aoas1948","DOIUrl":"10.1214/24-aoas1948","url":null,"abstract":"<p><p>There is abundant interest in assessing the joint effects of multiple exposures on human health. This is often referred to as the mixtures problem in environmental epidemiology and toxicology. Classically, studies have examined the adverse health effects of different chemicals one at a time, but there is concern that certain chemicals may act together to amplify each other's effects. Such amplification is referred to as <i>synergistic</i> interaction, while chemicals that inhibit each other's effects have <i>antagonistic</i> interactions. Current approaches for assessing the health effects of chemical mixtures do not explicitly consider synergy or antagonism in the modeling, instead focusing on either parametric or unconstrained nonparametric dose response surface modeling. The parametric case can be too inflexible, while nonparametric methods face a curse of dimensionality that leads to overly wiggly and uninterpretable surface estimates. We propose a Bayesian approach that decomposes the response surface into additive main effects and pairwise interaction effects and then detects synergistic and antagonistic interactions. Variable selection decisions for each interaction component are also provided. This Synergistic Antagonistic Interaction Detection (SAID) framework is evaluated relative to existing approaches using simulation experiments and an application to data from NHANES.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 1","pages":"169-190"},"PeriodicalIF":1.4,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12393835/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144976792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HETEROGENEOUS TREATMENT AND SPILLOVER EFFECTS UNDER CLUSTERED NETWORK INTERFERENCE. 集群网络干扰下的异质性处理与溢出效应。
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-01 Epub Date: 2025-03-17 DOI: 10.1214/24-aoas1913
Falco J Bargagli-Stoffi, Costanza Tortú, Laura Forastiere

The bulk of causal inference studies rule out the presence of interference between units. However, in many real-world scenarios, units are interconnected by social, physical, or virtual ties, and the effect of the treatment can spill from one unit to other connected individuals in the network. In this paper, we develop a machine learning method that uses tree-based algorithms and a Horvitz-Thompson estimator to assess the heterogeneity of treatment and spillover effects with respect to individual, neighborhood, and network characteristics in the context of clustered networks and interference within clusters. The proposed network causal tree (NCT) algorithm has several advantages. First, it allows the investigation of the heterogeneity of the treatment effect, avoiding potential bias due to the presence of interference. Second, understanding the heterogeneity of both treatment and spillover effects can guide policymakers in scaling up interventions, designing targeting strategies, and increasing cost-effectiveness. We investigate the performance of our NCT method using a Monte Carlo simulation study and illustrate its application to assess the heterogeneous effects of information sessions on the uptake of a new weather insurance policy in rural China.

大量的因果推理研究排除了单元之间存在的干扰。然而,在许多现实世界的场景中,单位是通过社会、物理或虚拟关系相互连接的,治疗的效果可以从一个单位溢出到网络中其他连接的个体。在本文中,我们开发了一种机器学习方法,该方法使用基于树的算法和Horvitz-Thompson估计器来评估在集群网络和集群内部干扰的背景下,关于个体、社区和网络特征的处理和溢出效应的异质性。本文提出的网络因果树(NCT)算法具有几个优点。首先,它允许研究治疗效果的异质性,避免由于存在干扰而产生的潜在偏差。其次,了解治疗和溢出效应的异质性可以指导政策制定者扩大干预措施,设计目标战略,提高成本效益。我们使用蒙特卡罗模拟研究来研究我们的NCT方法的性能,并说明其应用于评估信息会话对中国农村新天气保险政策吸收的异质效应。
{"title":"HETEROGENEOUS TREATMENT AND SPILLOVER EFFECTS UNDER CLUSTERED NETWORK INTERFERENCE.","authors":"Falco J Bargagli-Stoffi, Costanza Tortú, Laura Forastiere","doi":"10.1214/24-aoas1913","DOIUrl":"10.1214/24-aoas1913","url":null,"abstract":"<p><p>The bulk of causal inference studies rule out the presence of interference between units. However, in many real-world scenarios, units are interconnected by social, physical, or virtual ties, and the effect of the treatment can spill from one unit to other connected individuals in the network. In this paper, we develop a machine learning method that uses tree-based algorithms and a Horvitz-Thompson estimator to assess the heterogeneity of treatment and spillover effects with respect to individual, neighborhood, and network characteristics in the context of clustered networks and interference within clusters. The proposed network causal tree (NCT) algorithm has several advantages. First, it allows the investigation of the heterogeneity of the treatment effect, avoiding potential bias due to the presence of interference. Second, understanding the heterogeneity of both treatment and spillover effects can guide policymakers in scaling up interventions, designing targeting strategies, and increasing cost-effectiveness. We investigate the performance of our NCT method using a Monte Carlo simulation study and illustrate its application to assess the heterogeneous effects of information sessions on the uptake of a new weather insurance policy in rural China.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"19 1","pages":"28-55"},"PeriodicalIF":1.3,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12245184/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144610248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CAUSAL HEALTH IMPACTS OF POWER PLANT EMISSION CONTROLS UNDER MODELED AND UNCERTAIN PHYSICAL PROCESS INTERFERENCE. 模拟和不确定物理过程干扰下电厂排放控制的因果健康影响。
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-12-01 Epub Date: 2024-10-31 DOI: 10.1214/24-aoas1904
Nathan B Wikle, Corwin M Zigler

Causal inference with spatial environmental data is often challenging due to the presence of interference: outcomes for observational units depend on some combination of local and nonlocal treatment. This is especially relevant when estimating the effect of power plant emissions controls on population health, as pollution exposure is dictated by: (i) the location of point-source emissions as well as (ii) the transport of pollutants across space via dynamic physical-chemical processes. In this work we estimate the effectiveness of air quality interventions at coal-fired power plants in reducing two adverse health outcomes in Texas in 2016: pediatric asthma ED visits and Medicare all-cause mortality. We develop methods for causal inference with interference when the underlying network structure is not known with certainty and instead must be estimated from ancillary data. Notably, uncertainty in the interference structure is propagated to the resulting causal effect estimates. We offer a Bayesian, spatial mechanistic model for the interference mapping, which we combine with a flexible nonparametric outcome model to marginalize estimates of causal effects over uncertainty in the structure of interference. our analysis finds some evidence that emissions controls at upwind power plants reduce asthma ED visits and all-cause mortality; however, accounting for uncertainty in the interference renders the results largely inconclusive.

由于存在干扰,空间环境数据的因果推断往往具有挑战性:观测单位的结果取决于局部和非局部处理的某种组合。在估计发电厂排放控制对人口健康的影响时,这一点尤其重要,因为污染暴露取决于:(一)点源排放的地点以及(二)污染物通过动态物理化学过程在空间上的转移。在这项工作中,我们估计了2016年德克萨斯州燃煤电厂空气质量干预措施在减少两种不良健康结果方面的有效性:儿科哮喘急诊就诊和医疗保险全因死亡率。当底层网络结构不确定且必须从辅助数据中估计时,我们开发了具有干扰的因果推理方法。值得注意的是,干涉结构中的不确定性被传播到由此产生的因果效应估计中。我们为干扰映射提供了一个贝叶斯空间机制模型,我们将其与一个灵活的非参数结果模型相结合,以边缘化干扰结构中不确定性的因果效应估计。我们的分析发现,一些证据表明,对逆风发电厂的排放控制可以减少哮喘急诊就诊和全因死亡率;然而,考虑到干扰的不确定性,结果基本上是不确定的。
{"title":"CAUSAL HEALTH IMPACTS OF POWER PLANT EMISSION CONTROLS UNDER MODELED AND UNCERTAIN PHYSICAL PROCESS INTERFERENCE.","authors":"Nathan B Wikle, Corwin M Zigler","doi":"10.1214/24-aoas1904","DOIUrl":"10.1214/24-aoas1904","url":null,"abstract":"<p><p>Causal inference with spatial environmental data is often challenging due to the presence of interference: outcomes for observational units depend on some combination of local and nonlocal treatment. This is especially relevant when estimating the effect of power plant emissions controls on population health, as pollution exposure is dictated by: (i) the location of point-source emissions as well as (ii) the transport of pollutants across space via dynamic physical-chemical processes. In this work we estimate the effectiveness of air quality interventions at coal-fired power plants in reducing two adverse health outcomes in Texas in 2016: pediatric asthma ED visits and Medicare all-cause mortality. We develop methods for causal inference with interference when the underlying network structure is not known with certainty and instead must be estimated from ancillary data. Notably, uncertainty in the interference structure is propagated to the resulting causal effect estimates. We offer a Bayesian, spatial mechanistic model for the interference mapping, which we combine with a flexible nonparametric outcome model to marginalize estimates of causal effects over uncertainty in the structure of interference. our analysis finds some evidence that emissions controls at upwind power plants reduce asthma ED visits and all-cause mortality; however, accounting for uncertainty in the interference renders the results largely inconclusive.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"18 4","pages":"2753-2774"},"PeriodicalIF":1.3,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11619076/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142787678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annals of Applied Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1