首页 > 最新文献

Annals of Applied Statistics最新文献

英文 中文
BAYESIAN MITIGATION OF SPATIAL COARSENING FOR A HAWKES MODEL APPLIED TO GUNFIRE, WILDFIRE AND VIRAL CONTAGION. 应用于枪火、野火和病毒传染的霍克斯模型的空间粗化的贝叶斯缓解。
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-03-01 Epub Date: 2022-03-28 DOI: 10.1214/21-aoas1517
Andrew J Holbrook, Xiang Ji, Marc A Suchard

Self-exciting spatiotemporal Hawkes processes have found increasing use in the study of large-scale public health threats, ranging from gun violence and earthquakes to wildfires and viral contagion. Whereas many such applications feature locational uncertainty, that is, the exact spatial positions of individual events are unknown, most Hawkes model analyses to date have ignored spatial coarsening present in the data. Three particular 21st century public health crises-urban gun violence, rural wildfires and global viral spread-present qualitatively and quantitatively varying uncertainty regimes that exhibit: (a) different collective magnitudes of spatial coarsening, (b) uniform and mixed magnitude coarsening, (c) differently shaped uncertainty regions and-less orthodox-(d) locational data distributed within the "wrong" effective space. We explicitly model such uncertainties in a Bayesian manner and jointly infer unknown locations together with all parameters of a reasonably flexible Hawkes model, obtaining results that are practically and statistically distinct from those obtained while ignoring spatial coarsening. This work also features two different secondary contributions: first, to facilitate Bayesian inference of locations and background rate parameters, we make a subtle yet crucial change to an established kernel-based rate model, and second, to facilitate the same Bayesian inference at scale, we develop a massively parallel implementation of the model's log-likelihood gradient with respect to locations and thus avoid its quadratic computational cost in the context of Hamiltonian Monte Carlo. Our examples involve thousands of observations and allow us to demonstrate practicality at moderate scales.

自激时空霍克斯过程越来越多地应用于大规模公共健康威胁的研究,从枪支暴力和地震到野火和病毒传染。许多此类应用都具有位置不确定性,即单个事件的确切空间位置是未知的,而迄今为止的大多数霍克斯模型分析都忽略了数据中存在的空间粗化现象。21 世纪的三个特殊公共卫生危机--城市枪支暴力、农村野火和全球病毒传播--呈现出定性和定量不同的不确定性机制,表现出:(a)不同的集体空间粗化幅度,(b)均匀和混合幅度的粗化,(c)不同形状的不确定性区域,以及(d)分布在 "错误 "有效空间内的定位数据。我们以贝叶斯方式对这些不确定性进行了明确建模,并将未知位置与合理灵活的霍克斯模型的所有参数一起进行了联合推断,得到的结果与忽略空间粗化时得到的结果在实践和统计上都截然不同。这项工作还有两个不同的次要贡献:首先,为了便于对位置和背景速率参数进行贝叶斯推断,我们对一个已建立的基于核的速率模型做了一个微妙而关键的改动;其次,为了便于在规模上进行同样的贝叶斯推断,我们开发了一种大规模并行实施该模型关于位置的对数似然梯度的方法,从而避免了在汉密尔顿蒙特卡罗背景下的二次计算成本。我们的例子涉及成千上万的观测数据,使我们能够证明在中等规模下的实用性。
{"title":"BAYESIAN MITIGATION OF SPATIAL COARSENING FOR A HAWKES MODEL APPLIED TO GUNFIRE, WILDFIRE AND VIRAL CONTAGION.","authors":"Andrew J Holbrook, Xiang Ji, Marc A Suchard","doi":"10.1214/21-aoas1517","DOIUrl":"10.1214/21-aoas1517","url":null,"abstract":"<p><p>Self-exciting spatiotemporal Hawkes processes have found increasing use in the study of large-scale public health threats, ranging from gun violence and earthquakes to wildfires and viral contagion. Whereas many such applications feature locational uncertainty, that is, the exact spatial positions of individual events are unknown, most Hawkes model analyses to date have ignored spatial coarsening present in the data. Three particular 21st century public health crises-urban gun violence, rural wildfires and global viral spread-present qualitatively and quantitatively varying uncertainty regimes that exhibit: (a) different collective magnitudes of spatial coarsening, (b) uniform and mixed magnitude coarsening, (c) differently shaped uncertainty regions and-less orthodox-(d) locational data distributed within the \"wrong\" effective space. We explicitly model such uncertainties in a Bayesian manner and jointly infer unknown locations together with all parameters of a reasonably flexible Hawkes model, obtaining results that are practically and statistically distinct from those obtained while ignoring spatial coarsening. This work also features two different secondary contributions: first, to facilitate Bayesian inference of locations and background rate parameters, we make a subtle yet crucial change to an established kernel-based rate model, and second, to facilitate the same Bayesian inference at scale, we develop a massively parallel implementation of the model's log-likelihood gradient with respect to locations and thus avoid its quadratic computational cost in the context of Hamiltonian Monte Carlo. Our examples involve thousands of observations and allow us to demonstrate practicality at moderate scales.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"16 1","pages":"573-595"},"PeriodicalIF":1.3,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536472/pdf/nihms-1797628.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9194379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS. 用于泛组学泛癌分析的二维链接矩阵因子分解。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-03-01 Epub Date: 2022-03-28 DOI: 10.1214/21-AOAS1495
Eric F Lock, Jun Young Park, Katherine A Hoadley

Several modern applications require the integration of multiple large data matrices that have shared rows and/or columns. For example, cancer studies that integrate multiple omics platforms across multiple types of cancer, pan-omics pan-cancer analysis, have extended our knowledge of molecular heterogeneity beyond what was observed in single tumor and single platform studies. However, these studies have been limited by available statistical methodology. We propose a flexible approach to the simultaneous factorization and decomposition of variation across such bidimensionally linked matrices, BIDIFAC+. BIDIFAC+ decomposes variation into a series of low-rank components that may be shared across any number of row sets (e.g., omics platforms) or column sets (e.g., cancer types). This builds on a growing literature for the factorization and decomposition of linked matrices which has primarily focused on multiple matrices that are linked in one dimension (rows or columns) only. Our objective function extends nuclear norm penalization, is motivated by random matrix theory, gives a unique decomposition under relatively mild conditions, and can be shown to give the mode of a Bayesian posterior distribution. We apply BIDIFAC+ to pan-omics pan-cancer data from TCGA, identifying shared and specific modes of variability across four different omics platforms and 29 different cancer types.

一些现代应用程序需要集成具有共享行和/或列的多个大型数据矩阵。例如,整合多种类型癌症的多组学平台的癌症研究,即全组学全癌分析,扩展了我们对分子异质性的认识,超出了单肿瘤和单平台研究的范围。然而,这些研究受到现有统计方法的限制。我们提出了一种灵活的方法来同时分解和分解这种二维链接矩阵的变化,BIDIFC+。BIDIFAC+将变化分解为一系列低阶分量,这些分量可以在任何数量的行集(例如组学平台)或列集(例如癌症类型)之间共享。这建立在越来越多的链接矩阵的因子分解和分解文献的基础上,这些文献主要关注仅在一维(行或列)中链接的多个矩阵。我们的目标函数扩展了核范数惩罚,受随机矩阵理论的激励,在相对温和的条件下给出了唯一的分解,并且可以证明给出了贝叶斯后验分布的模式。我们将BIDIFAC+应用于TCGA的全组学全癌数据,确定了四个不同组学平台和29种不同癌症类型的共享和特定变异模式。
{"title":"BIDIMENSIONAL LINKED MATRIX FACTORIZATION FOR PAN-OMICS PAN-CANCER ANALYSIS.","authors":"Eric F Lock, Jun Young Park, Katherine A Hoadley","doi":"10.1214/21-AOAS1495","DOIUrl":"https://doi.org/10.1214/21-AOAS1495","url":null,"abstract":"<p><p>Several modern applications require the integration of multiple large data matrices that have shared rows and/or columns. For example, cancer studies that integrate multiple omics platforms across multiple types of cancer, <i>pan-omics pan-cancer analysis</i>, have extended our knowledge of molecular heterogeneity beyond what was observed in single tumor and single platform studies. However, these studies have been limited by available statistical methodology. We propose a flexible approach to the simultaneous factorization and decomposition of variation across such <i>bidimensionally linked</i> matrices, BIDIFAC+. BIDIFAC+ decomposes variation into a series of low-rank components that may be shared across any number of row sets (e.g., omics platforms) or column sets (e.g., cancer types). This builds on a growing literature for the factorization and decomposition of linked matrices which has primarily focused on multiple matrices that are linked in one dimension (rows or columns) only. Our objective function extends nuclear norm penalization, is motivated by random matrix theory, gives a unique decomposition under relatively mild conditions, and can be shown to give the mode of a Bayesian posterior distribution. We apply BIDIFAC+ to pan-omics pan-cancer data from TCGA, identifying shared and specific modes of variability across <i>four</i> different omics platforms and 29 different cancer types.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"16 1","pages":"193-215"},"PeriodicalIF":1.8,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9060567/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71523301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
BAGEL: A BAYESIAN GRAPHICAL MODEL FOR INFERRING DRUG EFFECT LONGITUDINALLY ON DEPRESSION IN PEOPLE WITH HIV. 百吉饼:一种贝叶斯图形模型,用于纵向推断药物对艾滋病毒感染者抑郁的影响。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2022-03-01 DOI: 10.1214/21-AOAS1492
Yuliang Li, Yang Ni, Leah H Rubin, Amanda B Spence, Yanxun Xu

Access and adherence to antiretroviral therapy (ART) has transformed the face of HIV infection from a fatal to a chronic disease. However, ART is also known for its side effects. Studies have reported that ART is associated with depressive symptomatology. Large-scale HIV clinical databases with individuals' longitudinal depression records, ART medications, and clinical characteristics offer researchers unprecedented opportunities to study the effects of ART drugs on depression over time. We develop BAGEL, a Bayesian graphical model to investigate longitudinal effects of ART drugs on a range of depressive symptoms while adjusting for participants' demographic, behavior, and clinical characteristics, and taking into account the heterogeneous population through a Bayesian nonparametric prior. We evaluate BAGEL through simulation studies. Application to a dataset from the Women's Interagency HIV Study yields interpretable and clinically useful results. BAGEL not only can improve our understanding of ART drugs effects on disparate depression symptoms, but also has clinical utility in guiding informed and effective treatment selection to facilitate precision medicine in HIV.

获得和坚持抗逆转录病毒治疗已使艾滋病毒感染的面貌从一种致命疾病转变为一种慢性病。然而,ART也因其副作用而闻名。研究报告称,抗逆转录病毒治疗与抑郁症状有关。包含个体抑郁纵向记录、抗逆转录病毒药物和临床特征的大规模HIV临床数据库为研究人员提供了前所未有的机会来研究抗逆转录病毒药物对抑郁症的长期影响。我们开发了BAGEL,一个贝叶斯图形模型来研究抗逆转录病毒药物对一系列抑郁症状的纵向影响,同时调整参与者的人口统计学、行为和临床特征,并通过贝叶斯非参数先验考虑异质性人群。我们通过模拟研究来评估BAGEL。对来自妇女跨机构艾滋病毒研究的数据集的应用产生了可解释和临床有用的结果。BAGEL不仅可以提高我们对ART药物对不同抑郁症状的疗效的认识,而且在指导知情和有效的治疗选择以促进HIV精准医疗方面具有临床应用价值。
{"title":"BAGEL: A BAYESIAN GRAPHICAL MODEL FOR INFERRING DRUG EFFECT LONGITUDINALLY ON DEPRESSION IN PEOPLE WITH HIV.","authors":"Yuliang Li,&nbsp;Yang Ni,&nbsp;Leah H Rubin,&nbsp;Amanda B Spence,&nbsp;Yanxun Xu","doi":"10.1214/21-AOAS1492","DOIUrl":"https://doi.org/10.1214/21-AOAS1492","url":null,"abstract":"<p><p>Access and adherence to antiretroviral therapy (ART) has transformed the face of HIV infection from a fatal to a chronic disease. However, ART is also known for its side effects. Studies have reported that ART is associated with depressive symptomatology. Large-scale HIV clinical databases with individuals' longitudinal depression records, ART medications, and clinical characteristics offer researchers unprecedented opportunities to study the effects of ART drugs on depression over time. We develop BAGEL, a Bayesian graphical model to investigate longitudinal effects of ART drugs on a range of depressive symptoms while adjusting for participants' demographic, behavior, and clinical characteristics, and taking into account the heterogeneous population through a Bayesian nonparametric prior. We evaluate BAGEL through simulation studies. Application to a dataset from the Women's Interagency HIV Study yields interpretable and clinically useful results. BAGEL not only can improve our understanding of ART drugs effects on disparate depression symptoms, but also has clinical utility in guiding informed and effective treatment selection to facilitate precision medicine in HIV.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"16 1","pages":"21-39"},"PeriodicalIF":1.8,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9236217/pdf/nihms-1778597.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10737070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
JOINT AND INDIVIDUAL ANALYSIS OF BREAST CANCER HISTOLOGIC IMAGES AND GENOMIC COVARIATES. 对乳腺癌组织学图像和基因组协变量进行联合和单独分析。
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2021-12-01 Epub Date: 2021-12-21 DOI: 10.1214/20-aoas1433
Iain Carmichael, Benjamin C Calhoun, Katherine A Hoadley, Melissa A Troester, Joseph Geradts, Heather D Couture, Linnea Olsson, Charles M Perou, Marc Niethammer, Jan Hannig, J S Marron

The two main approaches in the study of breast cancer are histopathology (analyzing visual characteristics of tumors) and genomics. While both histopathology and genomics are fundamental to cancer research, the connections between these fields have been relatively superficial. We bridge this gap by investigating the Carolina Breast Cancer Study through the development of an integrative, exploratory analysis framework. Our analysis gives insights - some known, some novel - that are engaging to both pathologists and geneticists. Our analysis framework is based on Angle-based Joint and Individual Variation Explained (AJIVE) for statistical data integration and exploits Convolutional Neural Networks (CNNs) as a powerful, automatic method for image feature extraction. CNNs raise interpretability issues that we address by developing novel methods to explore visual modes of variation captured by statistical algorithms (e.g. PCA or AJIVE) applied to CNN features.

研究乳腺癌的两种主要方法是组织病理学(分析肿瘤的视觉特征)和基因组学。虽然组织病理学和基因组学都是癌症研究的基础,但这两个领域之间的联系却相对肤浅。我们通过开发一个综合探索性分析框架来研究卡罗莱纳乳腺癌研究,从而弥补了这一差距。我们的分析提供了对病理学家和遗传学家都有启发的见解--有些是已知的,有些是新颖的。我们的分析框架基于基于角度的联合和个体差异解释(AJIVE)进行统计数据整合,并利用卷积神经网络(CNN)作为图像特征提取的强大自动方法。卷积神经网络提出了可解释性问题,我们通过开发新颖的方法来解决这些问题,以探索应用于卷积神经网络特征的统计算法(如 PCA 或 AJIVE)所捕捉到的视觉变异模式。
{"title":"JOINT AND INDIVIDUAL ANALYSIS OF BREAST CANCER HISTOLOGIC IMAGES AND GENOMIC COVARIATES.","authors":"Iain Carmichael, Benjamin C Calhoun, Katherine A Hoadley, Melissa A Troester, Joseph Geradts, Heather D Couture, Linnea Olsson, Charles M Perou, Marc Niethammer, Jan Hannig, J S Marron","doi":"10.1214/20-aoas1433","DOIUrl":"10.1214/20-aoas1433","url":null,"abstract":"<p><p>The two main approaches in the study of breast cancer are histopathology (analyzing visual characteristics of tumors) and genomics. While both histopathology and genomics are fundamental to cancer research, the connections between these fields have been relatively superficial. We bridge this gap by investigating the Carolina Breast Cancer Study through the development of an integrative, exploratory analysis framework. Our analysis gives insights - some known, some novel - that are engaging to both pathologists and geneticists. Our analysis framework is based on Angle-based Joint and Individual Variation Explained (AJIVE) for statistical data integration and exploits Convolutional Neural Networks (CNNs) as a powerful, automatic method for image feature extraction. CNNs raise interpretability issues that we address by developing novel methods to explore visual modes of variation captured by statistical algorithms (e.g. PCA or AJIVE) applied to CNN features.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"15 4","pages":"1697-1722"},"PeriodicalIF":1.3,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9007558/pdf/nihms-1780328.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10147676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ASSESSING SELECTION BIAS IN REGRESSION COEFFICIENTS ESTIMATED FROM NONPROBABILITY SAMPLES WITH APPLICATIONS TO GENETICS AND DEMOGRAPHIC SURVEYS. 评估从非概率样本估计的回归系数中的选择偏差,并应用于遗传学和人口统计学调查。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2021-09-01 DOI: 10.1214/21-aoas1453
Brady T West, Roderick J Little, Rebecca R Andridge, Philip S Boonstra, Erin B Ware, Anita Pandit, Fernanda Alvarado-Leiton

Selection bias is a serious potential problem for inference about relationships of scientific interest based on samples without well-defined probability sampling mechanisms. Motivated by the potential for selection bias in: (a) estimated relationships of polygenic scores (PGSs) with phenotypes in genetic studies of volunteers and (b) estimated differences in subgroup means in surveys of smartphone users, we derive novel measures of selection bias for estimates of the coefficients in linear and probit regression models fitted to nonprobability samples, when aggregate-level auxiliary data are available for the selected sample and the target population. The measures arise from normal pattern-mixture models that allow analysts to examine the sensitivity of their inferences to assumptions about nonignorable selection in these samples. We examine the effectiveness of the proposed measures in a simulation study and then use them to quantify the selection bias in: (a) estimated PGS-phenotype relationships in a large study of volunteers recruited via Facebook and (b) estimated subgroup differences in mean past-year employment duration in a nonprobability sample of low-educated smartphone users. We evaluate the performance of the measures in these applications using benchmark estimates from large probability samples.

选择偏差是基于没有明确定义的概率抽样机制的样本来推断科学兴趣关系的一个严重的潜在问题。考虑到以下方面可能存在的选择偏差:(a)志愿者遗传研究中多基因得分(pgs)与表型的估计关系,以及(b)智能手机用户调查中亚组均值的估计差异,我们推导出了新的选择偏差测量方法,用于拟合非概率样本的线性和概率回归模型的系数估计,当所选样本和目标人群可获得总体水平的辅助数据时。这些措施来自正常的模式混合模型,允许分析人员检查他们的推断对这些样本中不可忽略的选择的假设的敏感性。我们在模拟研究中检验了所提出措施的有效性,然后使用它们来量化选择偏差:(a)在通过Facebook招募的志愿者的大型研究中估计pgs表型关系;(b)在低教育程度智能手机用户的非概率样本中估计过去一年平均就业时间的亚组差异。我们使用来自大概率样本的基准估计来评估这些应用程序中度量的性能。
{"title":"ASSESSING SELECTION BIAS IN REGRESSION COEFFICIENTS ESTIMATED FROM NONPROBABILITY SAMPLES WITH APPLICATIONS TO GENETICS AND DEMOGRAPHIC SURVEYS.","authors":"Brady T West,&nbsp;Roderick J Little,&nbsp;Rebecca R Andridge,&nbsp;Philip S Boonstra,&nbsp;Erin B Ware,&nbsp;Anita Pandit,&nbsp;Fernanda Alvarado-Leiton","doi":"10.1214/21-aoas1453","DOIUrl":"https://doi.org/10.1214/21-aoas1453","url":null,"abstract":"<p><p>Selection bias is a serious potential problem for inference about relationships of scientific interest based on samples without well-defined probability sampling mechanisms. Motivated by the potential for selection bias in: (a) estimated relationships of polygenic scores (PGSs) with phenotypes in genetic studies of volunteers and (b) estimated differences in subgroup means in surveys of smartphone users, we derive novel measures of selection bias for estimates of the coefficients in linear and probit regression models fitted to nonprobability samples, when aggregate-level auxiliary data are available for the selected sample and the target population. The measures arise from normal pattern-mixture models that allow analysts to examine the sensitivity of their inferences to assumptions about nonignorable selection in these samples. We examine the effectiveness of the proposed measures in a simulation study and then use them to quantify the selection bias in: (a) estimated PGS-phenotype relationships in a large study of volunteers recruited via Facebook and (b) estimated subgroup differences in mean past-year employment duration in a nonprobability sample of low-educated smartphone users. We evaluate the performance of the measures in these applications using benchmark estimates from large probability samples.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"15 3","pages":"1556-1581"},"PeriodicalIF":1.8,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8887878/pdf/nihms-1773953.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10686307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A COMPOSITIONAL MODEL TO ASSESS EXPRESSION CHANGES FROM SINGLE-CELL RNA-SEQ DATA. 从单细胞rna-seq数据中评估表达变化的组成模型。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2021-06-01 DOI: 10.1214/20-aoas1423
Xiuyu Ma, Keegan Korthauer, Christina Kendziorski, Michael A Newton

On the problem of scoring genes for evidence of changes in the distribution of single-cell expression, we introduce an empirical Bayesian mixture approach and evaluate its operating characteristics in a range of numerical experiments. The proposed approach leverages cell-subtype structure revealed in cluster analysis in order to boost gene-level information on expression changes. Cell clustering informs gene-level analysis through a specially-constructed prior distribution over pairs of multinomial probability vectors; this prior meshes with available model-based tools that score patterns of differential expression over multiple subtypes. We derive an explicit formula for the posterior probability that a gene has the same distribution in two cellular conditions, allowing for a gene-specific mixture over subtypes in each condition. Advantage is gained by the compositional structure of the model not only in which a host of gene-specific mixture components are allowed but also in which the mixing proportions are constrained at the whole cell level. This structure leads to a novel form of information sharing through which the cell-clustering results support gene-level scoring of differential distribution. The result, according to our numerical experiments, is improved sensitivity compared to several standard approaches for detecting distributional expression changes.

在对单细胞表达分布变化的证据进行基因评分的问题上,我们引入了经验贝叶斯混合方法,并在一系列数值实验中评估了其操作特性。该方法利用聚类分析中揭示的细胞亚型结构,以提高表达变化的基因水平信息。细胞聚类通过对多项概率向量的特殊构造的先验分布通知基因水平分析;这种先验与可用的基于模型的工具相匹配,这些工具对多个亚型的差异表达模式进行评分。我们推导了一个明确的后验概率公式,即基因在两种细胞条件下具有相同的分布,允许在每种条件下的基因特异性混合在亚型上。该模型的优势在于其组成结构不仅允许大量的基因特异性混合成分,而且混合比例在整个细胞水平上受到限制。这种结构导致了一种新的信息共享形式,通过这种形式,细胞聚类结果支持差异分布的基因水平评分。结果,根据我们的数值实验,与检测分布表达变化的几种标准方法相比,灵敏度有所提高。
{"title":"A COMPOSITIONAL MODEL TO ASSESS EXPRESSION CHANGES FROM SINGLE-CELL RNA-SEQ DATA.","authors":"Xiuyu Ma,&nbsp;Keegan Korthauer,&nbsp;Christina Kendziorski,&nbsp;Michael A Newton","doi":"10.1214/20-aoas1423","DOIUrl":"https://doi.org/10.1214/20-aoas1423","url":null,"abstract":"<p><p>On the problem of scoring genes for evidence of changes in the distribution of single-cell expression, we introduce an empirical Bayesian mixture approach and evaluate its operating characteristics in a range of numerical experiments. The proposed approach leverages cell-subtype structure revealed in cluster analysis in order to boost gene-level information on expression changes. Cell clustering informs gene-level analysis through a specially-constructed prior distribution over pairs of multinomial probability vectors; this prior meshes with available model-based tools that score patterns of differential expression over multiple subtypes. We derive an explicit formula for the posterior probability that a gene has the same distribution in two cellular conditions, allowing for a gene-specific mixture over subtypes in each condition. Advantage is gained by the compositional structure of the model not only in which a host of gene-specific mixture components are allowed but also in which the mixing proportions are constrained at the whole cell level. This structure leads to a novel form of information sharing through which the cell-clustering results support gene-level scoring of differential distribution. The result, according to our numerical experiments, is improved sensitivity compared to several standard approaches for detecting distributional expression changes.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"15 2","pages":"880-901"},"PeriodicalIF":1.8,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10275512/pdf/nihms-1901161.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9762402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MODEL-BASED FEATURE SELECTION AND CLUSTERING OF RNA-SEQ DATA FOR UNSUPERVISED SUBTYPE DISCOVERY. 基于模型的特征选择和 rna-seq 数据聚类,用于无监督亚型发现。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2021-03-01 Epub Date: 2021-03-18 DOI: 10.1214/20-aoas1407
David K Lim, Naim U Rashid, Joseph G Ibrahim

Clustering is a form of unsupervised learning that aims to uncover latent groups within data based on similarity across a set of features. A common application of this in biomedical research is in delineating novel cancer subtypes from patient gene expression data, given a set of informative genes. However, it is typically unknown a priori what genes may be informative in discriminating between clusters, and what the optimal number of clusters are. Few methods exist for performing unsupervised clustering of RNA-seq samples, and none currently adjust for between-sample global normalization factors, select cluster-discriminatory genes, or account for potential confounding variables during clustering. To address these issues, we propose the Feature Selection and Clustering of RNA-seq (FSCseq): a model-based clustering algorithm that utilizes a finite mixture of regression (FMR) model and the quadratic penalty method with a Smoothly-Clipped Absolute Deviation (SCAD) penalty. The maximization is done by a penalized Classification EM algorithm, allowing us to include normalization factors and confounders in our modeling framework. Given the fitted model, our framework allows for subtype prediction in new patients via posterior probabilities of cluster membership, even in the presence of batch effects. Based on simulations and real data analysis, we show the advantages of our method relative to competing approaches.

聚类是一种无监督学习,旨在根据一组特征的相似性发现数据中的潜在群体。这种方法在生物医学研究中的一个常见应用是,在给定一组信息基因的情况下,从病人的基因表达数据中划分出新的癌症亚型。然而,人们通常不知道哪些基因在区分群组时可能具有参考价值,也不知道最佳群组数目是多少。对 RNA-seq 样本进行无监督聚类的方法寥寥无几,目前没有一种方法能调整样本间的全局归一化因子、选择聚类区分基因或在聚类过程中考虑潜在的混杂变量。为了解决这些问题,我们提出了 RNA-seq 特征选择和聚类(FSCseq):一种基于模型的聚类算法,它利用有限混合回归(FMR)模型和带有平滑绝对偏差(SCAD)惩罚的二次惩罚法。最大化是通过受惩罚的分类 EM 算法完成的,这样我们就可以在建模框架中加入归一化因素和混杂因素。有了拟合模型,即使存在批次效应,我们的框架也能通过群组成员的后验概率对新患者进行亚型预测。基于模拟和真实数据分析,我们展示了我们的方法相对于其他方法的优势。
{"title":"MODEL-BASED FEATURE SELECTION AND CLUSTERING OF RNA-SEQ DATA FOR UNSUPERVISED SUBTYPE DISCOVERY.","authors":"David K Lim, Naim U Rashid, Joseph G Ibrahim","doi":"10.1214/20-aoas1407","DOIUrl":"10.1214/20-aoas1407","url":null,"abstract":"<p><p>Clustering is a form of unsupervised learning that aims to uncover latent groups within data based on similarity across a set of features. A common application of this in biomedical research is in delineating novel cancer subtypes from patient gene expression data, given a set of informative genes. However, it is typically unknown <i>a priori</i> what genes may be informative in discriminating between clusters, and what the optimal number of clusters are. Few methods exist for performing unsupervised clustering of RNA-seq samples, and none currently adjust for between-sample global normalization factors, select cluster-discriminatory genes, or account for potential confounding variables during clustering. To address these issues, we propose the Feature Selection and Clustering of RNA-seq (FSCseq): a model-based clustering algorithm that utilizes a finite mixture of regression (FMR) model and the quadratic penalty method with a Smoothly-Clipped Absolute Deviation (SCAD) penalty. The maximization is done by a penalized Classification EM algorithm, allowing us to include normalization factors and confounders in our modeling framework. Given the fitted model, our framework allows for subtype prediction in new patients via posterior probabilities of cluster membership, even in the presence of batch effects. Based on simulations and real data analysis, we show the advantages of our method relative to competing approaches.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"15 1","pages":"481-508"},"PeriodicalIF":1.8,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8386505/pdf/nihms-1716637.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9546884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
INTEGRATIVE STATISTICAL METHODS FOR EXPOSURE MIXTURES AND HEALTH. 暴露混合物与健康的综合统计方法。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2020-12-01 DOI: 10.1214/20-AOAS1364
Brian J Reich, Yawen Guan, Denis Fourches, Joshua L Warren, Stefanie E Sarnat, Howard H Chang

Humans are concurrently exposed to chemically, structurally and toxicologically diverse chemicals. A critical challenge for environmental epidemiology is to quantify the risk of adverse health outcomes resulting from exposures to such chemical mixtures and to identify which mixture constituents may be driving etiologic associations. A variety of statistical methods have been proposed to address these critical research questions. However, they generally rely solely on measured exposure and health data available within a specific study. Advancements in understanding of the role of mixtures on human health impacts may be better achieved through the utilization of external data and knowledge from multiple disciplines with innovative statistical tools. In this paper we develop new methods for health analyses that incorporate auxiliary information about the chemicals in a mixture, such as physicochemical, structural and/or toxicological data. We expect that the constituents identified using auxiliary information will be more biologically meaningful than those identified by methods that solely utilize observed correlations between measured exposure. We develop flexible Bayesian models by specifying prior distributions for the exposures and their effects that include auxiliary information and examine this idea over a spectrum of analyses from regression to factor analysis. The methods are applied to study the effects of volatile organic compounds on emergency room visits in Atlanta. We find that including cheminformatic information about the exposure variables improves prediction and provides a more interpretable model for emergency room visits for respiratory diseases.

人类同时暴露在化学、结构和毒性不同的化学品中。环境流行病学面临的一项关键挑战是量化暴露于这类化学混合物造成的不良健康后果的风险,并确定哪些混合物成分可能导致病因学关联。已经提出了各种统计方法来解决这些关键的研究问题。然而,它们通常仅依赖于特定研究中可获得的测量暴露和健康数据。通过利用来自多个学科的外部数据和知识以及创新的统计工具,可以更好地增进对混合物对人类健康影响的作用的了解。在本文中,我们开发了新的健康分析方法,这些方法结合了混合物中化学物质的辅助信息,如物理化学、结构和/或毒理学数据。我们期望使用辅助信息识别的成分将比仅利用测量暴露之间观察到的相关性的方法识别的成分更具生物学意义。我们通过指定暴露及其影响的先验分布(包括辅助信息)来开发灵活的贝叶斯模型,并通过从回归分析到因子分析的一系列分析来检验这一想法。应用这些方法研究了挥发性有机化合物对亚特兰大急诊室就诊的影响。我们发现,包括暴露变量的化学信息可以改善预测,并为呼吸系统疾病的急诊室就诊提供更可解释的模型。
{"title":"INTEGRATIVE STATISTICAL METHODS FOR EXPOSURE MIXTURES AND HEALTH.","authors":"Brian J Reich,&nbsp;Yawen Guan,&nbsp;Denis Fourches,&nbsp;Joshua L Warren,&nbsp;Stefanie E Sarnat,&nbsp;Howard H Chang","doi":"10.1214/20-AOAS1364","DOIUrl":"https://doi.org/10.1214/20-AOAS1364","url":null,"abstract":"<p><p>Humans are concurrently exposed to chemically, structurally and toxicologically diverse chemicals. A critical challenge for environmental epidemiology is to quantify the risk of adverse health outcomes resulting from exposures to such chemical mixtures and to identify which mixture constituents may be driving etiologic associations. A variety of statistical methods have been proposed to address these critical research questions. However, they generally rely solely on measured exposure and health data available within a specific study. Advancements in understanding of the role of mixtures on human health impacts may be better achieved through the utilization of external data and knowledge from multiple disciplines with innovative statistical tools. In this paper we develop new methods for health analyses that incorporate auxiliary information about the chemicals in a mixture, such as physicochemical, structural and/or toxicological data. We expect that the constituents identified using auxiliary information will be more biologically meaningful than those identified by methods that solely utilize observed correlations between measured exposure. We develop flexible Bayesian models by specifying prior distributions for the exposures and their effects that include auxiliary information and examine this idea over a spectrum of analyses from regression to factor analysis. The methods are applied to study the effects of volatile organic compounds on emergency room visits in Atlanta. We find that including cheminformatic information about the exposure variables improves prediction and provides a more interpretable model for emergency room visits for respiratory diseases.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"14 4","pages":"1945-1963"},"PeriodicalIF":1.8,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8914338/pdf/nihms-1780774.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10265042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
LOG-CONTRAST REGRESSION WITH FUNCTIONAL COMPOSITIONAL PREDICTORS: LINKING PRETERM INFANT'S GUT MICROBIOME TRAJECTORIES TO NEUROBEHAVIORAL OUTCOME. 功能成分预测因子的对数对比回归:将早产儿肠道微生物组轨迹与神经行为结果联系起来。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2020-09-01 Epub Date: 2020-09-18 DOI: 10.1214/20-aoas1357
Zhe Sun, Wanli Xu, Xiaomei Cong, Gen Li, Kun Chen

The neonatal intensive care unit (NICU) experience is known to be one of the most crucial factors that drive preterm infant's neurodevelopmental and health outcome. It is hypothesized that stressful early life experience of very preterm neonate is imprinting gut microbiome by the regulation of the so-called brain-gut axis, and consequently, certain microbiome markers are predictive of later infant neurodevelopment. To investigate, a preterm infant study was conducted; infant fecal samples were collected during the infants' first month of postnatal age, resulting in functional compositional microbiome data, and neurobehavioral outcomes were measured when infants reached 36-38 weeks of post-menstrual age. To identify potential microbiome markers and estimate how the trajectories of gut microbiome compositions during early postnatal stage impact later neurobehavioral outcomes of the preterm infants, we innovate a sparse log-contrast regression with functional compositional predictors. The functional simplex structure is strictly preserved, and the functional compositional predictors are allowed to have sparse, smoothly varying, and accumulating effects on the outcome through time. Through a pragmatic basis expansion step, the problem boils down to a linearly constrained sparse group regression, for which we develop an efficient algorithm and obtain theoretical performance guarantees. Our approach yields insightful results in the preterm infant study. The identified microbiome markers and the estimated time dynamics of their impact on the neurobehavioral outcome shed lights on the linkage between stress accumulation in early postnatal stage and neurodevelpomental process of infants.

新生儿重症监护病房(NICU)的经验被认为是驱动早产儿神经发育和健康结果的最关键因素之一。据推测,极早产新生儿的早期压力生活经历通过所谓的脑-肠轴的调节来印记肠道微生物组,因此,某些微生物组标记可以预测婴儿后来的神经发育。为了调查,进行了一项早产儿研究;在婴儿出生后第一个月收集婴儿粪便样本,获得功能组成微生物组数据,并在婴儿达到月经后36-38周时测量神经行为结果。为了确定潜在的微生物组标记物,并估计出生后早期肠道微生物组组成的轨迹如何影响早产儿后来的神经行为结果,我们创新了一种带有功能组成预测因子的稀疏对数对比回归。功能单纯形结构被严格保留,功能组成预测因子随时间的推移对结果具有稀疏、平滑变化和累积的影响。通过一个实用的基展开步骤,将问题归结为一个线性约束稀疏群回归,并为此开发了一种高效的算法,获得了理论上的性能保证。我们的方法在早产儿研究中产生了深刻的结果。鉴定的微生物组标记物及其对神经行为结果影响的估计时间动态,揭示了出生后早期应激积累与婴儿神经发育过程之间的联系。
{"title":"LOG-CONTRAST REGRESSION WITH FUNCTIONAL COMPOSITIONAL PREDICTORS: LINKING PRETERM INFANT'S GUT MICROBIOME TRAJECTORIES TO NEUROBEHAVIORAL OUTCOME.","authors":"Zhe Sun,&nbsp;Wanli Xu,&nbsp;Xiaomei Cong,&nbsp;Gen Li,&nbsp;Kun Chen","doi":"10.1214/20-aoas1357","DOIUrl":"https://doi.org/10.1214/20-aoas1357","url":null,"abstract":"<p><p>The neonatal intensive care unit (NICU) experience is known to be one of the most crucial factors that drive preterm infant's neurodevelopmental and health outcome. It is hypothesized that stressful early life experience of very preterm neonate is imprinting gut microbiome by the regulation of the so-called brain-gut axis, and consequently, certain microbiome markers are predictive of later infant neurodevelopment. To investigate, a preterm infant study was conducted; infant fecal samples were collected during the infants' first month of postnatal age, resulting in functional compositional microbiome data, and neurobehavioral outcomes were measured when infants reached 36-38 weeks of post-menstrual age. To identify potential microbiome markers and estimate how the trajectories of gut microbiome compositions during early postnatal stage impact later neurobehavioral outcomes of the preterm infants, we innovate a sparse log-contrast regression with functional compositional predictors. The functional simplex structure is strictly preserved, and the functional compositional predictors are allowed to have sparse, smoothly varying, and accumulating effects on the outcome through time. Through a pragmatic basis expansion step, the problem boils down to a linearly constrained sparse group regression, for which we develop an efficient algorithm and obtain theoretical performance guarantees. Our approach yields insightful results in the preterm infant study. The identified microbiome markers and the estimated time dynamics of their impact on the neurobehavioral outcome shed lights on the linkage between stress accumulation in early postnatal stage and neurodevelpomental process of infants.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"14 3","pages":"1535-1556"},"PeriodicalIF":1.8,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8218926/pdf/nihms-1601428.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39100587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Inferring a consensus problem list using penalized multistage models for ordered data. 利用有序数据的惩罚性多阶段模型推断共识问题列表。
IF 1.8 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2020-09-01 Epub Date: 2020-09-18 DOI: 10.1214/20-aoas1361
Philip S Boonstra, John C Krauss

A patient's medical problem list describes his or her current health status and aids in the coordination and transfer of care between providers. Because a problem list is generated once and then subsequently modified or updated, what is not usually observable is the provider-effect. That is, to what extent does a patient's problem in the electronic medical record actually reflect a consensus communication of that patient's current health status? To that end, we report on and analyze a unique interview-based design in which multiple medical providers independently generate problem lists for each of three patient case abstracts of varying clinical difficulty. Due to the uniqueness of both our data and the scientific objectives of our analysis, we apply and extend so-called multistage models for ordered lists and equip the models with variable selection penalties to induce sparsity. Each problem has a corresponding non-negative parameter estimate, interpreted as a relative log-odds ratio, with larger values suggesting greater importance and zero values suggesting unimportant problems. We use these fitted penalized models to quantify and report the extent of consensus. We conduct a simulation study to evaluate the performance of our methodology and then analyze the motivating problem list data. For the three case abstracts, the proportions of problems with model-estimated non-zero log-odds ratios were 10/28, 16/47, and 13/30. Physicians exhibited consensus on the highest ranked problems in the first and last case abstracts but agreement quickly deteriorated; in contrast, physicians broadly disagreed on the relevant problems for the middle - and most difficult - case abstract.

病人的医疗问题清单描述了他或她目前的健康状况,有助于医疗服务提供者之间协调和转移医疗服务。由于问题清单是一次性生成的,随后会进行修改或更新,因此通常无法观察到医疗服务提供者的效果。也就是说,电子病历中患者的问题在多大程度上反映了该患者当前健康状况的共识?为此,我们报告并分析了一种基于访谈的独特设计,在该设计中,多个医疗服务提供者分别独立地为三个临床难度不同的病例摘要生成问题清单。由于数据的独特性和分析的科学目标,我们应用并扩展了有序列表的所谓多阶段模型,并为模型配备了变量选择惩罚以诱导稀疏性。每个问题都有一个相应的非负参数估计值,它被解释为一个相对对数比率,数值越大表示问题越重要,数值为零则表示问题不重要。我们使用这些拟合的惩罚模型来量化和报告共识的程度。我们进行了一项模拟研究来评估我们方法的性能,然后分析了激励问题列表数据。在三个病例摘要中,经模型估计对数比率不为零的问题比例分别为 10/28、16/47 和 13/30。在第一个和最后一个病例摘要中,医生们对排名最高的问题达成了共识,但很快就出现了分歧;相比之下,医生们对中间--也是最难的--病例摘要中的相关问题存在广泛分歧。
{"title":"Inferring a consensus problem list using penalized multistage models for ordered data.","authors":"Philip S Boonstra, John C Krauss","doi":"10.1214/20-aoas1361","DOIUrl":"10.1214/20-aoas1361","url":null,"abstract":"<p><p>A patient's medical problem list describes his or her current health status and aids in the coordination and transfer of care between providers. Because a problem list is generated once and then subsequently modified or updated, what is not usually observable is the provider-effect. That is, to what extent does a patient's problem in the electronic medical record actually reflect a consensus communication of that patient's current health status? To that end, we report on and analyze a unique interview-based design in which multiple medical providers independently generate problem lists for each of three patient case abstracts of varying clinical difficulty. Due to the uniqueness of both our data and the scientific objectives of our analysis, we apply and extend so-called multistage models for ordered lists and equip the models with variable selection penalties to induce sparsity. Each problem has a corresponding non-negative parameter estimate, interpreted as a relative log-odds ratio, with larger values suggesting greater importance and zero values suggesting unimportant problems. We use these fitted penalized models to quantify and report the extent of consensus. We conduct a simulation study to evaluate the performance of our methodology and then analyze the motivating problem list data. For the three case abstracts, the proportions of problems with model-estimated non-zero log-odds ratios were 10/28, 16/47, and 13/30. Physicians exhibited consensus on the highest ranked problems in the first and last case abstracts but agreement quickly deteriorated; in contrast, physicians broadly disagreed on the relevant problems for the middle - and most difficult - case abstract.</p>","PeriodicalId":50772,"journal":{"name":"Annals of Applied Statistics","volume":"14 3","pages":"1557-1580"},"PeriodicalIF":1.8,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345315/pdf/nihms-1696242.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39291448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annals of Applied Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1