首页 > 最新文献

Wiley Interdisciplinary Reviews-Computational Statistics最新文献

英文 中文
SAREV: A review on statistical analytics of single-cell RNA sequencing data. SAREV:单细胞RNA测序数据统计分析综述
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-07-01 Epub Date: 2021-05-20 DOI: 10.1002/wics.1558
Dorothy Ellis, Dongyuan Wu, Susmita Datta

Due to the development of next-generation RNA sequencing (NGS) technologies, there has been tremendous progress in research involving determining the role of genomics, transcriptomics and epigenomics in complex biological systems. However, scientists have realized that information obtained using earlier technology, frequently called 'bulk RNA-seq' data, provides information averaged across all the cells present in a tissue. Relatively newly developed single cell (scRNA-seq) technology allows us to provide transcriptomic information at a single-cell resolution. Nevertheless, these high-resolution data have their own complex natures and demand novel statistical data analysis methods to provide effective and highly accurate results on complex biological systems. In this review, we cover many such recently developed statistical methods for researchers wanting to pursue scRNA-seq statistical and computational research as well as scientific research about these existing methods and free software tools available for their generated data. This review is certainly not exhaustive due to page limitations. We have tried to cover the popular methods starting from quality control to the downstream analysis of finding differentially expressed genes and concluding with a brief description of network analysis.

由于下一代RNA测序(NGS)技术的发展,在确定基因组学、转录组学和表观基因组学在复杂生物系统中的作用方面的研究取得了巨大进展。然而,科学家们已经意识到,使用早期技术获得的信息,通常被称为“批量RNA-seq”数据,提供了组织中所有细胞的平均信息。相对较新开发的单细胞(scRNA-seq)技术使我们能够以单细胞分辨率提供转录组信息。然而,这些高分辨率数据具有其自身的复杂性,需要新的统计数据分析方法来对复杂的生物系统提供有效和高度准确的结果。在这篇综述中,我们介绍了许多最近开发的统计方法,供希望进行scRNA-seq统计和计算研究的研究人员使用,以及对这些现有方法和可用于生成数据的免费软件工具的科学研究。由于篇幅限制,这篇综述肯定不是详尽无遗的。我们试图涵盖从质量控制到寻找差异表达基因的下游分析的流行方法,最后简要描述网络分析。
{"title":"SAREV: A review on statistical analytics of single-cell RNA sequencing data.","authors":"Dorothy Ellis,&nbsp;Dongyuan Wu,&nbsp;Susmita Datta","doi":"10.1002/wics.1558","DOIUrl":"10.1002/wics.1558","url":null,"abstract":"<p><p>Due to the development of next-generation RNA sequencing (NGS) technologies, there has been tremendous progress in research involving determining the role of genomics, transcriptomics and epigenomics in complex biological systems. However, scientists have realized that information obtained using earlier technology, frequently called 'bulk RNA-seq' data, provides information averaged across all the cells present in a tissue. Relatively newly developed single cell (scRNA-seq) technology allows us to provide transcriptomic information at a single-cell resolution. Nevertheless, these high-resolution data have their own complex natures and demand novel statistical data analysis methods to provide effective and highly accurate results on complex biological systems. In this review, we cover many such recently developed statistical methods for researchers wanting to pursue scRNA-seq statistical and computational research as well as scientific research about these existing methods and free software tools available for their generated data. This review is certainly not exhaustive due to page limitations. We have tried to cover the popular methods starting from quality control to the downstream analysis of finding differentially expressed genes and concluding with a brief description of network analysis.</p>","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1558","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9729203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
On the safe use of prior densities for Bayesian model selection 贝叶斯模型选择中先验密度的安全使用
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-06-10 DOI: 10.1002/wics.1595
F. Llorente, Luca Martino, E. Curbelo, J. Lopez-Santiago, D. Delgado
The application of Bayesian inference for the purpose of model selection is very popular nowadays. In this framework, models are compared through their marginal likelihoods, or their quotients, called Bayes factors. However, marginal likelihoods depend on the prior choice. For model selection, even diffuse priors can be actually very informative, unlike for the parameter estimation problem. Furthermore, when the prior is improper, the marginal likelihood of the corresponding model is undetermined. In this work, we discuss the issue of prior sensitivity of the marginal likelihood and its role in model selection. We also comment on the use of uninformative priors, which are very common choices in practice. Several practical suggestions are discussed and many possible solutions, proposed in the literature, to design objective priors for model selection are described. Some of them also allow the use of improper priors. The connection between the marginal likelihood approach and the well‐known information criteria is also presented. We describe the main issues and possible solutions by illustrative numerical examples, providing also some related code. One of them involving a real‐world application on exoplanet detection.
贝叶斯推理在模型选择中的应用在当今非常流行。在这个框架中,模型通过它们的边际可能性或商进行比较,称为贝叶斯因子。然而,边际可能性取决于先前的选择。对于模型选择,与参数估计问题不同,即使是扩散先验实际上也可以提供非常丰富的信息。此外,当先验不合适时,相应模型的边际似然是不确定的。在这项工作中,我们讨论了边际似然的先验敏感性问题及其在模型选择中的作用。我们还评论了无信息先验的使用,这是实践中非常常见的选择。讨论了一些实用的建议,并描述了文献中提出的许多可能的解决方案,以设计用于模型选择的目标先验。其中一些还允许使用不适当的先验。还介绍了边际似然法和众所周知的信息准则之间的联系。我们通过举例说明了主要问题和可能的解决方案,并提供了一些相关的代码。其中一项涉及到系外行星探测的真实应用。
{"title":"On the safe use of prior densities for Bayesian model selection","authors":"F. Llorente, Luca Martino, E. Curbelo, J. Lopez-Santiago, D. Delgado","doi":"10.1002/wics.1595","DOIUrl":"https://doi.org/10.1002/wics.1595","url":null,"abstract":"The application of Bayesian inference for the purpose of model selection is very popular nowadays. In this framework, models are compared through their marginal likelihoods, or their quotients, called Bayes factors. However, marginal likelihoods depend on the prior choice. For model selection, even diffuse priors can be actually very informative, unlike for the parameter estimation problem. Furthermore, when the prior is improper, the marginal likelihood of the corresponding model is undetermined. In this work, we discuss the issue of prior sensitivity of the marginal likelihood and its role in model selection. We also comment on the use of uninformative priors, which are very common choices in practice. Several practical suggestions are discussed and many possible solutions, proposed in the literature, to design objective priors for model selection are described. Some of them also allow the use of improper priors. The connection between the marginal likelihood approach and the well‐known information criteria is also presented. We describe the main issues and possible solutions by illustrative numerical examples, providing also some related code. One of them involving a real‐world application on exoplanet detection.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44402673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Document clustering 文档聚类
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-06-08 DOI: 10.1002/wics.1588
Irene Cozzolino, M. Ferraro
Nowadays, the explosive growth in text data emphasizes the need for developing new and computationally efficient methods and credible theoretical support tailored for analyzing such large‐scale data. Given the vast amount of this kind of unstructured data, the majority of it is not classified, hence unsupervised learning techniques show to be useful in this field. Document clustering has proven to be an efficient tool in organizing textual documents and it has been widely applied in different areas from information retrieval to topic modeling. Before introducing the proposals of document clustering algorithms, the principal steps of the whole process, including the mathematical representation of documents and the preprocessing phase, are discussed. Then, the main clustering algorithms used for text data are critically analyzed, considering prototype‐based, graph‐based, hierarchical, and model‐based approaches.
如今,文本数据的爆炸性增长强调了开发新的、计算高效的方法以及为分析此类大规模数据量身定制的可信理论支持的必要性。鉴于这类非结构化数据数量巨大,其中大多数都没有分类,因此无监督学习技术在该领域显示出了有用性。文档聚类已被证明是组织文本文档的有效工具,它已被广泛应用于从信息检索到主题建模的各个领域。在介绍文档聚类算法的建议之前,讨论了整个过程的主要步骤,包括文档的数学表示和预处理阶段。然后,考虑到基于原型、基于图、分层和基于模型的方法,对用于文本数据的主要聚类算法进行了批判性分析。
{"title":"Document clustering","authors":"Irene Cozzolino, M. Ferraro","doi":"10.1002/wics.1588","DOIUrl":"https://doi.org/10.1002/wics.1588","url":null,"abstract":"Nowadays, the explosive growth in text data emphasizes the need for developing new and computationally efficient methods and credible theoretical support tailored for analyzing such large‐scale data. Given the vast amount of this kind of unstructured data, the majority of it is not classified, hence unsupervised learning techniques show to be useful in this field. Document clustering has proven to be an efficient tool in organizing textual documents and it has been widely applied in different areas from information retrieval to topic modeling. Before introducing the proposals of document clustering algorithms, the principal steps of the whole process, including the mathematical representation of documents and the preprocessing phase, are discussed. Then, the main clustering algorithms used for text data are critically analyzed, considering prototype‐based, graph‐based, hierarchical, and model‐based approaches.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48927700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of normalization and differential abundance methods for microbiome counts data 微生物组计数数据的归一化和差分丰度方法综述
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-05-18 DOI: 10.1002/wics.1586
Dionne Swift, Kellen Cresswell, Robert Johnson, Spiro C. Stilianoudakis, Xingtao Wei
The recent development of cost‐effective high‐throughput DNA sequencing technologies has tremendously increased microbiome research. However, it has been well documented that the observed microbiome data suffers from compositionality, sparsity, and high variability. All of which pose serious challenges when analyzing microbiome data. Over the last decade, there has been considerable amount of interest into statistical and computational methods to tackle these challenges. The choice of inference aids in the selection of the appropriate statistical methods since only a few methods allow inferences for absolute abundance while most methods allow inferences for relative abundances. An overview of recent methods for differential abundance analysis and normalization of microbiome data is presented, focusing on methods that are accessible but have not been widely covered in previous literature. In detailed descriptions of each method, we discuss assumptions and if and how these methods address the challenges of microbiome data. These methods are compared based on accuracy metrics in real and simulated settings. The goal is to provide a comprehensive but non‐exhaustive set of potential and easily‐accessible tools for differential abundance and normalization of microbiome data.
近年来,低成本高通量DNA测序技术的发展极大地促进了微生物组的研究。然而,已经有充分的证据表明,观察到的微生物组数据存在组合性、稀疏性和高变异性。所有这些都给分析微生物组数据带来了严峻的挑战。在过去的十年中,人们对解决这些挑战的统计和计算方法产生了相当大的兴趣。推理的选择有助于选择适当的统计方法,因为只有少数方法允许对绝对丰度进行推理,而大多数方法允许对相对丰度进行推理。概述了微生物组数据的差异丰度分析和规范化的最新方法,重点介绍了以前文献中尚未广泛覆盖的方法。在每种方法的详细描述中,我们讨论了假设,以及这些方法是否以及如何解决微生物组数据的挑战。在真实和模拟环境下,对这些方法的精度指标进行了比较。目标是为微生物组数据的差异丰度和规范化提供一套全面但非详尽的潜在和易于获取的工具。
{"title":"A review of normalization and differential abundance methods for microbiome counts data","authors":"Dionne Swift, Kellen Cresswell, Robert Johnson, Spiro C. Stilianoudakis, Xingtao Wei","doi":"10.1002/wics.1586","DOIUrl":"https://doi.org/10.1002/wics.1586","url":null,"abstract":"The recent development of cost‐effective high‐throughput DNA sequencing technologies has tremendously increased microbiome research. However, it has been well documented that the observed microbiome data suffers from compositionality, sparsity, and high variability. All of which pose serious challenges when analyzing microbiome data. Over the last decade, there has been considerable amount of interest into statistical and computational methods to tackle these challenges. The choice of inference aids in the selection of the appropriate statistical methods since only a few methods allow inferences for absolute abundance while most methods allow inferences for relative abundances. An overview of recent methods for differential abundance analysis and normalization of microbiome data is presented, focusing on methods that are accessible but have not been widely covered in previous literature. In detailed descriptions of each method, we discuss assumptions and if and how these methods address the challenges of microbiome data. These methods are compared based on accuracy metrics in real and simulated settings. The goal is to provide a comprehensive but non‐exhaustive set of potential and easily‐accessible tools for differential abundance and normalization of microbiome data.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45693764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Projection‐based techniques for high‐dimensional optimal transport problems 高维最优运输问题的基于投影的技术
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-05-13 DOI: 10.1002/wics.1587
Jingyi Zhang, Ping Ma, Wenxuan Zhong, Cheng Meng
Optimal transport (OT) methods seek a transformation map (or plan) between two probability measures, such that the transformation has the minimum transportation cost. Such a minimum transport cost, with a certain power transform, is called the Wasserstein distance. Recently, OT methods have drawn great attention in statistics, machine learning, and computer science, especially in deep generative neural networks. Despite its broad applications, the estimation of high‐dimensional Wasserstein distances is a well‐known challenging problem owing to the curse‐of‐dimensionality. There are some cutting‐edge projection‐based techniques that tackle high‐dimensional OT problems. Three major approaches of such techniques are introduced, respectively, the slicing approach, the iterative projection approach, and the projection robust OT approach. Open challenges are discussed at the end of the review.
最优运输(OT)方法寻求两个概率测度之间的转换图(或计划),使得转换具有最小的运输成本。在一定的功率变换下,这样的最小运输成本被称为Wasserstein距离。近年来,OT方法在统计学、机器学习和计算机科学中引起了极大的关注,尤其是在深度生成神经网络中。尽管应用广泛,但由于维数的诅咒,高维Wasserstein距离的估计是一个众所周知的具有挑战性的问题。有一些基于前沿投影的技术可以解决高维OT问题。介绍了这类技术的三种主要方法,分别是切片方法、迭代投影方法和投影鲁棒OT方法。审查结束时讨论了悬而未决的挑战。
{"title":"Projection‐based techniques for high‐dimensional optimal transport problems","authors":"Jingyi Zhang, Ping Ma, Wenxuan Zhong, Cheng Meng","doi":"10.1002/wics.1587","DOIUrl":"https://doi.org/10.1002/wics.1587","url":null,"abstract":"Optimal transport (OT) methods seek a transformation map (or plan) between two probability measures, such that the transformation has the minimum transportation cost. Such a minimum transport cost, with a certain power transform, is called the Wasserstein distance. Recently, OT methods have drawn great attention in statistics, machine learning, and computer science, especially in deep generative neural networks. Despite its broad applications, the estimation of high‐dimensional Wasserstein distances is a well‐known challenging problem owing to the curse‐of‐dimensionality. There are some cutting‐edge projection‐based techniques that tackle high‐dimensional OT problems. Three major approaches of such techniques are introduced, respectively, the slicing approach, the iterative projection approach, and the projection robust OT approach. Open challenges are discussed at the end of the review.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48984047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Issue Information 问题信息
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-05-01 DOI: 10.1002/wics.1562
{"title":"Issue Information","authors":"","doi":"10.1002/wics.1562","DOIUrl":"https://doi.org/10.1002/wics.1562","url":null,"abstract":"","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41831957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrative clustering methods for multi-omics data. 多组学数据的集成聚类方法。
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-05-01 DOI: 10.1002/wics.1553
Xiaoyu Zhang, Zhenwei Zhou, Hanfei Xu, Ching-Ti Liu

Integrative analysis of multi-omics data has drawn much attention from the scientific community due to the technological advancements which have generated various omics data. Leveraging these multi-omics data potentially provides a more comprehensive view of the disease mechanism or biological processes. Integrative multi-omics clustering is an unsupervised integrative method specifically used to find coherent groups of samples or features by utilizing information across multi-omics data. It aims to better stratify diseases and to suggest biological mechanisms and potential targeted therapies for the diseases. However, applying integrative multi-omics clustering is both statistically and computationally challenging due to various reasons such as high dimensionality and heterogeneity. In this review, we summarized integrative multi-omics clustering methods into three general categories: concatenated clustering, clustering of clusters, and interactive clustering based on when and how the multi-omics data are processed for clustering. We further classified the methods into different approaches under each category based on the main statistical strategy used during clustering. In addition, we have provided recommended practices tailored to four real-life scenarios to help researchers to strategize their selection in integrative multi-omics clustering methods for their future studies.

由于技术的进步产生了各种组学数据,多组学数据的综合分析受到了科学界的广泛关注。利用这些多组学数据有可能为疾病机制或生物学过程提供更全面的观点。整合多组学聚类是一种无监督的整合方法,专门用于利用跨多组学数据的信息找到连贯的样本组或特征。它旨在更好地对疾病进行分层,并提出疾病的生物学机制和潜在的靶向治疗方法。然而,由于高维度和异质性等原因,应用集成多组学聚类在统计和计算上都具有挑战性。本文根据多组学数据的聚类处理时间和方式,将多组学聚类方法分为串联聚类、聚类的聚类和交互聚类三大类。基于聚类过程中使用的主要统计策略,我们进一步将每种方法分类为不同的方法。此外,我们还提供了针对四种现实场景的推荐实践,以帮助研究人员在未来的研究中制定综合多组学聚类方法的选择策略。
{"title":"Integrative clustering methods for multi-omics data.","authors":"Xiaoyu Zhang,&nbsp;Zhenwei Zhou,&nbsp;Hanfei Xu,&nbsp;Ching-Ti Liu","doi":"10.1002/wics.1553","DOIUrl":"https://doi.org/10.1002/wics.1553","url":null,"abstract":"<p><p>Integrative analysis of multi-omics data has drawn much attention from the scientific community due to the technological advancements which have generated various omics data. Leveraging these multi-omics data potentially provides a more comprehensive view of the disease mechanism or biological processes. Integrative multi-omics clustering is an unsupervised integrative method specifically used to find coherent groups of samples or features by utilizing information across multi-omics data. It aims to better stratify diseases and to suggest biological mechanisms and potential targeted therapies for the diseases. However, applying integrative multi-omics clustering is both statistically and computationally challenging due to various reasons such as high dimensionality and heterogeneity. In this review, we summarized integrative multi-omics clustering methods into three general categories: <i>concatenated clustering</i>, <i>clustering of clusters</i>, and <i>interactive clustering</i> based on when and how the multi-omics data are processed for clustering. We further classified the methods into different approaches under each category based on the main statistical strategy used during clustering. In addition, we have provided recommended practices tailored to four real-life scenarios to help researchers to strategize their selection in integrative multi-omics clustering methods for their future studies.</p>","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/wics.1553","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9379724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Statistical inference for stochastic differential equations 随机微分方程的统计推断
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-04-27 DOI: 10.1002/wics.1585
P. Craigmile, Radu Herbei, Geoffrey Liu, Grant Schneider
Many scientific fields have experienced growth in the use of stochastic differential equations (SDEs), also known as diffusion processes, to model scientific phenomena over time. SDEs can simultaneously capture the known deterministic dynamics of underlying variables of interest (e.g., ocean flow, chemical and physical characteristics of a body of water, presence, absence, and spread of a disease), while enabling a modeler to capture the unknown random dynamics in a stochastic setting. We focus on reviewing a wide range of statistical inference methods for likelihood‐based frequentist and Bayesian parametric inference based on discretely‐sampled diffusions. Exact parametric inference is not usually possible because the transition density is not available in closed form. Thus, we review the literature on approximate numerical methods (e.g., Euler, Milstein, local linearization, and Aït‐Sahalia) and simulation‐based approaches (e.g., data augmentation and exact sampling) that are used to carry out parametric statistical inference on SDE processes. We close with a brief discussion of other methods of inference for SDEs and more complex SDE processes such as spatio‐temporal SDEs.
许多科学领域都在使用随机微分方程(SDEs),也称为扩散过程,随着时间的推移来模拟科学现象。SDEs可以同时捕捉潜在感兴趣变量的已知确定性动态(例如,洋流、水体的化学和物理特征、疾病的存在、不存在和传播),同时使建模者能够捕捉随机环境中未知的随机动态。我们重点回顾了基于似然的频率推断和基于离散采样扩散的贝叶斯参数推断的广泛的统计推断方法。精确的参数推断通常是不可能的,因为跃迁密度不能以封闭形式得到。因此,我们回顾了关于近似数值方法(例如,Euler, Milstein,局部线性化和Aït‐Sahalia)和基于模拟的方法(例如,数据增强和精确抽样)的文献,这些方法用于对SDE过程进行参数统计推断。最后,我们简要讨论了其他推断SDE和更复杂的SDE过程(如时空SDE)的方法。
{"title":"Statistical inference for stochastic differential equations","authors":"P. Craigmile, Radu Herbei, Geoffrey Liu, Grant Schneider","doi":"10.1002/wics.1585","DOIUrl":"https://doi.org/10.1002/wics.1585","url":null,"abstract":"Many scientific fields have experienced growth in the use of stochastic differential equations (SDEs), also known as diffusion processes, to model scientific phenomena over time. SDEs can simultaneously capture the known deterministic dynamics of underlying variables of interest (e.g., ocean flow, chemical and physical characteristics of a body of water, presence, absence, and spread of a disease), while enabling a modeler to capture the unknown random dynamics in a stochastic setting. We focus on reviewing a wide range of statistical inference methods for likelihood‐based frequentist and Bayesian parametric inference based on discretely‐sampled diffusions. Exact parametric inference is not usually possible because the transition density is not available in closed form. Thus, we review the literature on approximate numerical methods (e.g., Euler, Milstein, local linearization, and Aït‐Sahalia) and simulation‐based approaches (e.g., data augmentation and exact sampling) that are used to carry out parametric statistical inference on SDE processes. We close with a brief discussion of other methods of inference for SDEs and more complex SDE processes such as spatio‐temporal SDEs.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47620930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Function minimization and nonlinear least squares in R 函数最小化与R中的非线性最小二乘
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-03-24 DOI: 10.1002/wics.1580
J. Nash
This review will look at function minimization and nonlinear least squares, possibly bounds constrained, using R. These tools derive from the more general context of numerical optimization and mathematical programming. How R developers have tried to make the application of such tools easier for users not familiar with optimization is highlighted. Some limitations of methods and their implementations are mentioned to provide perspective.
这篇综述将使用R来研究函数最小化和非线性最小二乘,可能是边界约束的。这些工具源于数值优化和数学规划的更一般的背景。强调了R开发人员如何试图让不熟悉优化的用户更容易地应用这些工具。文中提到了方法及其实现的一些局限性,以提供透视图。
{"title":"Function minimization and nonlinear least squares in R","authors":"J. Nash","doi":"10.1002/wics.1580","DOIUrl":"https://doi.org/10.1002/wics.1580","url":null,"abstract":"This review will look at function minimization and nonlinear least squares, possibly bounds constrained, using R. These tools derive from the more general context of numerical optimization and mathematical programming. How R developers have tried to make the application of such tools easier for users not familiar with optimization is highlighted. Some limitations of methods and their implementations are mentioned to provide perspective.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45021708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Echelon analysis and its software for spatial lattice data 空间格点数据的梯队分析及其软件
IF 3.2 2区 数学 Q1 Mathematics Pub Date : 2022-03-12 DOI: 10.1002/wics.1579
K. Kurihara, Fumio Ishioka
In this study, we explore the use of echelon analysis and its software named EcheScan for spatial lattice data. EcheScan is developed as a web application via an internet browser in R language and Shiny server for echelon analysis. The technique of echelon is proposed to analyze the topological structure for spatial lattice data. The echelon tree provides a dendrogram representation. Regional features, such as hierarchical spatial data structure and hotspots clusters, are shown in an echelon dendrogram. In addition, we introduce the conception of echelon with the values and neighbors for lattice data. We also explain the use of EcheScan for one‐ and two‐dimensional regular lattice data. Furthermore, coronavirus disease 2019 death data corresponding to 50 US states are illustrated using EcheScan as an example of geospatial lattice data.
在这项研究中,我们探索了梯队分析及其软件EcheScan对空间点阵数据的使用。EcheScan是一个web应用程序,使用R语言的浏览器和Shiny服务器进行梯队分析。提出了对空间点阵数据进行拓扑结构分析的梯队技术。梯队树提供了一种树形图表示。区域特征,如层次空间数据结构和热点集群,在一个梯级树状图中显示。此外,我们还引入了格数据的具有值和邻域的阶梯形的概念。我们还解释了EcheScan在一维和二维正则晶格数据中的应用。此外,以EcheScan为例,说明了美国50个州对应的2019年冠状病毒病死亡数据。
{"title":"Echelon analysis and its software for spatial lattice data","authors":"K. Kurihara, Fumio Ishioka","doi":"10.1002/wics.1579","DOIUrl":"https://doi.org/10.1002/wics.1579","url":null,"abstract":"In this study, we explore the use of echelon analysis and its software named EcheScan for spatial lattice data. EcheScan is developed as a web application via an internet browser in R language and Shiny server for echelon analysis. The technique of echelon is proposed to analyze the topological structure for spatial lattice data. The echelon tree provides a dendrogram representation. Regional features, such as hierarchical spatial data structure and hotspots clusters, are shown in an echelon dendrogram. In addition, we introduce the conception of echelon with the values and neighbors for lattice data. We also explain the use of EcheScan for one‐ and two‐dimensional regular lattice data. Furthermore, coronavirus disease 2019 death data corresponding to 50 US states are illustrated using EcheScan as an example of geospatial lattice data.","PeriodicalId":47779,"journal":{"name":"Wiley Interdisciplinary Reviews-Computational Statistics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48195347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Wiley Interdisciplinary Reviews-Computational Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1