首页 > 最新文献

Annual Review of Statistics and Its Application最新文献

英文 中文
Statistical Brain Network Analysis 统计脑网络分析
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-28 DOI: 10.1146/annurev-statistics-040522-020722
Sean L. Simpson, Heather M. Shappell, Mohsen Bahrami
The recent fusion of network science and neuroscience has catalyzed a paradigm shift in how we study the brain and led to the field of brain network analysis. Brain network analyses hold great potential in helping us understand normal and abnormal brain function by providing profound clinical insight into links between system-level properties and health and behavioral outcomes. Nonetheless, methods for statistically analyzing networks at the group and individual levels have lagged behind. We have attempted to address this need by developing three complementary statistical frameworks—a mixed modeling framework, a distance regression framework, and a hidden semi-Markov modeling framework. These tools serve as synergistic fusions of statistical approaches with network science methods, providing needed analytic foundations for whole-brain network data. Here we delineate these approaches, briefly survey related tools, and discuss potential future avenues of research. We hope this review catalyzes further statistical interest and methodological development in the field.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
最近网络科学和神经科学的融合促进了我们如何研究大脑的范式转变,并导致了大脑网络分析领域的发展。通过对系统级属性与健康和行为结果之间的联系提供深刻的临床见解,脑网络分析在帮助我们理解正常和异常的脑功能方面具有巨大的潜力。尽管如此,对群体和个人层面的网络进行统计分析的方法仍然落后。我们试图通过开发三个互补的统计框架来解决这一需求——一个混合建模框架,一个距离回归框架和一个隐藏的半马尔科夫建模框架。这些工具作为统计方法与网络科学方法的协同融合,为全脑网络数据提供了所需的分析基础。在这里,我们概述了这些方法,简要调查了相关工具,并讨论了潜在的未来研究途径。我们希望这篇综述能催化该领域进一步的统计兴趣和方法发展。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Statistical Brain Network Analysis","authors":"Sean L. Simpson, Heather M. Shappell, Mohsen Bahrami","doi":"10.1146/annurev-statistics-040522-020722","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040522-020722","url":null,"abstract":"The recent fusion of network science and neuroscience has catalyzed a paradigm shift in how we study the brain and led to the field of brain network analysis. Brain network analyses hold great potential in helping us understand normal and abnormal brain function by providing profound clinical insight into links between system-level properties and health and behavioral outcomes. Nonetheless, methods for statistically analyzing networks at the group and individual levels have lagged behind. We have attempted to address this need by developing three complementary statistical frameworks—a mixed modeling framework, a distance regression framework, and a hidden semi-Markov modeling framework. These tools serve as synergistic fusions of statistical approaches with network science methods, providing needed analytic foundations for whole-brain network data. Here we delineate these approaches, briefly survey related tools, and discuss potential future avenues of research. We hope this review catalyzes further statistical interest and methodological development in the field.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"80 14","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138449678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relational Event Modeling 关系事件建模
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-28 DOI: 10.1146/annurev-statistics-040722-060248
Federica Bianchi, Edoardo Filippi-Mazzola, Alessandro Lomi, Ernst C. Wit
Advances in information technology have increased the availability of time-stamped relational data, such as those produced by email exchanges or interaction through social media. Whereas the associated information flows could be aggregated into cross-sectional panels, the temporal ordering of the events frequently contains information that requires new models for the analysis of continuous-time interactions, subject to both endogenous and exogenous influences. The introduction of the relational event model (REM) has been a major development that has stimulated new questions and led to further methodological developments. In this review, we track the intellectual history of the REM, define its core properties, and discuss why and how it has been considered useful in empirical research. We describe how the demands of novel applications have stimulated methodological, computational, and inferential advancements.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
信息技术的进步增加了时间戳关系数据的可用性,例如通过电子邮件交换或通过社交媒体互动产生的数据。虽然相关的信息流可以汇总成横截面面板,但事件的时间顺序往往包含需要新的模型来分析受内生和外生影响的连续时间相互作用的信息。关系事件模型(REM)的引入是一项重大发展,它激发了新的问题,并导致了进一步的方法论发展。在这篇综述中,我们追溯了快速眼动的思想史,定义了它的核心属性,并讨论了它为什么以及如何在实证研究中被认为是有用的。我们描述了新应用的需求如何刺激了方法、计算和推理的进步。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Relational Event Modeling","authors":"Federica Bianchi, Edoardo Filippi-Mazzola, Alessandro Lomi, Ernst C. Wit","doi":"10.1146/annurev-statistics-040722-060248","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040722-060248","url":null,"abstract":"Advances in information technology have increased the availability of time-stamped relational data, such as those produced by email exchanges or interaction through social media. Whereas the associated information flows could be aggregated into cross-sectional panels, the temporal ordering of the events frequently contains information that requires new models for the analysis of continuous-time interactions, subject to both endogenous and exogenous influences. The introduction of the relational event model (REM) has been a major development that has stimulated new questions and led to further methodological developments. In this review, we track the intellectual history of the REM, define its core properties, and discuss why and how it has been considered useful in empirical research. We describe how the demands of novel applications have stimulated methodological, computational, and inferential advancements.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"80 15","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138449677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Competing Risks: Concepts, Methods, and Software 竞争风险:概念、方法和软件
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-22 DOI: 10.1146/annurev-statistics-040522-094556
Ronald B. Geskus
The role of competing risks in the analysis of time-to-event data is increasingly acknowledged. Software is readily available. However, confusion remains regarding the proper analysis: When and how do I need to take the presence of competing risks into account? Which quantities are relevant for my research question? How can they be estimated and what assumptions do I need to make? The main quantities in a competing risks analysis are the cause-specific cumulative incidence, the cause-specific hazard, and the subdistribution hazard. We describe their nonparametric estimation, give an overview of regression models for each of these quantities, and explain their difference in interpretation. We discuss the proper analysis in relation to the type of study question, and we suggest software in R and Stata. Our focus is on competing risks analysis in medical research, but methods can equally be applied in other fields like social science, engineering, and economics.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
竞争风险在事件时间数据分析中的作用日益得到承认。软件是现成的。然而,关于正确的分析仍然存在困惑:何时以及如何考虑竞争风险的存在?哪些数量与我的研究问题相关?如何估计它们,我需要做什么假设?竞争风险分析的主要量是原因特异性累积发生率、原因特异性危害和亚分布危害。我们描述了它们的非参数估计,概述了这些数量的回归模型,并解释了它们在解释上的差异。我们讨论了与学习问题类型相关的适当分析,我们建议使用R和Stata软件。我们的重点是医学研究中的竞争风险分析,但方法同样可以应用于其他领域,如社会科学、工程和经济学。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Competing Risks: Concepts, Methods, and Software","authors":"Ronald B. Geskus","doi":"10.1146/annurev-statistics-040522-094556","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040522-094556","url":null,"abstract":"The role of competing risks in the analysis of time-to-event data is increasingly acknowledged. Software is readily available. However, confusion remains regarding the proper analysis: When and how do I need to take the presence of competing risks into account? Which quantities are relevant for my research question? How can they be estimated and what assumptions do I need to make? The main quantities in a competing risks analysis are the cause-specific cumulative incidence, the cause-specific hazard, and the subdistribution hazard. We describe their nonparametric estimation, give an overview of regression models for each of these quantities, and explain their difference in interpretation. We discuss the proper analysis in relation to the type of study question, and we suggest software in R and Stata. Our focus is on competing risks analysis in medical research, but methods can equally be applied in other fields like social science, engineering, and economics.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"27 8","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138293820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interpretable Machine Learning for Discovery: Statistical Challenges and Opportunities 用于发现的可解释机器学习:统计学的挑战和机遇
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-17 DOI: 10.1146/annurev-statistics-040120-030919
Genevera I. Allen, Luqin Gan, Lili Zheng
New technologies have led to vast troves of large and complex data sets across many scientific domains and industries. People routinely use machine learning techniques not only to process, visualize, and make predictions from these big data, but also to make data-driven discoveries. These discoveries are often made using interpretable machine learning, or machine learning models and techniques that yield human-understandable insights. In this article, we discuss and review the field of interpretable machine learning, focusing especially on the techniques, as they are often employed to generate new knowledge or make discoveries from large data sets. We outline the types of discoveries that can be made using interpretable machine learning in both supervised and unsupervised settings. Additionally, we focus on the grand challenge of how to validate these discoveries in a data-driven manner, which promotes trust in machine learning systems and reproducibility in science. We discuss validation both from a practical perspective, reviewing approaches based on data-splitting and stability, as well as from a theoretical perspective, reviewing statistical results on model selection consistency and uncertainty quantification via statistical inference. Finally, we conclude by highlighting open challenges in using interpretable machine learning techniques to make discoveries, including gaps between theory and practice for validating data-driven discoveries.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
新技术带来了跨越许多科学领域和行业的大量复杂数据集。人们经常使用机器学习技术,不仅可以处理、可视化并从这些大数据中做出预测,还可以进行数据驱动的发现。这些发现通常是使用可解释的机器学习,或者机器学习模型和技术来产生人类可以理解的见解。在本文中,我们讨论和回顾了可解释机器学习领域,特别关注这些技术,因为它们经常被用来产生新知识或从大型数据集中发现。我们概述了在监督和无监督设置中使用可解释机器学习可以获得的发现类型。此外,我们专注于如何以数据驱动的方式验证这些发现的重大挑战,这促进了对机器学习系统的信任和科学的可重复性。我们从实践的角度,回顾了基于数据分裂和稳定性的方法,从理论的角度,回顾了模型选择一致性和不确定性量化的统计结果。最后,我们强调了使用可解释机器学习技术进行发现的开放性挑战,包括验证数据驱动发现的理论与实践之间的差距。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Interpretable Machine Learning for Discovery: Statistical Challenges and Opportunities","authors":"Genevera I. Allen, Luqin Gan, Lili Zheng","doi":"10.1146/annurev-statistics-040120-030919","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040120-030919","url":null,"abstract":"New technologies have led to vast troves of large and complex data sets across many scientific domains and industries. People routinely use machine learning techniques not only to process, visualize, and make predictions from these big data, but also to make data-driven discoveries. These discoveries are often made using interpretable machine learning, or machine learning models and techniques that yield human-understandable insights. In this article, we discuss and review the field of interpretable machine learning, focusing especially on the techniques, as they are often employed to generate new knowledge or make discoveries from large data sets. We outline the types of discoveries that can be made using interpretable machine learning in both supervised and unsupervised settings. Additionally, we focus on the grand challenge of how to validate these discoveries in a data-driven manner, which promotes trust in machine learning systems and reproducibility in science. We discuss validation both from a practical perspective, reviewing approaches based on data-splitting and stability, as well as from a theoretical perspective, reviewing statistical results on model selection consistency and uncertainty quantification via statistical inference. Finally, we conclude by highlighting open challenges in using interpretable machine learning techniques to make discoveries, including gaps between theory and practice for validating data-driven discoveries.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"59 9","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138293187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal Inference in the Social Sciences 社会科学中的因果推理
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-17 DOI: 10.1146/annurev-statistics-033121-114601
Guido W. Imbens
Knowledge of causal effects is of great importance to decision makers in a wide variety of settings. In many cases, however, these causal effects are not known to the decision makers and need to be estimated from data. This fundamental problem has been known and studied for many years in many disciplines. In the past thirty years, however, the amount of empirical as well as methodological research in this area has increased dramatically, and so has its scope. It has become more interdisciplinary, and the focus has been more specifically on methods for credibly estimating causal effects in a wide range of both experimental and observational settings. This work has greatly impacted empirical work in the social and biomedical sciences. In this article, I review some of this work and discuss open questions.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
在各种情况下,因果关系的知识对决策者来说是非常重要的。然而,在许多情况下,决策者并不知道这些因果关系,需要根据数据进行估计。这个基本问题已经在许多学科中被认识和研究了许多年。然而,在过去的三十年中,这一领域的实证研究和方法论研究的数量急剧增加,其范围也随之扩大。它已经变得更加跨学科,重点更具体地放在在广泛的实验和观察环境中可靠地估计因果关系的方法上。这项工作极大地影响了社会和生物医学科学的实证工作。在本文中,我将回顾其中的一些工作,并讨论一些悬而未决的问题。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Causal Inference in the Social Sciences","authors":"Guido W. Imbens","doi":"10.1146/annurev-statistics-033121-114601","DOIUrl":"https://doi.org/10.1146/annurev-statistics-033121-114601","url":null,"abstract":"Knowledge of causal effects is of great importance to decision makers in a wide variety of settings. In many cases, however, these causal effects are not known to the decision makers and need to be estimated from data. This fundamental problem has been known and studied for many years in many disciplines. In the past thirty years, however, the amount of empirical as well as methodological research in this area has increased dramatically, and so has its scope. It has become more interdisciplinary, and the focus has been more specifically on methods for credibly estimating causal effects in a wide range of both experimental and observational settings. This work has greatly impacted empirical work in the social and biomedical sciences. In this article, I review some of this work and discuss open questions.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"59 8","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138293188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Distributed Computing and Inference for Big Data 面向大数据的分布式计算与推理
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-17 DOI: 10.1146/annurev-statistics-040522-021241
Ling Zhou, Ziyang Gong, Pengcheng Xiang
Data are distributed across different sites due to computing facility limitations or data privacy considerations. Conventional centralized methods—those in which all datasets are stored and processed in a central computing facility—are not applicable in practice. Therefore, it has become necessary to develop distributed learning approaches that have good inference or predictive accuracy while remaining free of individual data or obeying policies and regulations to protect privacy. In this article, we introduce the basic idea of distributed learning and conduct a selected review on various distributed learning methods, which are categorized by their statistical accuracy, computational efficiency, heterogeneity, and privacy. This categorization can help evaluate newly proposed methods from different aspects. Moreover, we provide up-to-date descriptions of the existing theoretical results that cover statistical equivalency and computational efficiency under different statistical learning frameworks. Finally, we provide existing software implementations and benchmark datasets, and we discuss future research opportunities.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
由于计算设施的限制或数据隐私的考虑,数据分布在不同的站点上。传统的集中式方法——即所有数据集在一个中央计算设施中存储和处理——在实践中并不适用。因此,有必要开发具有良好推理或预测准确性的分布式学习方法,同时保持个人数据的自由或遵守保护隐私的政策和法规。在本文中,我们介绍了分布式学习的基本思想,并对各种分布式学习方法进行了选择性回顾,这些方法根据其统计准确性、计算效率、异质性和隐私性进行了分类。这种分类有助于从不同方面评估新提出的方法。此外,我们还提供了涵盖不同统计学习框架下统计等效性和计算效率的现有理论结果的最新描述。最后,我们提供了现有的软件实现和基准数据集,并讨论了未来的研究机会。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Distributed Computing and Inference for Big Data","authors":"Ling Zhou, Ziyang Gong, Pengcheng Xiang","doi":"10.1146/annurev-statistics-040522-021241","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040522-021241","url":null,"abstract":"Data are distributed across different sites due to computing facility limitations or data privacy considerations. Conventional centralized methods—those in which all datasets are stored and processed in a central computing facility—are not applicable in practice. Therefore, it has become necessary to develop distributed learning approaches that have good inference or predictive accuracy while remaining free of individual data or obeying policies and regulations to protect privacy. In this article, we introduce the basic idea of distributed learning and conduct a selected review on various distributed learning methods, which are categorized by their statistical accuracy, computational efficiency, heterogeneity, and privacy. This categorization can help evaluate newly proposed methods from different aspects. Moreover, we provide up-to-date descriptions of the existing theoretical results that cover statistical equivalency and computational efficiency under different statistical learning frameworks. Finally, we provide existing software implementations and benchmark datasets, and we discuss future research opportunities.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"59 7","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138293189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geometric Methods for Cosmological Data on the Sphere 球面上宇宙学数据的几何方法
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-06 DOI: 10.1146/annurev-statistics-040522-093748
Javier Carrón Duque, Domenico Marinucci
This review is devoted to recent developments in the statistical analysis of spherical data, strongly motivated by applications in cosmology. We start from a brief discussion of cosmological questions and motivations, arguing that most cosmological observables are spherical random fields. Then, we introduce some mathematical background on spherical random fields, including spectral representations and the construction of needlet and wavelet frames. We then focus on some specific issues, including tools and algorithms for map reconstruction (i.e., separating the different physical components that contribute to the observed field), geometric tools for testing the assumptions of Gaussianity and isotropy, and multiple testing methods to detect contamination in the field due to point sources. Although these tools are introduced in the cosmological context, they can be applied to other situations dealing with spherical data. Finally, we discuss more recent and challenging issues, such as the analysis of polarization data, which can be viewed as realizations of random fields taking values in spin fiber bundles.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
这篇综述致力于球面数据统计分析的最新发展,其强烈动机是宇宙学的应用。我们从宇宙学问题和动机的简短讨论开始,认为大多数宇宙学可观察性都是球面随机场。然后,我们介绍了球面随机场的一些数学背景,包括谱表示以及针状和小波框架的构造。然后,我们关注一些具体问题,包括用于地图重建的工具和算法(即,分离对观测场有贡献的不同物理分量),用于测试高斯性和各向同性假设的几何工具,以及用于检测场中由于点源造成的污染的多种测试方法。尽管这些工具是在宇宙学背景下引入的,但它们也可以应用于处理球面数据的其他情况。最后,我们讨论了最近的和具有挑战性的问题,例如偏振数据的分析,这可以被视为自旋光纤束中取值的随机场的实现。《统计及其应用年度评论》第11卷预计最终在线出版日期为2024年3月。请参阅http://www.annualreviews.org/page/journal/pubdates用于修订估算。
{"title":"Geometric Methods for Cosmological Data on the Sphere","authors":"Javier Carrón Duque, Domenico Marinucci","doi":"10.1146/annurev-statistics-040522-093748","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040522-093748","url":null,"abstract":"This review is devoted to recent developments in the statistical analysis of spherical data, strongly motivated by applications in cosmology. We start from a brief discussion of cosmological questions and motivations, arguing that most cosmological observables are spherical random fields. Then, we introduce some mathematical background on spherical random fields, including spectral representations and the construction of needlet and wavelet frames. We then focus on some specific issues, including tools and algorithms for map reconstruction (i.e., separating the different physical components that contribute to the observed field), geometric tools for testing the assumptions of Gaussianity and isotropy, and multiple testing methods to detect contamination in the field due to point sources. Although these tools are introduced in the cosmological context, they can be applied to other situations dealing with spherical data. Finally, we discuss more recent and challenging issues, such as the analysis of polarization data, which can be viewed as realizations of random fields taking values in spin fiber bundles.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"11 3","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71473809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic Models of Rainfall 降雨的随机模型
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-31 DOI: 10.1146/annurev-statistics-040622-023838
Paul J. Northrop
Rainfall is the main input to most hydrological systems. To assess flood risk for a catchment area, hydrologists use models that require long series of subdaily, perhaps even subhourly, rainfall data, ideally from locations that cover the area. If historical data are not sufficient for this purpose, an alternative is to simulate synthetic data from a suitably calibrated model. We review stochastic models that have a mechanistic structure, intended to mimic physical features of the rainfall processes, and are constructed using stationary point processes. We describe models for temporal and spatial-temporal rainfall and consider how they can be fitted to data. We provide an example application using a temporal model and an illustration of data simulated from a spatial-temporal model. We discuss how these models can contribute to the simulation of future rainfall that reflects our changing climate.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
降雨是大多数水文系统的主要输入。为了评估集水区的洪水风险,水文学家使用的模型需要一系列的亚日甚至亚小时的降雨数据,最好是来自覆盖该地区的位置。如果历史数据不足以达到此目的,则可选择模拟来自适当校准模型的合成数据。我们回顾了具有机械结构的随机模型,旨在模拟降雨过程的物理特征,并使用驻点过程构建。我们描述了时间和时空降雨的模型,并考虑如何将其与数据拟合。我们提供了一个使用时间模型的示例应用程序,并说明了从时空模型模拟的数据。我们讨论了这些模型如何有助于模拟未来的降雨量,以反映我们不断变化的气候。《统计及其应用年度评论》第11卷预计最终在线出版日期为2024年3月。请参阅http://www.annualreviews.org/page/journal/pubdates用于修订估算。
{"title":"Stochastic Models of Rainfall","authors":"Paul J. Northrop","doi":"10.1146/annurev-statistics-040622-023838","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040622-023838","url":null,"abstract":"Rainfall is the main input to most hydrological systems. To assess flood risk for a catchment area, hydrologists use models that require long series of subdaily, perhaps even subhourly, rainfall data, ideally from locations that cover the area. If historical data are not sufficient for this purpose, an alternative is to simulate synthetic data from a suitably calibrated model. We review stochastic models that have a mechanistic structure, intended to mimic physical features of the rainfall processes, and are constructed using stationary point processes. We describe models for temporal and spatial-temporal rainfall and consider how they can be fitted to data. We provide an example application using a temporal model and an illustration of data simulated from a spatial-temporal model. We discuss how these models can contribute to the simulation of future rainfall that reflects our changing climate.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"88 23","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71435578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Update on Measurement Error Modeling 测量误差建模研究进展
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-13 DOI: 10.1146/annurev-statistics-040722-043616
Mushan Li, Yanyuan Ma
The issues caused by measurement errors have been recognized for almost 90 years, and research in this area has flourished since the 1980s. We review some of the classical methods in both density estimation and regression problems with measurement errors. In both problems, we consider when the original error-free model is parametric, nonparametric, and semiparametric, in combination with different error types. We also summarize and explain some new approaches, including recent developments and challenges in the high-dimensional setting.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
测量误差引起的问题已经被认识了近90年,自20世纪80年代以来,这一领域的研究开始蓬勃发展。我们回顾了一些经典的密度估计和回归问题的测量误差的方法。在这两个问题中,我们结合不同的误差类型,考虑了原始无误差模型是参数、非参数和半参数的情况。我们还总结和解释了一些新的方法,包括高维环境中的最新发展和挑战。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"An Update on Measurement Error Modeling","authors":"Mushan Li, Yanyuan Ma","doi":"10.1146/annurev-statistics-040722-043616","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040722-043616","url":null,"abstract":"The issues caused by measurement errors have been recognized for almost 90 years, and research in this area has flourished since the 1980s. We review some of the classical methods in both density estimation and regression problems with measurement errors. In both problems, we consider when the original error-free model is parametric, nonparametric, and semiparametric, in combination with different error types. We also summarize and explain some new approaches, including recent developments and challenges in the high-dimensional setting.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"20 14","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50164713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Microbiome Data 微生物组数据分析
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-13 DOI: 10.1146/annurev-statistics-040522-120734
Christine B. Peterson, Satabdi Saha, Kim-Anh Do
The microbiome represents a hidden world of tiny organisms populating not only our surroundings but also our own bodies. By enabling comprehensive profiling of these invisible creatures, modern genomic sequencing tools have given us an unprecedented ability to characterize these populations and uncover their outsize impact on our environment and health. Statistical analysis of microbiome data is critical to infer patterns from the observed abundances. The application and development of analytical methods in this area require careful consideration of the unique aspects of microbiome profiles. We begin this review with a brief overview of microbiome data collection and processing and describe the resulting data structure. We then provide an overview of statistical methods for key tasks in microbiome data analysis, including data visualization, comparison of microbial abundance across groups, regression modeling, and network inference. We conclude with a discussion and highlight interesting future directions.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
微生物群代表了一个隐藏的世界,里面的微生物不仅存在于我们周围的环境中,也存在于我们自己的身体中。通过对这些看不见的生物进行全面的分析,现代基因组测序工具赋予了我们前所未有的能力来描述这些种群,并揭示它们对我们的环境和健康的巨大影响。微生物组数据的统计分析对于从观察到的丰度推断模式至关重要。分析方法在这一领域的应用和发展需要仔细考虑微生物组谱的独特方面。我们首先简要介绍微生物组数据的收集和处理,并描述由此产生的数据结构。然后,我们概述了微生物组数据分析中关键任务的统计方法,包括数据可视化、组间微生物丰度比较、回归建模和网络推理。最后,我们进行了讨论,并强调了有趣的未来方向。预计《统计年鉴及其应用》第11卷的最终在线出版日期为2024年3月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。
{"title":"Analysis of Microbiome Data","authors":"Christine B. Peterson, Satabdi Saha, Kim-Anh Do","doi":"10.1146/annurev-statistics-040522-120734","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040522-120734","url":null,"abstract":"The microbiome represents a hidden world of tiny organisms populating not only our surroundings but also our own bodies. By enabling comprehensive profiling of these invisible creatures, modern genomic sequencing tools have given us an unprecedented ability to characterize these populations and uncover their outsize impact on our environment and health. Statistical analysis of microbiome data is critical to infer patterns from the observed abundances. The application and development of analytical methods in this area require careful consideration of the unique aspects of microbiome profiles. We begin this review with a brief overview of microbiome data collection and processing and describe the resulting data structure. We then provide an overview of statistical methods for key tasks in microbiome data analysis, including data visualization, comparison of microbial abundance across groups, regression modeling, and network inference. We conclude with a discussion and highlight interesting future directions.Expected final online publication date for the Annual Review of Statistics and Its Application, Volume 11 is March 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"20 16","pages":""},"PeriodicalIF":7.9,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50164711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Annual Review of Statistics and Its Application
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1