首页 > 最新文献

Annual Review of Statistics and Its Application最新文献

英文 中文
Model-Based Spatial Data Fusion 基于模型的空间数据融合
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-27 DOI: 10.1146/annurev-statistics-042424-052920
Alan E. Gelfand, Erin M. Schliep
With increased data collection, the need to fuse data sources has emerged as an important and rapidly growing research activity in the statistical community. In considering spatial and spatio-temporal datasets to examine complex environmental and ecological processes of interest, we often have multiple sources that are jointly informative about features of interest of the processes. Model-based data fusion aims to leverage information from these sources to improve inference and prediction. In the spatial statistics setting, these data could be geostatistical; areal; or point patterns with varying spatial resolutions, supports, and domains. Given two or more sources, we explore stochastic modeling to implement a suitable fusion with full inference and uncertainty quantification. We illustrate these ideas using three environmental and ecological examples: precipitation, marine mammal abundance, and joint species distributions.
随着数据收集的增加,融合数据源的需要已成为统计界一项重要和迅速增长的研究活动。在考虑空间和时空数据集来检查感兴趣的复杂环境和生态过程时,我们通常有多个来源,这些来源共同提供有关过程感兴趣特征的信息。基于模型的数据融合旨在利用这些来源的信息来改进推理和预测。在空间统计设置中,这些数据可以是地统计数据;区域;或具有不同空间分辨率、支持和域的点模式。在给定两个或多个源的情况下,我们探索随机建模来实现充分推理和不确定性量化的适当融合。我们用三个环境和生态的例子来说明这些观点:降水、海洋哺乳动物丰度和共同物种分布。
{"title":"Model-Based Spatial Data Fusion","authors":"Alan E. Gelfand, Erin M. Schliep","doi":"10.1146/annurev-statistics-042424-052920","DOIUrl":"https://doi.org/10.1146/annurev-statistics-042424-052920","url":null,"abstract":"With increased data collection, the need to fuse data sources has emerged as an important and rapidly growing research activity in the statistical community. In considering spatial and spatio-temporal datasets to examine complex environmental and ecological processes of interest, we often have multiple sources that are jointly informative about features of interest of the processes. Model-based data fusion aims to leverage information from these sources to improve inference and prediction. In the spatial statistics setting, these data could be geostatistical; areal; or point patterns with varying spatial resolutions, supports, and domains. Given two or more sources, we explore stochastic modeling to implement a suitable fusion with full inference and uncertainty quantification. We illustrate these ideas using three environmental and ecological examples: precipitation, marine mammal abundance, and joint species distributions.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"10 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2025-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144910905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Demystifying Inference After Adaptive Experiments 自适应实验后推理的揭秘
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-04 DOI: 10.1146/annurev-statistics-040522-015431
Aurélien Bibaut, Nathan Kallus
Adaptive experiments such as multi-armed bandits adapt the treatment-allocation policy and/or the decision to stop the experiment to the data observed so far. This has the potential to improve outcomes for study participants within the experiment, to improve the chance of identifying the best treatments after the experiment, and to avoid wasting data. As an experiment (and not just a continually optimizing system), it is still desirable to draw statistical inferences with frequentist guarantees. The concentration inequalities and union bounds that generally underlie adaptive experimentation algorithms can yield overly conservative inferences, but at the same time, the asymptotic normality we would usually appeal to in nonadaptive settings can be imperiled by adaptivity. In this article we aim to explain why, how, and when adaptivity is in fact an issue for inference and, when it is, to understand the various ways to fix it: reweighting to stabilize variances and recover asymptotic normality, using always-valid inference based on joint normality of an asymptotic limiting sequence, and characterizing and inverting the nonnormal distributions induced by adaptivity.
适应性实验,如多武装强盗,根据迄今为止观察到的数据调整处理分配政策和/或停止实验的决定。这有可能改善实验参与者的结果,提高实验后确定最佳治疗方法的机会,并避免浪费数据。作为一个实验(而不仅仅是一个不断优化的系统),用频率保证得出统计推断仍然是可取的。通常作为自适应实验算法基础的集中不等式和联合边界可能会产生过于保守的推断,但与此同时,我们通常在非自适应环境中所呼吁的渐近正态性可能会受到自适应性的危害。在本文中,我们的目的是解释为什么,如何,以及何时自适应实际上是推理的一个问题,以及当它是,了解解决它的各种方法:重新加权以稳定方差和恢复渐近正态,使用基于渐近极限序列的联合正态的始终有效的推理,以及表征和反转由自适应引起的非正态分布。
{"title":"Demystifying Inference After Adaptive Experiments","authors":"Aurélien Bibaut, Nathan Kallus","doi":"10.1146/annurev-statistics-040522-015431","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040522-015431","url":null,"abstract":"Adaptive experiments such as multi-armed bandits adapt the treatment-allocation policy and/or the decision to stop the experiment to the data observed so far. This has the potential to improve outcomes for study participants within the experiment, to improve the chance of identifying the best treatments after the experiment, and to avoid wasting data. As an experiment (and not just a continually optimizing system), it is still desirable to draw statistical inferences with frequentist guarantees. The concentration inequalities and union bounds that generally underlie adaptive experimentation algorithms can yield overly conservative inferences, but at the same time, the asymptotic normality we would usually appeal to in nonadaptive settings can be imperiled by adaptivity. In this article we aim to explain why, how, and when adaptivity is in fact an issue for inference and, when it is, to understand the various ways to fix it: reweighting to stabilize variances and recover asymptotic normality, using always-valid inference based on joint normality of an asymptotic limiting sequence, and characterizing and inverting the nonnormal distributions induced by adaptivity.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"26 1","pages":"407-423"},"PeriodicalIF":7.9,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144565895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models 深度学习统计理论概览:逼近、训练动态和生成模型
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-21 DOI: 10.1146/annurev-statistics-040522-013920
Namjoon Suh, Guang Cheng
In this article, we review the literature on statistical theories of neural networks from three perspectives: approximation, training dynamics, and generative models. In the first part, results on excess risks for neural networks are reviewed in the nonparametric framework of regression. These results rely on explicit constructions of neural networks, leading to fast convergence rates of excess risks. Nonetheless, their underlying analysis only applies to the global minimizer in the highly nonconvex landscape of deep neural networks. This motivates us to review the training dynamics of neural networks in the second part. Specifically, we review articles that attempt to answer the question of how a neural network trained via gradient-based methods finds a solution that can generalize well on unseen data. In particular, two well-known paradigms are reviewed: the neural tangent kernel and mean-field paradigms. Last, we review the most recent theoretical advancements in generative models, including generative adversarial networks, diffusion models, and in-context learning in large language models from two of the same perspectives, approximation and training dynamics.
在本文中,我们从逼近、训练动态和生成模型三个角度回顾了有关神经网络统计理论的文献。在第一部分中,我们回顾了在非参数回归框架下神经网络的超额风险结果。这些结果依赖于神经网络的明确构造,从而导致超额风险的快速收敛率。然而,它们的基本分析只适用于深度神经网络高度非凸景观中的全局最小化。这促使我们在第二部分回顾神经网络的训练动态。具体来说,我们回顾了一些文章,这些文章试图回答这样一个问题:通过基于梯度的方法训练的神经网络如何找到一个能在未见数据上很好泛化的解决方案。我们特别回顾了两种著名的范式:神经正切核和均值场范式。最后,我们回顾了生成模型的最新理论进展,包括生成对抗网络、扩散模型,以及从近似和训练动态这两个相同的角度对大型语言模型进行的上下文学习。
{"title":"A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models","authors":"Namjoon Suh, Guang Cheng","doi":"10.1146/annurev-statistics-040522-013920","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040522-013920","url":null,"abstract":"In this article, we review the literature on statistical theories of neural networks from three perspectives: approximation, training dynamics, and generative models. In the first part, results on excess risks for neural networks are reviewed in the nonparametric framework of regression. These results rely on explicit constructions of neural networks, leading to fast convergence rates of excess risks. Nonetheless, their underlying analysis only applies to the global minimizer in the highly nonconvex landscape of deep neural networks. This motivates us to review the training dynamics of neural networks in the second part. Specifically, we review articles that attempt to answer the question of how a neural network trained via gradient-based methods finds a solution that can generalize well on unseen data. In particular, two well-known paradigms are reviewed: the neural tangent kernel and mean-field paradigms. Last, we review the most recent theoretical advancements in generative models, including generative adversarial networks, diffusion models, and in-context learning in large language models from two of the same perspectives, approximation and training dynamics.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"111 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142684813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Models and Rating Systems for Head-to-Head Competition 正面交锋的模型和评级系统
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-20 DOI: 10.1146/annurev-statistics-040722-061813
Mark E. Glickman, Albyn C. Jones
One of the most important tasks in sports analytics is the development of binary response models for head-to-head game outcomes to estimate team and player strength. We discuss commonly used probability models for game outcomes, including the Bradley–Terry and Thurstone–Mosteller models, as well as extensions to ties as a third outcome and to the inclusion of a home-field advantage. We consider dynamic extensions to these models to account for the evolution of competitor strengths over time. Full likelihood-based analyses of these time-varying models can be simplified into rating systems, such as the Elo and Glicko rating systems. We present other modern rating systems, including popular methods for online gaming, and novel systems that have been implemented for online chess and Go. The discussion of the analytic methods are accompanied by examples of where these approaches have been implemented for various gaming organizations, as well as a detailed application to National Basketball Association game outcomes.
体育分析中最重要的任务之一是为正面交锋的比赛结果建立二元响应模型,以估计球队和球员的实力。我们讨论了常用的比赛结果概率模型,包括 Bradley-Terry 模型和 Thurstone-Mosteller 模型,以及将平局作为第三种结果和包含主场优势的扩展模型。我们考虑对这些模型进行动态扩展,以考虑竞争对手实力随时间的变化。这些时变模型的完全似然分析可简化为评级系统,如 Elo 和 Glicko 评级系统。我们还介绍了其他现代评级系统,包括用于在线对弈的流行方法,以及用于在线国际象棋和围棋的新型系统。在讨论分析方法的同时,我们还举例说明了这些方法在各种游戏组织中的应用,以及在美国国家篮球协会比赛结果中的详细应用。
{"title":"Models and Rating Systems for Head-to-Head Competition","authors":"Mark E. Glickman, Albyn C. Jones","doi":"10.1146/annurev-statistics-040722-061813","DOIUrl":"https://doi.org/10.1146/annurev-statistics-040722-061813","url":null,"abstract":"One of the most important tasks in sports analytics is the development of binary response models for head-to-head game outcomes to estimate team and player strength. We discuss commonly used probability models for game outcomes, including the Bradley–Terry and Thurstone–Mosteller models, as well as extensions to ties as a third outcome and to the inclusion of a home-field advantage. We consider dynamic extensions to these models to account for the evolution of competitor strengths over time. Full likelihood-based analyses of these time-varying models can be simplified into rating systems, such as the Elo and Glicko rating systems. We present other modern rating systems, including popular methods for online gaming, and novel systems that have been implemented for online chess and Go. The discussion of the analytic methods are accompanied by examples of where these approaches have been implemented for various gaming organizations, as well as a detailed application to National Basketball Association game outcomes.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"1 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142679222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review of Reinforcement Learning in Financial Applications 金融应用中的强化学习回顾
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-15 DOI: 10.1146/annurev-statistics-112723-034423
Yahui Bai, Yuhe Gao, Runzhe Wan, Sheng Zhang, Rui Song
In recent years, there has been a growing trend of applying reinforcement learning (RL) in financial applications. This approach has shown great potential for decision-making tasks in finance. In this review, we present a comprehensive study of the applications of RL in finance and conduct a series of meta-analyses to investigate the common themes in the literature, such as the factors that most significantly affect RL's performance compared with traditional methods. Moreover, we identify challenges, including explainability, Markov decision process modeling, and robustness, that hinder the broader utilization of RL in the financial industry and discuss recent advancements in overcoming these challenges. Finally, we propose future research directions, such as benchmarking, contextual RL, multi-agent RL, and model-based RL to address these challenges and to further enhance the implementation of RL in finance.
近年来,在金融应用中应用强化学习(RL)的趋势越来越明显。这种方法在金融决策任务中显示出巨大的潜力。在这篇综述中,我们对强化学习在金融领域的应用进行了全面研究,并进行了一系列元分析,以探讨文献中的共同主题,例如与传统方法相比,哪些因素对强化学习的性能影响最大。此外,我们还发现了一些挑战,包括可解释性、马尔可夫决策过程建模和稳健性,这些挑战阻碍了 RL 在金融业的广泛应用,并讨论了在克服这些挑战方面的最新进展。最后,我们提出了未来的研究方向,如基准测试、情境 RL、多代理 RL 和基于模型的 RL,以应对这些挑战并进一步加强 RL 在金融领域的应用。
{"title":"A Review of Reinforcement Learning in Financial Applications","authors":"Yahui Bai, Yuhe Gao, Runzhe Wan, Sheng Zhang, Rui Song","doi":"10.1146/annurev-statistics-112723-034423","DOIUrl":"https://doi.org/10.1146/annurev-statistics-112723-034423","url":null,"abstract":"In recent years, there has been a growing trend of applying reinforcement learning (RL) in financial applications. This approach has shown great potential for decision-making tasks in finance. In this review, we present a comprehensive study of the applications of RL in finance and conduct a series of meta-analyses to investigate the common themes in the literature, such as the factors that most significantly affect RL's performance compared with traditional methods. Moreover, we identify challenges, including explainability, Markov decision process modeling, and robustness, that hinder the broader utilization of RL in the financial industry and discuss recent advancements in overcoming these challenges. Finally, we propose future research directions, such as benchmarking, contextual RL, multi-agent RL, and model-based RL to address these challenges and to further enhance the implementation of RL in finance.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"25 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142642981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint Modeling of Longitudinal and Survival Data 纵向数据和生存数据的联合建模
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-14 DOI: 10.1146/annurev-statistics-112723-034334
Jane-Ling Wang, Qixian Zhong
In medical studies, time-to-event outcomes such as time to death or relapse of a disease are routinely recorded along with longitudinal data that are observed intermittently during the follow-up period. For various reasons, marginal approaches to model the event time, corresponding to separate approaches for survival data/longitudinal data, tend to induce bias and lose efficiency. Instead, a joint modeling approach that brings the two types of data together can reduce or eliminate the bias and yield a more efficient estimation procedure. A well-established avenue for joint modeling is the joint likelihood approach that often produces semiparametric efficient estimators for the finite-dimensional parameter vectors in both models. Through a transformation survival model with an unspecified baseline hazard function, this review introduces joint modeling that accommodates both baseline covariates and time-varying covariates. The focus is on the major challenges faced by joint modeling and how they can be overcome. A review of available software implementations and a brief discussion of future directions of the field are also included.
在医学研究中,时间到事件的结果(如死亡时间或疾病复发时间)通常与在随访期间间歇观察到的纵向数据一起记录。由于种种原因,对生存数据/纵向数据分别采用边际方法来建立事件时间模型,往往会产生偏差并降低效率。相反,将两类数据结合在一起的联合建模方法可以减少或消除偏差,并产生更有效的估算程序。联合建模的一个行之有效的方法是联合似然法,这种方法通常能对两个模型中的有限维参数向量产生半参数有效估计。本综述通过一个具有未指定基线危险函数的转化生存模型,介绍了同时考虑基线协变量和时变协变量的联合建模方法。重点是联合建模面临的主要挑战以及如何克服这些挑战。本综述还包括对现有软件实现的回顾,以及对该领域未来发展方向的简要讨论。
{"title":"Joint Modeling of Longitudinal and Survival Data","authors":"Jane-Ling Wang, Qixian Zhong","doi":"10.1146/annurev-statistics-112723-034334","DOIUrl":"https://doi.org/10.1146/annurev-statistics-112723-034334","url":null,"abstract":"In medical studies, time-to-event outcomes such as time to death or relapse of a disease are routinely recorded along with longitudinal data that are observed intermittently during the follow-up period. For various reasons, marginal approaches to model the event time, corresponding to separate approaches for survival data/longitudinal data, tend to induce bias and lose efficiency. Instead, a joint modeling approach that brings the two types of data together can reduce or eliminate the bias and yield a more efficient estimation procedure. A well-established avenue for joint modeling is the joint likelihood approach that often produces semiparametric efficient estimators for the finite-dimensional parameter vectors in both models. Through a transformation survival model with an unspecified baseline hazard function, this review introduces joint modeling that accommodates both baseline covariates and time-varying covariates. The focus is on the major challenges faced by joint modeling and how they can be overcome. A review of available software implementations and a brief discussion of future directions of the field are also included.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"246 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142637200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Methods for Amortized Inference 用于摊销推理的神经方法
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-12 DOI: 10.1146/annurev-statistics-112723-034123
Andrew Zammit-Mangion, Matthew Sainsbury-Dale, Raphaël Huser
Simulation-based methods for statistical inference have evolved dramatically over the past 50 years, keeping pace with technological advancements. The field is undergoing a new revolution as it embraces the representational capacity of neural networks, optimization libraries, and graphics processing units for learning complex mappings between data and inferential targets. The resulting tools are amortized, in the sense that, after an initial setup cost, they allow rapid inference through fast feed-forward operations. In this article we review recent progress in the context of point estimation, approximate Bayesian inference, summary-statistic construction, and likelihood approximation. We also cover software and include a simple illustration to showcase the wide array of tools available for amortized inference and the benefits they offer over Markov chain Monte Carlo methods. The article concludes with an overview of relevant topics and an outlook on future research directions.
在过去的 50 年中,基于模拟的统计推断方法随着技术的进步而发生了巨大的变化。随着神经网络、优化库和图形处理单元在学习数据与推理目标之间复杂映射时的表征能力不断增强,这一领域正在经历一场新的革命。由此产生的工具具有摊销性,即在初始设置成本之后,可通过快速前馈操作进行快速推理。在这篇文章中,我们回顾了在点估计、近似贝叶斯推断、汇总统计构造和似然逼近等方面的最新进展。我们还介绍了相关软件,并通过一个简单的插图展示了可用于摊销推断的各种工具,以及这些工具与马尔科夫链蒙特卡罗方法相比所具有的优势。文章最后概述了相关主题并展望了未来的研究方向。
{"title":"Neural Methods for Amortized Inference","authors":"Andrew Zammit-Mangion, Matthew Sainsbury-Dale, Raphaël Huser","doi":"10.1146/annurev-statistics-112723-034123","DOIUrl":"https://doi.org/10.1146/annurev-statistics-112723-034123","url":null,"abstract":"Simulation-based methods for statistical inference have evolved dramatically over the past 50 years, keeping pace with technological advancements. The field is undergoing a new revolution as it embraces the representational capacity of neural networks, optimization libraries, and graphics processing units for learning complex mappings between data and inferential targets. The resulting tools are amortized, in the sense that, after an initial setup cost, they allow rapid inference through fast feed-forward operations. In this article we review recent progress in the context of point estimation, approximate Bayesian inference, summary-statistic construction, and likelihood approximation. We also cover software and include a simple illustration to showcase the wide array of tools available for amortized inference and the benefits they offer over Markov chain Monte Carlo methods. The article concludes with an overview of relevant topics and an outlook on future research directions.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"95 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142601277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Infectious Disease Modeling 传染病建模
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-12 DOI: 10.1146/annurev-statistics-112723-034351
Jing Huang, Jeffrey S. Morris
Infectious diseases pose a persistent challenge to public health worldwide. Recent global health crises, such as the COVID-19 pandemic and Ebola outbreaks, have underscored the vital role of infectious disease modeling in guiding public health policy and response. Infectious disease modeling is a critical tool for society, informing risk mitigation measures, prompting timely interventions, and aiding preparedness for healthcare delivery systems. This article synthesizes the current landscape of infectious disease modeling, emphasizing the integration of statistical methods in understanding and predicting the spread of infectious diseases. We begin by examining the historical context and the foundational models that have shaped the field, such as the SIR (susceptible, infectious, recovered) and SEIR (susceptible, exposed, infectious, recovered) models. Subsequently, we delve into the methodological innovations that have arisen, including stochastic modeling, network-based approaches, and the use of big data analytics. We also explore the integration of machine learning techniques in enhancing model accuracy and responsiveness. The review identifies the challenges of parameter estimation, model validation, and the incorporation of real-time data streams. Moreover, we discuss the ethical implications of modeling, such as privacy concerns and the communication of risk. The article concludes by discussing future directions for research, highlighting the need for data integration and interdisciplinary collaboration for advancing infectious disease modeling.
传染病对全球公共卫生构成了持续挑战。最近的全球健康危机,如 COVID-19 大流行和埃博拉疫情,凸显了传染病建模在指导公共卫生政策和应对措施方面的重要作用。传染病建模是社会的重要工具,可为降低风险的措施提供信息,促进及时干预,并帮助医疗保健服务系统做好准备。本文综述了传染病建模的现状,强调了在理解和预测传染病传播过程中统计方法的整合。我们首先考察了历史背景和塑造这一领域的基础模型,如 SIR(易感者、感染者、康复者)和 SEIR(易感者、暴露者、感染者、康复者)模型。随后,我们深入探讨了已出现的方法创新,包括随机建模、基于网络的方法和大数据分析的使用。我们还探讨了机器学习技术在提高模型准确性和响应速度方面的整合。综述指出了参数估计、模型验证和实时数据流整合方面的挑战。此外,我们还讨论了建模所涉及的伦理问题,如隐私问题和风险交流。文章最后讨论了未来的研究方向,强调了数据整合和跨学科合作对推进传染病建模的必要性。
{"title":"Infectious Disease Modeling","authors":"Jing Huang, Jeffrey S. Morris","doi":"10.1146/annurev-statistics-112723-034351","DOIUrl":"https://doi.org/10.1146/annurev-statistics-112723-034351","url":null,"abstract":"Infectious diseases pose a persistent challenge to public health worldwide. Recent global health crises, such as the COVID-19 pandemic and Ebola outbreaks, have underscored the vital role of infectious disease modeling in guiding public health policy and response. Infectious disease modeling is a critical tool for society, informing risk mitigation measures, prompting timely interventions, and aiding preparedness for healthcare delivery systems. This article synthesizes the current landscape of infectious disease modeling, emphasizing the integration of statistical methods in understanding and predicting the spread of infectious diseases. We begin by examining the historical context and the foundational models that have shaped the field, such as the SIR (susceptible, infectious, recovered) and SEIR (susceptible, exposed, infectious, recovered) models. Subsequently, we delve into the methodological innovations that have arisen, including stochastic modeling, network-based approaches, and the use of big data analytics. We also explore the integration of machine learning techniques in enhancing model accuracy and responsiveness. The review identifies the challenges of parameter estimation, model validation, and the incorporation of real-time data streams. Moreover, we discuss the ethical implications of modeling, such as privacy concerns and the communication of risk. The article concludes by discussing future directions for research, highlighting the need for data integration and interdisciplinary collaboration for advancing infectious disease modeling.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"15 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142601274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tensors in High-Dimensional Data Analysis: Methodological Opportunities and Theoretical Challenges 高维数据分析中的张量:方法论机遇与理论挑战
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-12 DOI: 10.1146/annurev-statistics-112723-034548
Arnab Auddy, Dong Xia, Ming Yuan
Large amounts of multidimensional data represented by multiway arrays or tensors are prevalent in modern applications across various fields such as chemometrics, genomics, physics, psychology, and signal processing. The structural complexity of such data provides vast new opportunities for modeling and analysis, but efficiently extracting information content from them, both statistically and computationally, presents unique and fundamental challenges. Addressing these challenges requires an interdisciplinary approach that brings together tools and insights from statistics, optimization, and numerical linear algebra, among other fields. Despite these hurdles, significant progress has been made in the past decade. This review seeks to examine some of the key advancements and identify common threads among them, under a number of different statistical settings.
在化学计量学、基因组学、物理学、心理学和信号处理等各个领域的现代应用中,以多向阵列或张量表示的大量多维数据十分普遍。此类数据的结构复杂性为建模和分析提供了大量新机遇,但如何从这些数据中有效地提取信息内容,无论是在统计上还是在计算上,都提出了独特而根本的挑战。应对这些挑战需要一种跨学科的方法,将统计学、优化和数值线性代数等领域的工具和见解结合起来。尽管存在这些障碍,但在过去十年中已取得了重大进展。本综述试图在一些不同的统计环境下,研究其中的一些关键进展,并找出它们之间的共同点。
{"title":"Tensors in High-Dimensional Data Analysis: Methodological Opportunities and Theoretical Challenges","authors":"Arnab Auddy, Dong Xia, Ming Yuan","doi":"10.1146/annurev-statistics-112723-034548","DOIUrl":"https://doi.org/10.1146/annurev-statistics-112723-034548","url":null,"abstract":"Large amounts of multidimensional data represented by multiway arrays or tensors are prevalent in modern applications across various fields such as chemometrics, genomics, physics, psychology, and signal processing. The structural complexity of such data provides vast new opportunities for modeling and analysis, but efficiently extracting information content from them, both statistically and computationally, presents unique and fundamental challenges. Addressing these challenges requires an interdisciplinary approach that brings together tools and insights from statistics, optimization, and numerical linear algebra, among other fields. Despite these hurdles, significant progress has been made in the past decade. This review seeks to examine some of the key advancements and identify common threads among them, under a number of different statistical settings.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"40 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142601276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Excess Mortality Estimation 超额死亡率估算
IF 7.9 1区 数学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-11-12 DOI: 10.1146/annurev-statistics-112723-034236
Jon Wakefield, Victoria Knutson
Estimating the mortality associated with a specific mortality crisis event (for example, a pandemic, natural disaster, or conflict) is clearly an important public health undertaking. In many situations, deaths may be directly or indirectly attributable to the mortality crisis event, and both contributions may be of interest. The totality of the mortality impact on the population (direct and indirect deaths) includes the knock-on effects of the event, such as a breakdown of the health care system, or increased mortality due to shortages of resources. Unfortunately, estimating the deaths directly attributable to the event is frequently problematic. Hence, the excess mortality, defined as the difference between the observed mortality and that which would have occurred in the absence of the crisis event, is an estimation target. If the region of interest contains a functioning vital registration system, so that the mortality is fully observed and reliable, then the only modeling required is to produce the expected deaths counts, but this is a nontrivial exercise. In low- and middle-income countries it is common for there to be incomplete (or nonexistent) mortality data, and one must then use additional data and/or modeling, including predicting mortality using auxiliary variables. We describe and review each of these aspects, give examples of excess mortality studies, and provide a case study on excess mortality across states of the United States during the COVID-19 pandemic.
估算与特定死亡危机事件(如大流行病、自然灾害或冲突)相关的死亡率显然是一项重要的公共卫生工作。在许多情况下,死亡可能直接或间接归因于死亡危机事件,而这两种归因可能都会引起人们的兴趣。死亡率对人口的总体影响(直接和间接死亡)包括事件的连锁反应,如医疗保健系统崩溃,或因资源短缺导致死亡率上升。遗憾的是,估算可直接归因于事件的死亡人数经常会遇到问题。因此,超额死亡率(定义为观察到的死亡率与未发生危机事件时的死亡率之间的差异)成为估算目标。如果相关地区有一个正常运行的生命登记系统,从而可以全面观测到可靠的死亡率,那么所需的唯一建模工作就是生成预期死亡数,但这并不是一项简单的工作。在中低收入国家,死亡率数据不完整(或不存在)的情况很常见,因此必须使用额外的数据和/或建模,包括使用辅助变量预测死亡率。我们将逐一描述和回顾这些方面,举例说明超额死亡率研究,并提供 COVID-19 大流行期间美国各州超额死亡率的案例研究。
{"title":"Excess Mortality Estimation","authors":"Jon Wakefield, Victoria Knutson","doi":"10.1146/annurev-statistics-112723-034236","DOIUrl":"https://doi.org/10.1146/annurev-statistics-112723-034236","url":null,"abstract":"Estimating the mortality associated with a specific mortality crisis event (for example, a pandemic, natural disaster, or conflict) is clearly an important public health undertaking. In many situations, deaths may be directly or indirectly attributable to the mortality crisis event, and both contributions may be of interest. The totality of the mortality impact on the population (direct and indirect deaths) includes the knock-on effects of the event, such as a breakdown of the health care system, or increased mortality due to shortages of resources. Unfortunately, estimating the deaths directly attributable to the event is frequently problematic. Hence, the excess mortality, defined as the difference between the observed mortality and that which would have occurred in the absence of the crisis event, is an estimation target. If the region of interest contains a functioning vital registration system, so that the mortality is fully observed and reliable, then the only modeling required is to produce the expected deaths counts, but this is a nontrivial exercise. In low- and middle-income countries it is common for there to be incomplete (or nonexistent) mortality data, and one must then use additional data and/or modeling, including predicting mortality using auxiliary variables. We describe and review each of these aspects, give examples of excess mortality studies, and provide a case study on excess mortality across states of the United States during the COVID-19 pandemic.","PeriodicalId":48855,"journal":{"name":"Annual Review of Statistics and Its Application","volume":"6 1","pages":""},"PeriodicalIF":7.9,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142601402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Annual Review of Statistics and Its Application
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1