首页 > 最新文献

Statistical Science最新文献

英文 中文
The Secret Life of I. J. Good I. J.古德的秘密生活
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.1214/22-sts870
S. Zabell
{"title":"The Secret Life of I. J. Good","authors":"S. Zabell","doi":"10.1214/22-sts870","DOIUrl":"https://doi.org/10.1214/22-sts870","url":null,"abstract":"","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46359832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rejoinder: Confidence as Likelihood 复辩状:信心是可能的
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 DOI: 10.1214/22-sts869
Y. Pawitan, Youngjo Lee
{"title":"Rejoinder: Confidence as Likelihood","authors":"Y. Pawitan, Youngjo Lee","doi":"10.1214/22-sts869","DOIUrl":"https://doi.org/10.1214/22-sts869","url":null,"abstract":"","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48482039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Conversation with David J. Aldous 《与大卫·j·奥多斯的对话
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 DOI: 10.1214/22-sts849
S. Bhamidi
. David John Aldous was born in Exeter U.K. on July 13, 1952. He received a B.A. and Ph.D. in Mathematics in 1973 and 1977, respectively from Cambridge. After spending two years as a research fellow at St. John’s College, Cambridge, he joined the Department of Statistics at the University of California, Berkeley in 1979 where he spent the rest of his academic career until retiring in 2018. He is known for seminal contributions on many topics within probability including weak convergence and tightness, exchangeability, Markov chain mixing times, Poisson clumping heuristic and limit theory for large discrete random structures including random trees, stochastic coagulation and fragmentation systems, models of complex networks and interacting particle systems on such structures. For his contributions to the field, he has received numerous honors and awards including the Rollo David-son prize in 1980, the inaugural Loeve prize in Probability in 1993, and the Brouwer medal in 2021, and being elected as an IMS fellow in 1985, Fellow of the Royal Society in 1994, Fellow of the American Academy of Arts and Sciences in 2004, elected to the National Academy of Sciences (foreign associate) in 2010, ICM plenary speaker in 2010 and AMS fellow in 2012.
戴维·约翰·奥尔德斯1952年7月13日出生于英国埃克塞特。他分别于1973年和1977年在剑桥大学获得数学学士和博士学位。在剑桥圣约翰学院做了两年研究员后,他于1979年加入加州大学伯克利分校统计系,在那里度过了他的学术生涯,直到2018年退休。他在概率学的许多主题上做出了开创性的贡献,包括弱收敛性和紧密性、可交换性、马尔可夫链混合时间、泊松聚集启发式和大型离散随机结构的极限理论,包括随机树、随机凝聚和碎片系统、复杂网络模型和此类结构上的相互作用粒子系统。由于他在该领域的贡献,他获得了许多荣誉和奖项,包括1980年的罗洛-大卫森奖、1993年的首届洛夫概率奖和2021年的布劳沃奖章,并于1985年当选为IMS研究员,1994年当选为皇家学会院士,2004年当选为美国艺术与科学院院士,2010年当选为美国国家科学院院士(外籍院士),2010年当选ICM全体议长,2012年当选AMS研究员。
{"title":"A Conversation with David J. Aldous","authors":"S. Bhamidi","doi":"10.1214/22-sts849","DOIUrl":"https://doi.org/10.1214/22-sts849","url":null,"abstract":". David John Aldous was born in Exeter U.K. on July 13, 1952. He received a B.A. and Ph.D. in Mathematics in 1973 and 1977, respectively from Cambridge. After spending two years as a research fellow at St. John’s College, Cambridge, he joined the Department of Statistics at the University of California, Berkeley in 1979 where he spent the rest of his academic career until retiring in 2018. He is known for seminal contributions on many topics within probability including weak convergence and tightness, exchangeability, Markov chain mixing times, Poisson clumping heuristic and limit theory for large discrete random structures including random trees, stochastic coagulation and fragmentation systems, models of complex networks and interacting particle systems on such structures. For his contributions to the field, he has received numerous honors and awards including the Rollo David-son prize in 1980, the inaugural Loeve prize in Probability in 1993, and the Brouwer medal in 2021, and being elected as an IMS fellow in 1985, Fellow of the Royal Society in 1994, Fellow of the American Academy of Arts and Sciences in 2004, elected to the National Academy of Sciences (foreign associate) in 2010, ICM plenary speaker in 2010 and AMS fellow in 2012.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44257148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Covariate-Adjusted ROC Curve: The Concept and Its Importance, Review of Inferential Methods, and a New Bayesian Estimator 协变量校正ROC曲线:概念及其重要性,推理方法综述,以及一种新的贝叶斯估计量
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 DOI: 10.1214/21-sts839
Vanda Inácio, M. Rodríguez-Álvarez
Accurate diagnosis of disease is of fundamental importance in clinical practice and medical research. Before a medical diagnostic test is routinely used in practice, its ability to distinguish between diseased and nondiseased states must be rigorously assessed. The receiver operating characteristic (ROC) curve is the most popular used tool for evaluating the diagnostic accuracy of continuous-outcome tests. It has been acknowledged that several factors (e.g., subject-specific characteristics such as age and/or gender) can affect the test outcomes and accuracy beyond disease status. Recently, the covariate-adjusted ROC curve has been proposed and successfully applied as a global summary measure of diagnostic accuracy that takes covariate information into account. The aim of this paper is three-fold. First, we motivate the importance of including covariate-information, whenever available, in ROC analysis and, in particular, how the covariate-adjusted ROC curve is an important tool in this context. Second, we review and provide insight on the existing approaches for estimating the covariate-adjusted ROC curve. Third, we develop a highly flexible Bayesian method, based on the combination of a Dirichlet process mixture of additive normal models and the Bayesian bootstrap, for conducting inference about the covariate-adjusted ROC curve. A simulation study is conducted to assess the performance of the different methods and it also demonstrates the ability of our proposed Bayesian model to successfully recover the true covariate-adjusted ROC curve and to produce valid inferences in a variety of complex scenarios. The methods are applied to an endocrine study where the goal is to assess the accuracy of the body mass index, adjusted for age and gender, for detecting clusters of cardiovascular disease risk factors. key words: Classification accuracy; Covariate-adjustment; Decision threshold; Diagnostic test; Dirichlet process (mixture) model; Receiver operating characteristic curve. Vanda Inácio, School of Mathematics, University of Edinburgh, Scotland, UK (vanda.inacio@ed.ac.uk). Maŕıa Xosé RodŕıguezÁlvarez, BCAM-Basque Center for Applied Mathematics & IKERBASQUE, Basque Foundation for Science, Bilbao, Basque Country, Spain (mxrodriguez@bcamath.org).
准确诊断疾病在临床实践和医学研究中具有重要意义。在医学诊断测试被常规应用于实践之前,必须严格评估其区分疾病和非疾病状态的能力。受试者工作特性(ROC)曲线是评估连续结果测试诊断准确性的最常用工具。人们已经认识到,有几个因素(例如,受试者的特定特征,如年龄和/或性别)会影响疾病状态之外的测试结果和准确性。最近,协变量调整的ROC曲线被提出并成功应用于考虑协变量信息的诊断准确性的全局汇总测量。本文的目的有三个方面。首先,我们强调了在ROC分析中尽可能包含协变量信息的重要性,特别是协变量调整的ROC曲线在这种情况下是如何成为一个重要工具的。其次,我们回顾并深入了解现有的估计协变量调整ROC曲线的方法。第三,我们开发了一种高度灵活的贝叶斯方法,基于加性正态模型的狄利克雷过程混合和贝叶斯自举,用于对协变量调整的ROC曲线进行推断。进行了一项模拟研究来评估不同方法的性能,它还证明了我们提出的贝叶斯模型能够成功恢复真正的协变量调整ROC曲线,并在各种复杂场景中产生有效的推断。这些方法应用于内分泌研究,目的是评估根据年龄和性别调整的体重指数的准确性,以检测心血管疾病风险因素。关键词:分类准确性;协变量调整;决策阈值;诊断测试;狄利克雷过程(混合物)模型;接收器工作特性曲线。Vanda Inácio,英国苏格兰爱丁堡大学数学学院(vanda.inacio@ed.ac.uk)。西班牙巴斯克国家毕尔巴鄂巴斯克科学基金会,BCAM巴斯克应用数学中心和IKERBASQUE(mxrodriguez@bcamath.org)。
{"title":"The Covariate-Adjusted ROC Curve: The Concept and Its Importance, Review of Inferential Methods, and a New Bayesian Estimator","authors":"Vanda Inácio, M. Rodríguez-Álvarez","doi":"10.1214/21-sts839","DOIUrl":"https://doi.org/10.1214/21-sts839","url":null,"abstract":"Accurate diagnosis of disease is of fundamental importance in clinical practice and medical research. Before a medical diagnostic test is routinely used in practice, its ability to distinguish between diseased and nondiseased states must be rigorously assessed. The receiver operating characteristic (ROC) curve is the most popular used tool for evaluating the diagnostic accuracy of continuous-outcome tests. It has been acknowledged that several factors (e.g., subject-specific characteristics such as age and/or gender) can affect the test outcomes and accuracy beyond disease status. Recently, the covariate-adjusted ROC curve has been proposed and successfully applied as a global summary measure of diagnostic accuracy that takes covariate information into account. The aim of this paper is three-fold. First, we motivate the importance of including covariate-information, whenever available, in ROC analysis and, in particular, how the covariate-adjusted ROC curve is an important tool in this context. Second, we review and provide insight on the existing approaches for estimating the covariate-adjusted ROC curve. Third, we develop a highly flexible Bayesian method, based on the combination of a Dirichlet process mixture of additive normal models and the Bayesian bootstrap, for conducting inference about the covariate-adjusted ROC curve. A simulation study is conducted to assess the performance of the different methods and it also demonstrates the ability of our proposed Bayesian model to successfully recover the true covariate-adjusted ROC curve and to produce valid inferences in a variety of complex scenarios. The methods are applied to an endocrine study where the goal is to assess the accuracy of the body mass index, adjusted for age and gender, for detecting clusters of cardiovascular disease risk factors. key words: Classification accuracy; Covariate-adjustment; Decision threshold; Diagnostic test; Dirichlet process (mixture) model; Receiver operating characteristic curve. Vanda Inácio, School of Mathematics, University of Edinburgh, Scotland, UK (vanda.inacio@ed.ac.uk). Maŕıa Xosé RodŕıguezÁlvarez, BCAM-Basque Center for Applied Mathematics & IKERBASQUE, Basque Foundation for Science, Bilbao, Basque Country, Spain (mxrodriguez@bcamath.org).","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43020500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
High-Performance Statistical Computing in the Computing Environments of the 2020s. 2020 年代计算环境中的高性能统计计算。
IF 3.9 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 Epub Date: 2022-10-13 DOI: 10.1214/21-sts835
Seyoon Ko, Hua Zhou, Jin J Zhou, Joong-Ho Won

Technological advances in the past decade, hardware and software alike, have made access to high-performance computing (HPC) easier than ever. We review these advances from a statistical computing perspective. Cloud computing makes access to supercomputers affordable. Deep learning software libraries make programming statistical algorithms easy and enable users to write code once and run it anywhere-from a laptop to a workstation with multiple graphics processing units (GPUs) or a supercomputer in a cloud. Highlighting how these developments benefit statisticians, we review recent optimization algorithms that are useful for high-dimensional models and can harness the power of HPC. Code snippets are provided to demonstrate the ease of programming. We also provide an easy-to-use distributed matrix data structure suitable for HPC. Employing this data structure, we illustrate various statistical applications including large-scale positron emission tomography and 1-regularized Cox regression. Our examples easily scale up to an 8-GPU workstation and a 720-CPU-core cluster in a cloud. As a case in point, we analyze the onset of type-2 diabetes from the UK Biobank with 200,000 subjects and about 500,000 single nucleotide polymorphisms using the HPC 1-regularized Cox regression. Fitting this half-million-variate model takes less than 45 minutes and reconfirms known associations. To our knowledge, this is the first demonstration of the feasibility of penalized regression of survival outcomes at this scale.

过去十年中,硬件和软件方面的技术进步使高性能计算(HPC)的使用比以往任何时候都更加便捷。我们将从统计计算的角度回顾这些进步。云计算使超级计算机的使用变得经济实惠。深度学习软件库使统计算法编程变得简单,用户只需编写一次代码,即可在任何地方运行--从笔记本电脑到配备多个图形处理器(GPU)的工作站或云计算中的超级计算机。在重点介绍这些发展如何使统计学家受益的同时,我们回顾了最近的优化算法,这些算法对高维模型非常有用,而且可以利用高性能计算的强大功能。我们还提供了代码片段,以演示编程的便捷性。我们还提供了适用于高性能计算的易用分布式矩阵数据结构。利用这种数据结构,我们展示了各种统计应用,包括大规模正电子发射断层扫描和 ℓ1-regularized Cox 回归。我们的示例可以轻松扩展到 8 GPU 工作站和云中的 720 CPU 核心集群。例如,我们使用 HPC ℓ1-regularized Cox 回归分析了英国生物库中 20 万受试者和约 50 万单核苷酸多态性的 2 型糖尿病发病情况。拟合这个包含 50 万个变量的模型只需不到 45 分钟的时间,并能重新确认已知的关联。据我们所知,这是首次展示这种规模的生存结果惩罚回归的可行性。
{"title":"High-Performance Statistical Computing in the Computing Environments of the 2020s.","authors":"Seyoon Ko, Hua Zhou, Jin J Zhou, Joong-Ho Won","doi":"10.1214/21-sts835","DOIUrl":"10.1214/21-sts835","url":null,"abstract":"<p><p>Technological advances in the past decade, hardware and software alike, have made access to high-performance computing (HPC) easier than ever. We review these advances from a statistical computing perspective. Cloud computing makes access to supercomputers affordable. Deep learning software libraries make programming statistical algorithms easy and enable users to write code once and run it anywhere-from a laptop to a workstation with multiple graphics processing units (GPUs) or a supercomputer in a cloud. Highlighting how these developments benefit statisticians, we review recent optimization algorithms that are useful for high-dimensional models and can harness the power of HPC. Code snippets are provided to demonstrate the ease of programming. We also provide an easy-to-use distributed matrix data structure suitable for HPC. Employing this data structure, we illustrate various statistical applications including large-scale positron emission tomography and <i>ℓ</i><sub>1</sub>-regularized Cox regression. Our examples easily scale up to an 8-GPU workstation and a 720-CPU-core cluster in a cloud. As a case in point, we analyze the onset of type-2 diabetes from the UK Biobank with 200,000 subjects and about 500,000 single nucleotide polymorphisms using the HPC <i>ℓ</i><sub>1</sub>-regularized Cox regression. Fitting this half-million-variate model takes less than 45 minutes and reconfirms known associations. To our knowledge, this is the first demonstration of the feasibility of penalized regression of survival outcomes at this scale.</p>","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"37 4","pages":"494-518"},"PeriodicalIF":3.9,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168006/pdf/nihms-1884249.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9502219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comments on Confidence as Likelihood by Pawitan and Lee in Statistical Science, November 2021 Pawitan和Lee在《统计科学》杂志上对置信度作为可能性的评论,2021年11月
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 DOI: 10.1214/22-sts862
M. Lavine, J. F. Bjørnstad
. Pawitan and Lee (2021) attempt to show a correspondence between confidence and likelihood, specifically, that “confidence is in fact an extended likelihood” (Pawitan and Lee, 2021, abstract). The word “extended” means that the likelihood function can accommodate unobserved random variables such as random effects and future values; see Bjørnstad (1996) for details. Here we argue that the extended likelihood presented by Pawitan and Lee (2021) is not the correct extended likelihood and does not justify interpreting confidence as likelihood.
Pawitan和Lee(2021)试图展示信心和可能性之间的对应关系,特别是“信心实际上是一种扩展的可能性”(Pawitan and Lee,2021,摘要)。“扩展”一词意味着似然函数可以容纳未观察到的随机变量,如随机效应和未来值;详见Bjørnstad(1996)。在这里,我们认为Pawitan和Lee(2021)提出的扩展可能性不是正确的扩展可能性,也不能证明将置信度解释为可能性是合理的。
{"title":"Comments on Confidence as Likelihood by Pawitan and Lee in Statistical Science, November 2021","authors":"M. Lavine, J. F. Bjørnstad","doi":"10.1214/22-sts862","DOIUrl":"https://doi.org/10.1214/22-sts862","url":null,"abstract":". Pawitan and Lee (2021) attempt to show a correspondence between confidence and likelihood, specifically, that “confidence is in fact an extended likelihood” (Pawitan and Lee, 2021, abstract). The word “extended” means that the likelihood function can accommodate unobserved random variables such as random effects and future values; see Bjørnstad (1996) for details. Here we argue that the extended likelihood presented by Pawitan and Lee (2021) is not the correct extended likelihood and does not justify interpreting confidence as likelihood.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42862716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Regression Perspective on Generalized Distance Covariance and the Hilbert–Schmidt Independence Criterion 广义距离协方差与Hilbert–Schmidt独立性准则的回归分析
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 DOI: 10.1214/21-sts841
Dominic Edelmann, J. Goeman
In a seminal paper, Sejdinovic, et al. [49] showed the equivalence of the Hilbert-Schmidt Independence Criterion (HSIC) [20] and a generalization of distance covariance [62]. In this paper the two notions of dependence are unified with a third prominent concept for independence testing, the “global test” introduced in [16]. The new viewpoint provides novel insights into all three test traditions, as well as a unified overall view of the way all three tests contrast with classical association tests. As our main result, a regression perspective on HSIC and generalized distance covariance is obtained, allowing such tests to be used with nuisance covariates or for survival data. Several more examples of cross-fertilization of the three traditions are provided, involving theoretical results and novel methodology. To illustrate the difference between classical statistical tests and the unified HSIC/distance covariance/global tests we investigate the case of association between two categorical variables in depth.
Sejdinovic等人[49]在一篇开创性的论文中展示了希尔伯特-施密特独立性准则(HSIC)[20]的等价性和距离协方差的推广[62]。在本文中,依赖性的两个概念与独立性测试的第三个突出概念相统一,即[16]中引入的“全局测试”。这一新观点为所有三种测试传统提供了新颖的见解,并对所有三种考试与经典联想测试的对比方式提供了统一的整体观点。作为我们的主要结果,获得了HSIC和广义距离协方差的回归观点,允许将此类测试与干扰协变量或生存数据一起使用。文中还列举了三种传统相互融合的实例,包括理论成果和新颖的方法论。为了说明经典统计检验和统一的HSIC/距离协方差/全局检验之间的差异,我们深入研究了两个分类变量之间的关联情况。
{"title":"A Regression Perspective on Generalized Distance Covariance and the Hilbert–Schmidt Independence Criterion","authors":"Dominic Edelmann, J. Goeman","doi":"10.1214/21-sts841","DOIUrl":"https://doi.org/10.1214/21-sts841","url":null,"abstract":"In a seminal paper, Sejdinovic, et al. [49] showed the equivalence of the Hilbert-Schmidt Independence Criterion (HSIC) [20] and a generalization of distance covariance [62]. In this paper the two notions of dependence are unified with a third prominent concept for independence testing, the “global test” introduced in [16]. The new viewpoint provides novel insights into all three test traditions, as well as a unified overall view of the way all three tests contrast with classical association tests. As our main result, a regression perspective on HSIC and generalized distance covariance is obtained, allowing such tests to be used with nuisance covariates or for survival data. Several more examples of cross-fertilization of the three traditions are provided, involving theoretical results and novel methodology. To illustrate the difference between classical statistical tests and the unified HSIC/distance covariance/global tests we investigate the case of association between two categorical variables in depth.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46642491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Approximate Confidence Intervals for a Binomial p—Once Again 二项p的近似置信区间
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 DOI: 10.1214/21-sts837
Per Gösta Andersson
The problem of constructing a reasonably simple yet well behaved confidence interval for a binomial parameter p is old but still fascinating and surprisingly complex. During the last century many alternatives to the poorly behaved standard Wald interval have been suggested. It seems though that the Wald interval is still much in use in spite of many efforts over the years through publications to point out its deficiencies. This paper constitutes yet another attempt to provide an alternative and it builds on a special case of a general technique for adjusted intervals primarily based on Wald type statistics. The main idea is to construct an approximate pivot with uncorrelated, or nearly uncorrelated, components. The resulting (AN) Andersson-Nerman interval, as well as a modification thereof, is compared with the well renowned Wilson and AC (Agresti-Coull) intervals and the subsequent discussion will in itself hopefully shed some new light on this seemingly elementary interval estimation situation. Generally, an alternative to the Wald interval is to be judged not only by performance, its expression should also indicate why we will obtain a better behaved interval. It is argued that the well-behaved AN interval meets this requirement.
为二项式参数p构造一个相当简单但表现良好的置信区间的问题由来已久,但仍然令人着迷,而且异常复杂。在上个世纪,人们提出了许多替代表现不佳的标准瓦尔德区间的方法。尽管多年来通过出版物指出了瓦尔德区间的不足,但瓦尔德区间似乎仍在大量使用。本文是提供一种替代方法的又一尝试,它建立在主要基于Wald型统计的调整区间的一般技术的特例之上。其主要思想是构造具有不相关或几乎不相关分量的近似枢轴。将由此产生的(AN)Andersson-Nerman区间及其修改与著名的Wilson和AC(Agresti-Coull)区间进行比较,随后的讨论本身有望为这种看似基本的区间估计情况提供一些新的线索。一般来说,Wald区间的替代方案不仅要通过性能来判断,它的表达式还应该表明为什么我们会获得表现更好的区间。认为表现良好的AN区间满足这一要求。
{"title":"Approximate Confidence Intervals for a Binomial p—Once Again","authors":"Per Gösta Andersson","doi":"10.1214/21-sts837","DOIUrl":"https://doi.org/10.1214/21-sts837","url":null,"abstract":"The problem of constructing a reasonably simple yet well behaved confidence interval for a binomial parameter p is old but still fascinating and surprisingly complex. During the last century many alternatives to the poorly behaved standard Wald interval have been suggested. It seems though that the Wald interval is still much in use in spite of many efforts over the years through publications to point out its deficiencies. This paper constitutes yet another attempt to provide an alternative and it builds on a special case of a general technique for adjusted intervals primarily based on Wald type statistics. The main idea is to construct an approximate pivot with uncorrelated, or nearly uncorrelated, components. The resulting (AN) Andersson-Nerman interval, as well as a modification thereof, is compared with the well renowned Wilson and AC (Agresti-Coull) intervals and the subsequent discussion will in itself hopefully shed some new light on this seemingly elementary interval estimation situation. Generally, an alternative to the Wald interval is to be judged not only by performance, its expression should also indicate why we will obtain a better behaved interval. It is argued that the well-behaved AN interval meets this requirement.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41959851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Interpreting p-Values and Confidence Intervals Using Well-Calibrated Null Preference Priors 使用校准良好的零偏好先验解释p值和置信区间
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-11-01 DOI: 10.1214/21-sts833
M. Fay, M. Proschan, E. Brittain, R. Tiwari
We propose well-calibrated null preference priors for use with one-sided hypothesis tests, such that resulting Bayesian and frequentist inferences agree. Null preference priors mean that they have nearly 100% of their prior belief in the null hypothesis, and well-calibrated priors mean that the resulting posterior beliefs in the alternative hypothesis are not overconfident. This formulation expands the class of problems giving Bayes-frequentist agreement to include problems involving discrete distributions such as binomial and negative binomial oneand two-sample exact (i.e., valid) tests. When applicable, these priors give posterior belief in the null hypothesis that is a valid p-value, and the null preference prior emphasizes that large p-values may simply represent insufficient data to overturn prior belief. This formulation gives a Bayesian interpretation of some common frequentist tests, as well as more intuitively explaining lesser known and less straightforward confidence intervals for two-sample tests.
我们提出了校准良好的零偏好先验,用于单侧假设检验,从而得出贝叶斯和频率论推断一致。零偏好先验意味着他们对零假设有近100%的先验信念,而校准良好的先验意味着对替代假设的后验信念不会过于自信。该公式扩展了给出贝叶斯频率论一致性的一类问题,包括涉及离散分布的问题,如二项式和负二项式一样本和两样本精确(即有效)检验。在适用的情况下,这些先验给出了零假设的后验信念,即有效的p值,而零偏好先验强调大的p值可能只是代表不足以推翻先验信念的数据。该公式对一些常见的频繁度测试进行了贝叶斯解释,并更直观地解释了两个样本测试的鲜为人知和不太直接的置信区间。
{"title":"Interpreting p-Values and Confidence Intervals Using Well-Calibrated Null Preference Priors","authors":"M. Fay, M. Proschan, E. Brittain, R. Tiwari","doi":"10.1214/21-sts833","DOIUrl":"https://doi.org/10.1214/21-sts833","url":null,"abstract":"We propose well-calibrated null preference priors for use with one-sided hypothesis tests, such that resulting Bayesian and frequentist inferences agree. Null preference priors mean that they have nearly 100% of their prior belief in the null hypothesis, and well-calibrated priors mean that the resulting posterior beliefs in the alternative hypothesis are not overconfident. This formulation expands the class of problems giving Bayes-frequentist agreement to include problems involving discrete distributions such as binomial and negative binomial oneand two-sample exact (i.e., valid) tests. When applicable, these priors give posterior belief in the null hypothesis that is a valid p-value, and the null preference prior emphasizes that large p-values may simply represent insufficient data to overturn prior belief. This formulation gives a Bayesian interpretation of some common frequentist tests, as well as more intuitively explaining lesser known and less straightforward confidence intervals for two-sample tests.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41886684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Probabilistic View on Predictive Constructions for Bayesian Learning 贝叶斯学习预测结构的概率观
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-08-14 DOI: 10.1214/23-sts884
P. Berti, E. Dreassi, F. Leisen, P. Rigo, L. Pratelli
Given a sequence $X=(X_1,X_2,ldots)$ of random observations, a Bayesian forecaster aims to predict $X_{n+1}$ based on $(X_1,ldots,X_n)$ for each $nge 0$. To this end, in principle, she only needs to select a collection $sigma=(sigma_0,sigma_1,ldots)$, called ``strategy"in what follows, where $sigma_0(cdot)=P(X_1incdot)$ is the marginal distribution of $X_1$ and $sigma_n(cdot)=P(X_{n+1}incdotmid X_1,ldots,X_n)$ the $n$-th predictive distribution. Because of the Ionescu-Tulcea theorem, $sigma$ can be assigned directly, without passing through the usual prior/posterior scheme. One main advantage is that no prior probability is to be selected. In a nutshell, this is the predictive approach to Bayesian learning. A concise review of the latter is provided in this paper. We try to put such an approach in the right framework, to make clear a few misunderstandings, and to provide a unifying view. Some recent results are discussed as well. In addition, some new strategies are introduced and the corresponding distribution of the data sequence $X$ is determined. The strategies concern generalized P'olya urns, random change points, covariates and stationary sequences.
给定一个随机观察序列$X=(X_1,X_2,ldots)$,贝叶斯预测者的目标是基于$(X_1,ldots,X_n)$对每个$nge 0$进行预测$X_{n+1}$。为此,原则上,她只需要选择一个集合$sigma=(sigma_0,sigma_1,ldots)$,下面称之为“策略”,其中$sigma_0(cdot)=P(X_1incdot)$是$X_1$的边际分布,$sigma_n(cdot)=P(X_{n+1}incdotmid X_1,ldots,X_n)$是$n$的预测分布。由于Ionescu-Tulcea定理,$sigma$可以直接分配,而无需通过通常的先验/后验方案。一个主要的优点是不需要选择先验概率。简而言之,这就是贝叶斯学习的预测方法。本文对后者作了简要的综述。我们试图将这种方法置于正确的框架中,澄清一些误解,并提供一个统一的观点。本文还讨论了最近的一些研究结果。此外,还引入了一些新的策略,并确定了相应的数据序列$X$的分布。这些策略涉及到广义Pólya回合、随机变化点、协变量和平稳序列。
{"title":"A Probabilistic View on Predictive Constructions for Bayesian Learning","authors":"P. Berti, E. Dreassi, F. Leisen, P. Rigo, L. Pratelli","doi":"10.1214/23-sts884","DOIUrl":"https://doi.org/10.1214/23-sts884","url":null,"abstract":"Given a sequence $X=(X_1,X_2,ldots)$ of random observations, a Bayesian forecaster aims to predict $X_{n+1}$ based on $(X_1,ldots,X_n)$ for each $nge 0$. To this end, in principle, she only needs to select a collection $sigma=(sigma_0,sigma_1,ldots)$, called ``strategy\"in what follows, where $sigma_0(cdot)=P(X_1incdot)$ is the marginal distribution of $X_1$ and $sigma_n(cdot)=P(X_{n+1}incdotmid X_1,ldots,X_n)$ the $n$-th predictive distribution. Because of the Ionescu-Tulcea theorem, $sigma$ can be assigned directly, without passing through the usual prior/posterior scheme. One main advantage is that no prior probability is to be selected. In a nutshell, this is the predictive approach to Bayesian learning. A concise review of the latter is provided in this paper. We try to put such an approach in the right framework, to make clear a few misunderstandings, and to provide a unifying view. Some recent results are discussed as well. In addition, some new strategies are introduced and the corresponding distribution of the data sequence $X$ is determined. The strategies concern generalized P'olya urns, random change points, covariates and stationary sequences.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48914546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistical Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1