首页 > 最新文献

Statistical Science最新文献

英文 中文
A Conversation with Ross Prentice 与罗斯·普伦蒂斯的对话
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-02-01 DOI: 10.1214/21-sts829
L. Hsu, C. Kooperberg
Ross L. Prentice received his B.Sc. from the University of Waterloo and his Ph.D. from the University of Toronto. He joined the University of Washington (UW) and the Fred Hutchinson Cancer Research Center (the Hutch) in 1974, and is currently Professor of Biostatistics at these institutions. He was Senior Vice President at the Hutch, and Director of its Public Health Sciences Division, for more than 25 years. Dr. Prentice’s expertise and research interests are in the fields of biostatistics, epidemiology, and disease prevention. He played a central role in the conception, design, and implementation of the Women’s Health Initiative. In statistical and medical literature he has over 500 scientific papers, including more than 40 with 500 or more citations. His substantial contributions to the theory of population and clinical research include the use of surrogate endpoints and case-cohort designs and other areas such as survival analysis, nutritional epidemiology, genetic epidemiology, biomarkers, and measurement error. Dr. Prentice is recognized for his mentoring of students and junior colleagues, and for his generous collaborations. Dr. Prentice has received numerous awards for his work, including an honorary doctorate in mathematics from the University of Waterloo, the Mantel Award for Lifetime Contributions to Statistics in Epidemiology from the American Statistical Association, the Mortimer Spiegelman Award from the American Public Health Association, the Committee of Presidents of Statistical Societies Presidents’ Award and RA Fisher Award, the Marvin Zelen Leadership Award for Outstanding Achievement in Statistical Science from Harvard University, the American Association of Cancer Research/American Cancer Society Award for Research Excellence in Cancer Epidemiology and Prevention, and the American Association for Cancer Research Team Science Award. He was elected to the Institute of Medicine/National Academy of Medicine in 1990. The Ross L. Prentice Endowed Professorship of Biostatistical Collaboration was created at the UW in 2005 and has been awarded every year since its inception. The interior space of the Public Health Sciences building at the Hutch has been named the Ross L. Prentice Atrium. In his spare time, Ross enjoys sports including water skiing, golf, running, and spending time with his wife, Didi, and with his daughters, sons-in-law, and grandchildren. He ran daily from when he was in his 20s until his knees objected about 10 years ago. This interview took place with Li Hsu and Charles Kooperberg via Zoom in December 2020.
Ross L.Prentice在滑铁卢大学获得理学学士学位,在多伦多大学获得博士学位。他于1974年加入华盛顿大学(UW)和弗雷德·哈钦森癌症研究中心(Hutch),目前是这些机构的生物统计学教授。他担任哈奇医院高级副院长兼公共卫生科学部主任超过25年。Prentice博士的专业知识和研究兴趣是生物统计学、流行病学和疾病预防领域。他在妇女健康倡议的构思、设计和实施中发挥了核心作用。在统计学和医学文献中,他有500多篇科学论文,其中40多篇引用次数达到或超过500次。他对人口和临床研究理论的重大贡献包括使用替代终点和病例队列设计,以及其他领域,如生存分析、营养流行病学、遗传流行病学、生物标志物和测量误差。普伦蒂斯博士因其对学生和初级同事的指导以及慷慨的合作而受到认可。普伦蒂斯博士的工作获得了许多奖项,包括滑铁卢大学的数学荣誉博士学位、美国统计协会的曼特尔流行病学统计终身贡献奖、美国公共卫生协会的莫蒂默·斯皮格尔曼奖、,统计学会主席委员会主席奖和RA Fisher奖、哈佛大学统计科学杰出成就马文·泽伦领导奖、美国癌症研究协会/美国癌症学会癌症流行病学和预防卓越研究奖、,以及美国癌症研究团队科学奖。1990年,他被选入美国国家医学院医学研究所。罗斯·L·普伦蒂斯授予的生物统计学协作教授职位于2005年在华盛顿大学设立,自成立以来每年都会颁发。哈奇公共卫生科学大楼的内部空间被命名为罗斯·L·普伦蒂斯中庭。在业余时间,罗斯喜欢运动,包括滑水、高尔夫、跑步,并与妻子迪迪、女儿、女婿和孙子孙女共度时光。从20多岁开始,他每天都在跑步,直到大约10年前膝盖出现问题。本次采访于2020年12月通过Zoom对李旭和查尔斯·库珀伯格进行。
{"title":"A Conversation with Ross Prentice","authors":"L. Hsu, C. Kooperberg","doi":"10.1214/21-sts829","DOIUrl":"https://doi.org/10.1214/21-sts829","url":null,"abstract":"Ross L. Prentice received his B.Sc. from the University of Waterloo and his Ph.D. from the University of Toronto. He joined the University of Washington (UW) and the Fred Hutchinson Cancer Research Center (the Hutch) in 1974, and is currently Professor of Biostatistics at these institutions. He was Senior Vice President at the Hutch, and Director of its Public Health Sciences Division, for more than 25 years. Dr. Prentice’s expertise and research interests are in the fields of biostatistics, epidemiology, and disease prevention. He played a central role in the conception, design, and implementation of the Women’s Health Initiative. In statistical and medical literature he has over 500 scientific papers, including more than 40 with 500 or more citations. His substantial contributions to the theory of population and clinical research include the use of surrogate endpoints and case-cohort designs and other areas such as survival analysis, nutritional epidemiology, genetic epidemiology, biomarkers, and measurement error. Dr. Prentice is recognized for his mentoring of students and junior colleagues, and for his generous collaborations. Dr. Prentice has received numerous awards for his work, including an honorary doctorate in mathematics from the University of Waterloo, the Mantel Award for Lifetime Contributions to Statistics in Epidemiology from the American Statistical Association, the Mortimer Spiegelman Award from the American Public Health Association, the Committee of Presidents of Statistical Societies Presidents’ Award and RA Fisher Award, the Marvin Zelen Leadership Award for Outstanding Achievement in Statistical Science from Harvard University, the American Association of Cancer Research/American Cancer Society Award for Research Excellence in Cancer Epidemiology and Prevention, and the American Association for Cancer Research Team Science Award. He was elected to the Institute of Medicine/National Academy of Medicine in 1990. The Ross L. Prentice Endowed Professorship of Biostatistical Collaboration was created at the UW in 2005 and has been awarded every year since its inception. The interior space of the Public Health Sciences building at the Hutch has been named the Ross L. Prentice Atrium. In his spare time, Ross enjoys sports including water skiing, golf, running, and spending time with his wife, Didi, and with his daughters, sons-in-law, and grandchildren. He ran daily from when he was in his 20s until his knees objected about 10 years ago. This interview took place with Li Hsu and Charles Kooperberg via Zoom in December 2020.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"1 1","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41588952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion Smoothing for Spatial Point Patterns 空间点模式的扩散平滑
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-02-01 DOI: 10.1214/21-sts825
A. Baddeley, Tilman M. Davies, S. Rakshit, Gopalan M. Nair, Greg McSwiggan
Traditional kernel methods for estimating the spatially-varying density of points in a spatial point pattern may exhibit unrealistic artefacts, in addition to the familiar problems of bias and overor undersmoothing. Performance can be improved by using diffusion smoothing, in which the smoothing kernel is the heat kernel on the spatial domain. This paper develops diffusion smoothing into a practical statistical methodology for two-dimensional spatial point pattern data. We clarify the advantages and disadvantages of diffusion smoothing over Gaussian kernel smoothing. Adaptive smoothing, where the smoothing bandwidth is spatially-varying, can be performed by adopting a spatiallyvarying diffusion rate: this avoids technical problems with adaptive Gaussian smoothing and has substantially better performance. We introduce a new form of adaptive smoothing using lagged arrival times, which has good performance and improved robustness. Applications in archaeology and epidemiology are demonstrated. The methods are implemented in open-source R code. AMS 2000 subject classifications: Primary 62G07; secondary 62M30.
传统的核方法用于估计空间点模式中点的空间变化密度,除了常见的偏差和过平滑或欠平滑问题外,还可能出现不切实际的伪影。使用扩散平滑可以提高性能,其中平滑核是空间域上的热核。本文将扩散平滑发展成为一种实用的二维空间点图数据统计方法。我们阐明了扩散平滑相对于高斯核平滑的优缺点。自适应平滑,其中平滑带宽是空间变化的,可以通过采用空间变化的扩散速率来实现,这避免了自适应高斯平滑的技术问题,并且具有更好的性能。我们引入了一种新的使用滞后到达时间的自适应平滑形式,它具有良好的性能和增强的鲁棒性。演示了在考古学和流行病学中的应用。这些方法是在开源的R代码中实现的。AMS 2000学科分类:初级62G07;二次62 m30。
{"title":"Diffusion Smoothing for Spatial Point Patterns","authors":"A. Baddeley, Tilman M. Davies, S. Rakshit, Gopalan M. Nair, Greg McSwiggan","doi":"10.1214/21-sts825","DOIUrl":"https://doi.org/10.1214/21-sts825","url":null,"abstract":"Traditional kernel methods for estimating the spatially-varying density of points in a spatial point pattern may exhibit unrealistic artefacts, in addition to the familiar problems of bias and overor undersmoothing. Performance can be improved by using diffusion smoothing, in which the smoothing kernel is the heat kernel on the spatial domain. This paper develops diffusion smoothing into a practical statistical methodology for two-dimensional spatial point pattern data. We clarify the advantages and disadvantages of diffusion smoothing over Gaussian kernel smoothing. Adaptive smoothing, where the smoothing bandwidth is spatially-varying, can be performed by adopting a spatiallyvarying diffusion rate: this avoids technical problems with adaptive Gaussian smoothing and has substantially better performance. We introduce a new form of adaptive smoothing using lagged arrival times, which has good performance and improved robustness. Applications in archaeology and epidemiology are demonstrated. The methods are implemented in open-source R code. AMS 2000 subject classifications: Primary 62G07; secondary 62M30.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47305929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical Dependence: Beyond Pearson’s ρ 统计相关性:超越皮尔逊ρ
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-02-01 DOI: 10.1214/21-sts823
D. Tjøstheim, Håkon Otneim, Bård Støve
Pearson’s ρ is the most used measure of statistical dependence. It gives a complete characterization of dependence in the Gaussian case, and it also works well in some non-Gaussian situations. It is well known, however, that it has a number of shortcomings; in particular for heavy tailed distributions and in nonlinear situations, where it may produce misleading, and even disastrous results. In recent years a number of alternatives have been proposed. In this paper, we will survey these developments, especially results obtained in the last couple of decades. Among measures discussed are the copula, distribution-based measures, the distance covariance, the HSIC measure popular in machine learning, and finally the local Gaussian correlation, which is a local version of Pearson’s ρ. Throughout we put the emphasis on conceptual developments and a comparison of these. We point out relevant references to technical details as well as comparative empirical and simulated experiments. There is a broad selection of references under each topic treated.
皮尔逊ρ是最常用的统计相关性度量。它给出了高斯情况下依赖性的完整表征,在一些非高斯情况下也能很好地工作。然而,众所周知,它有许多缺点;特别是在重尾分布和非线性情况下,它可能会产生误导,甚至灾难性的结果。近年来,人们提出了许多替代方案。在本文中,我们将调查这些发展,特别是在过去几十年中获得的结果。讨论的度量包括copula、基于分布的度量、距离协方差、机器学习中流行的HSIC度量,最后是局部高斯相关,它是Pearsonρ的局部版本。在整个过程中,我们都把重点放在概念发展和这些发展的比较上。我们指出了技术细节的相关参考资料,以及比较经验和模拟实验。在处理的每个主题下都有大量的参考文献。
{"title":"Statistical Dependence: Beyond Pearson’s ρ","authors":"D. Tjøstheim, Håkon Otneim, Bård Støve","doi":"10.1214/21-sts823","DOIUrl":"https://doi.org/10.1214/21-sts823","url":null,"abstract":"Pearson’s ρ is the most used measure of statistical dependence. It gives a complete characterization of dependence in the Gaussian case, and it also works well in some non-Gaussian situations. It is well known, however, that it has a number of shortcomings; in particular for heavy tailed distributions and in nonlinear situations, where it may produce misleading, and even disastrous results. In recent years a number of alternatives have been proposed. In this paper, we will survey these developments, especially results obtained in the last couple of decades. Among measures discussed are the copula, distribution-based measures, the distance covariance, the HSIC measure popular in machine learning, and finally the local Gaussian correlation, which is a local version of Pearson’s ρ. Throughout we put the emphasis on conceptual developments and a comparison of these. We point out relevant references to technical details as well as comparative empirical and simulated experiments. There is a broad selection of references under each topic treated.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49302811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Some Perspectives on Inference in High Dimensions 高维推理的若干观点
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-02-01 DOI: 10.1214/21-sts824
H. Battey, D. Cox
With very large amounts of data, important aspects of statistical analysis may appear largely descriptive in that the role of probability sometimes seems limited or totally absent. The main emphasis of the present paper lies on contexts where formulation in terms of a probabilistic model is feasible and fruitful but to be at all realistic large numbers of unknown parameters need consideration. Then many of the standard approaches to statistical analysis, for instance direct application of the method of maximum likelihood, or the use of flat priors, often encounter difficulties. After a brief discussion of broad conceptual issues, we provide some new perspectives on aspects of high-dimensional statistical theory, emphasizing a number of open problems.
由于数据量很大,统计分析的重要方面可能在很大程度上是描述性的,因为概率的作用有时似乎有限或完全不存在。本文的主要重点在于概率模型的公式化是可行和富有成效的,但要想成为现实,需要考虑大量未知参数。然后,许多标准的统计分析方法,例如直接应用最大似然法,或使用平面先验,经常会遇到困难。在简要讨论了广泛的概念问题后,我们对高维统计理论的各个方面提供了一些新的视角,强调了一些悬而未决的问题。
{"title":"Some Perspectives on Inference in High Dimensions","authors":"H. Battey, D. Cox","doi":"10.1214/21-sts824","DOIUrl":"https://doi.org/10.1214/21-sts824","url":null,"abstract":"With very large amounts of data, important aspects of statistical analysis may appear largely descriptive in that the role of probability sometimes seems limited or totally absent. The main emphasis of the present paper lies on contexts where formulation in terms of a probabilistic model is feasible and fruitful but to be at all realistic large numbers of unknown parameters need consideration. Then many of the standard approaches to statistical analysis, for instance direct application of the method of maximum likelihood, or the use of flat priors, often encounter difficulties. After a brief discussion of broad conceptual issues, we provide some new perspectives on aspects of high-dimensional statistical theory, emphasizing a number of open problems.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46293804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Aitchison’s Compositional Data Analysis 40 Years on: A Reappraisal 艾奇逊40年来的成分数据分析:再评价
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-01-13 DOI: 10.1214/22-sts880
M. Greenacre, E. Grunsky, J. Bacon-Shone, Ionas Erb, T. Quinn
The development of John Aitchison's approach to compositional data analysis is followed since his paper read to the Royal Statistical Society in 1982. Aitchison's logratio approach, which was proposed to solve the problematic aspects of working with data with a fixed sum constraint, is summarized and reappraised. It is maintained that the properties on which this approach was originally built, the main one being subcompositional coherence, are not required to be satisfied exactly -- quasi-coherence is sufficient, that is near enough to being coherent for all practical purposes. This opens up the field to using simpler data transformations, such as power transformations, that permit zero values in the data. The additional property of exact isometry, which was subsequently introduced and not in Aitchison's original conception, imposed the use of isometric logratio transformations, but these are complicated and problematic to interpret, involving ratios of geometric means. If this property is regarded as important in certain analytical contexts, for example unsupervised learning, it can be relaxed by showing that regular pairwise logratios, as well as the alternative quasi-coherent transformations, can also be quasi-isometric, meaning they are close enough to exact isometry for all practical purposes. It is concluded that the isometric and related logratio transformations such as pivot logratios are not a prerequisite for good practice, although many authors insist on their obligatory use. This conclusion is fully supported here by case studies in geochemistry and in genomics, where the good performance is demonstrated of pairwise logratios, as originally proposed by Aitchison, or Box-Cox power transforms of the original compositions where no zero replacements are necessary.
自1982年约翰·艾奇逊的论文在英国皇家统计学会上发表以来,他对成分数据分析方法的发展一直受到关注。艾奇逊的logratio方法是为了解决在固定和约束下处理数据的问题而提出的,它被总结和重新评估。有人认为,这种方法最初建立的性质,主要是亚成分相干性,并不需要完全满足——准相干性是足够的,即接近于所有实际目的的相干性。这使得该字段可以使用更简单的数据转换,例如允许数据中的零值的幂转换。精确等距的附加性质随后被引入,而不是艾奇逊最初的概念,强制使用等距的logratio变换,但这些是复杂的和有问题的解释,涉及几何平均的比率。如果这个性质在某些分析环境中被认为是重要的,例如无监督学习,那么它可以通过显示规则的成对logratios以及可选的准相干变换也可以是准等距的来放松,这意味着它们在所有实际目的中都足够接近精确的等距。结论是,等距和相关的坐标变换,如枢轴坐标变换,并不是良好实践的先决条件,尽管许多作者坚持必须使用它们。这一结论在这里得到了地球化学和基因组学案例研究的充分支持,在这些研究中,Aitchison最初提出的两两logratios或原始成分的Box-Cox幂变换的良好性能得到了证明,其中不需要零替换。
{"title":"Aitchison’s Compositional Data Analysis 40 Years on: A Reappraisal","authors":"M. Greenacre, E. Grunsky, J. Bacon-Shone, Ionas Erb, T. Quinn","doi":"10.1214/22-sts880","DOIUrl":"https://doi.org/10.1214/22-sts880","url":null,"abstract":"The development of John Aitchison's approach to compositional data analysis is followed since his paper read to the Royal Statistical Society in 1982. Aitchison's logratio approach, which was proposed to solve the problematic aspects of working with data with a fixed sum constraint, is summarized and reappraised. It is maintained that the properties on which this approach was originally built, the main one being subcompositional coherence, are not required to be satisfied exactly -- quasi-coherence is sufficient, that is near enough to being coherent for all practical purposes. This opens up the field to using simpler data transformations, such as power transformations, that permit zero values in the data. The additional property of exact isometry, which was subsequently introduced and not in Aitchison's original conception, imposed the use of isometric logratio transformations, but these are complicated and problematic to interpret, involving ratios of geometric means. If this property is regarded as important in certain analytical contexts, for example unsupervised learning, it can be relaxed by showing that regular pairwise logratios, as well as the alternative quasi-coherent transformations, can also be quasi-isometric, meaning they are close enough to exact isometry for all practical purposes. It is concluded that the isometric and related logratio transformations such as pivot logratios are not a prerequisite for good practice, although many authors insist on their obligatory use. This conclusion is fully supported here by case studies in geochemistry and in genomics, where the good performance is demonstrated of pairwise logratios, as originally proposed by Aitchison, or Box-Cox power transforms of the original compositions where no zero replacements are necessary.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47827621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
In Praise (and Search) of J. V. Uspensky 在赞美(和搜索)J·V·乌斯彭斯基
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-sts866
P. Diaconis, S. Zabell
. The two of us have shared a fascination with James Victor Uspensky’s 1937 textbook Introduction to Mathematical Probability ever since our graduate student days: it contains many interesting results not found in other books on the same subject in the English language, together with many non-trivial examples, all clearly stated with careful proofs. We present some of Uspensky’s gems to a modern audience hoping to tempt others to read Uspensky for themselves, as well as report on a few of the other mathematical topics he also wrote about (for example, his book on number theory contains early results about perfect shuffles). Uspensky led an interesting life: a member of the Russian Academy of Sciences, he spoke at the 1924 International Congress of Mathematicians in Toronto before leaving Russia in 1929 and coming to the US and Stanford. Comparatively little has been written about him in English; the second half of this paper attempts to remedy this.
从研究生时代起,我们两人就对詹姆斯·维克托·乌斯彭斯基1937年出版的《数学概率论导论》有着共同的兴趣:它包含了许多有趣的结果,这些结果在其他英语书籍中是找不到的,还有许多不平凡的例子,所有这些都经过了仔细的证明。我们向现代观众展示了乌彭斯基的一些精华,希望吸引其他人亲自阅读乌彭斯基,并报道了他也写过的其他一些数学主题(例如,他的数论书包含了关于完美舒夫函数的早期结果)。乌斯彭斯基过着有趣的生活:他是俄罗斯科学院的成员,在1929年离开俄罗斯来到美国和斯坦福大学之前,曾在1924年多伦多举行的国际数学家大会上发表演讲。关于他的英文报道相对较少;本文的后半部分试图对此进行补救。
{"title":"In Praise (and Search) of J. V. Uspensky","authors":"P. Diaconis, S. Zabell","doi":"10.1214/22-sts866","DOIUrl":"https://doi.org/10.1214/22-sts866","url":null,"abstract":". The two of us have shared a fascination with James Victor Uspensky’s 1937 textbook Introduction to Mathematical Probability ever since our graduate student days: it contains many interesting results not found in other books on the same subject in the English language, together with many non-trivial examples, all clearly stated with careful proofs. We present some of Uspensky’s gems to a modern audience hoping to tempt others to read Uspensky for themselves, as well as report on a few of the other mathematical topics he also wrote about (for example, his book on number theory contains early results about perfect shuffles). Uspensky led an interesting life: a member of the Russian Academy of Sciences, he spoke at the 1924 International Congress of Mathematicians in Toronto before leaving Russia in 1929 and coming to the US and Stanford. Comparatively little has been written about him in English; the second half of this paper attempts to remedy this.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45991960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bayesian Adaptive Randomization with Compound Utility Functions 具有复合效用函数的贝叶斯自适应随机化
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/21-sts848
A. Giovagnoli, I. Verdinelli
Bayesian adaptive designs formalize the use of previous knowledge at the planning stage of an experiment, permitting recursive updating of the prior information. They often make use of utility functions, while also allowing for randomization. We review frequentist and Bayesian adaptive design methods and show that some of the frequentist adaptive design methodology can also be employed in a Bayesian context. We use compound utility functions for the Bayesian designs, that are a trade-off between an optimal design information criterion, that represents the acquisition of scientific knowledge, and some ethical or utilitarian gain. We focus on binary response models on two groups with independent Beta prior distributions on the success probabilities. The treatment allocation is shown to converge to the allocation that produces the maximum utility. Special cases are the Bayesian Randomized (simply) Adaptive Compound (BRAC) design, an extension of the frequentist Sequential Maximum Likelihood (SML) design and the Bayesian Randomized (doubly) Adaptive Compound Efficient (BRACE) design, a generalization of the Efficient Randomized Adaptive DEsign (ERADE). Numerical simulation studies compare BRAC with BRACE when D-optimality is the information criterion chosen. In analogy with the frequentist theory, the BRACE-D design appears more efficient than the BRAC-D design.
贝叶斯自适应设计在实验的规划阶段将先前知识的使用形式化,允许对先前信息进行递归更新。它们通常使用效用函数,同时也允许随机化。我们回顾了频率论和贝叶斯自适应设计方法,并表明一些频率论自适应设计方法也可以应用于贝叶斯环境。我们在贝叶斯设计中使用复合效用函数,这是代表获得科学知识的最佳设计信息标准与一些道德或功利收益之间的权衡。我们关注的是成功概率具有独立Beta先验分布的两组的二元响应模型。处理分配将收敛于产生最大效用的分配。特殊情况是贝叶斯随机(简单)自适应复合(BRAC)设计,它是频率序列最大似然(SML)设计的扩展,贝叶斯随机(双重)自适应复合高效(BRACE)设计,是高效随机自适应设计(ERADE)的推广。数值模拟研究了选择d -最优信息准则时BRAC与BRACE的比较。与频率理论类比,BRACE-D设计似乎比BRAC-D设计更有效。
{"title":"Bayesian Adaptive Randomization with Compound Utility Functions","authors":"A. Giovagnoli, I. Verdinelli","doi":"10.1214/21-sts848","DOIUrl":"https://doi.org/10.1214/21-sts848","url":null,"abstract":"Bayesian adaptive designs formalize the use of previous knowledge at the planning stage of an experiment, permitting recursive updating of the prior information. They often make use of utility functions, while also allowing for randomization. We review frequentist and Bayesian adaptive design methods and show that some of the frequentist adaptive design methodology can also be employed in a Bayesian context. We use compound utility functions for the Bayesian designs, that are a trade-off between an optimal design information criterion, that represents the acquisition of scientific knowledge, and some ethical or utilitarian gain. We focus on binary response models on two groups with independent Beta prior distributions on the success probabilities. The treatment allocation is shown to converge to the allocation that produces the maximum utility. Special cases are the Bayesian Randomized (simply) Adaptive Compound (BRAC) design, an extension of the frequentist Sequential Maximum Likelihood (SML) design and the Bayesian Randomized (doubly) Adaptive Compound Efficient (BRACE) design, a generalization of the Efficient Randomized Adaptive DEsign (ERADE). Numerical simulation studies compare BRAC with BRACE when D-optimality is the information criterion chosen. In analogy with the frequentist theory, the BRACE-D design appears more efficient than the BRAC-D design.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46839550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On Some Connections Between Esscher’s Tilting, Saddlepoint Approximations, and Optimal Transportation: A Statistical Perspective Esscher倾斜度、鞍点近似和最优运输之间的一些联系:一个统计视角
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/21-sts847
D. La Vecchia, E. Ronchetti, A. Ilievski
We showcase some unexplored connections between saddlepoint approximations, measure transportation, and some key topics in information theory. To bridge these different areas, we review selectively the fundamental results available in the literature and we draw the connections between them. First, for a generic random variable we explain how the Esscher’s tilting (which is a result rooted in information theory and lies at the heart of saddlepoint approximations) is connected to the solution of the dual Kantorovich problem (which lies at the heart of measure transportation theory) via the Legendre transform of the cumulant generating function. Then, we turn to statistics: we illustrate the connections when the random variable we work with is the sample mean or a statistic with known (either exact or approximate) cumulant generating function. The unveiled connections offer the possibility to look at the saddlepoint approximations from different angles, putting under the spotlight the links to convex analysis (via the notion of duality) or differential geometry (via the notion of geodesic). We feel these possibilities can trigger a knowledge transfer between statistics and other disciplines, like mathematics and machine learning. A discussion on some topics for future research concludes the paper.
我们展示了鞍点近似、测量运输和信息理论中一些关键主题之间的一些未经探索的联系。为了弥合这些不同的领域,我们选择性地回顾了文献中的基本结果,并得出了它们之间的联系。首先,对于一个通用随机变量,我们解释了Esscher倾斜(这是一个植根于信息论的结果,位于鞍点近似的核心)是如何通过累积量生成函数的Legendre变换与对偶Kantorovich问题(位于测度输运理论的核心)的解相联系的。然后,我们转向统计学:我们说明了当我们使用的随机变量是样本均值或具有已知(精确或近似)累积量生成函数的统计量时的联系。公开的连接提供了从不同角度观察鞍点近似的可能性,将凸分析(通过对偶概念)或微分几何(通过测地线概念)的联系置于聚光灯下。我们觉得这些可能性可以引发统计学和其他学科之间的知识转移,比如数学和机器学习。最后,对未来研究的一些主题进行了讨论。
{"title":"On Some Connections Between Esscher’s Tilting, Saddlepoint Approximations, and Optimal Transportation: A Statistical Perspective","authors":"D. La Vecchia, E. Ronchetti, A. Ilievski","doi":"10.1214/21-sts847","DOIUrl":"https://doi.org/10.1214/21-sts847","url":null,"abstract":"We showcase some unexplored connections between saddlepoint approximations, measure transportation, and some key topics in information theory. To bridge these different areas, we review selectively the fundamental results available in the literature and we draw the connections between them. First, for a generic random variable we explain how the Esscher’s tilting (which is a result rooted in information theory and lies at the heart of saddlepoint approximations) is connected to the solution of the dual Kantorovich problem (which lies at the heart of measure transportation theory) via the Legendre transform of the cumulant generating function. Then, we turn to statistics: we illustrate the connections when the random variable we work with is the sample mean or a statistic with known (either exact or approximate) cumulant generating function. The unveiled connections offer the possibility to look at the saddlepoint approximations from different angles, putting under the spotlight the links to convex analysis (via the notion of duality) or differential geometry (via the notion of geodesic). We feel these possibilities can trigger a knowledge transfer between statistics and other disciplines, like mathematics and machine learning. A discussion on some topics for future research concludes the paper.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43550808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Approximating Bayes in the 21st Century 在21世纪近似贝叶斯
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2021-12-20 DOI: 10.1214/22-STS875
G. Martin, David T. Frazier, C. Robert
The 21st century has seen an enormous growth in the development and use of approximate Bayesian methods. Such methods produce computational solutions to certain intractable statistical problems that challenge exact methods like Markov chain Monte Carlo: for instance, models with unavailable likelihoods, high-dimensional models, and models featuring large data sets. These approximate methods are the subject of this review. The aim is to help new researchers in particular -- and more generally those interested in adopting a Bayesian approach to empirical work -- distinguish between different approximate techniques; understand the sense in which they are approximate; appreciate when and why particular methods are useful; and see the ways in which they can can be combined.
21世纪,近似贝叶斯方法的发展和使用出现了巨大的增长。这些方法为某些棘手的统计问题提供了计算解决方案,这些问题挑战了马尔可夫链蒙特卡罗等精确方法:例如,具有不可用可能性的模型、高维模型和具有大数据集的模型。这些近似方法是本综述的主题。其目的是帮助新的研究人员——尤其是那些对采用贝叶斯方法进行实证研究感兴趣的人——区分不同的近似技术;理解它们的近似意义;了解特定方法何时以及为什么有用;看看它们可以结合在一起的方式。
{"title":"Approximating Bayes in the 21st Century","authors":"G. Martin, David T. Frazier, C. Robert","doi":"10.1214/22-STS875","DOIUrl":"https://doi.org/10.1214/22-STS875","url":null,"abstract":"The 21st century has seen an enormous growth in the development and use of approximate Bayesian methods. Such methods produce computational solutions to certain intractable statistical problems that challenge exact methods like Markov chain Monte Carlo: for instance, models with unavailable likelihoods, high-dimensional models, and models featuring large data sets. These approximate methods are the subject of this review. The aim is to help new researchers in particular -- and more generally those interested in adopting a Bayesian approach to empirical work -- distinguish between different approximate techniques; understand the sense in which they are approximate; appreciate when and why particular methods are useful; and see the ways in which they can can be combined.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2021-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44404322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Real-Time Estimation of COVID-19 Infections: Deconvolution and Sensor Fusion 新冠肺炎感染的实时估计:反卷积和传感器融合
IF 5.7 1区 数学 Q1 STATISTICS & PROBABILITY Pub Date : 2021-12-13 DOI: 10.1214/22-sts856
M. Jahja, Andrew Chin, R. Tibshirani
We propose, implement, and evaluate a method to estimate the daily number of new symptomatic COVID-19 infections, at the level of individual U.S. counties, by deconvolving daily reported COVID-19 case counts using an estimated symptom-onset-to-case-report delay distribution. Importantly, we focus on estimating infections in real-time (rather than retrospectively), which poses numerous challenges. To address these, we develop new methodology for both the distribution estimation and deconvolution steps, and we employ a sensor fusion layer (which fuses together predictions from models that are trained to track infections based on auxiliary surveillance streams) in order to improve accuracy and stability.
我们提出、实施并评估了一种方法,通过使用估计的症状对病例报告延迟分布,对每日报告的新冠肺炎病例数进行去卷积,在美国个别县的水平上估计每日新增症状新冠肺炎感染人数。重要的是,我们专注于实时(而不是回顾性)估计感染,这带来了许多挑战。为了解决这些问题,我们为分布估计和反褶积步骤开发了新的方法,并使用了传感器融合层(将训练来跟踪基于辅助监测流的感染的模型的预测融合在一起),以提高准确性和稳定性。
{"title":"Real-Time Estimation of COVID-19 Infections: Deconvolution and Sensor Fusion","authors":"M. Jahja, Andrew Chin, R. Tibshirani","doi":"10.1214/22-sts856","DOIUrl":"https://doi.org/10.1214/22-sts856","url":null,"abstract":"We propose, implement, and evaluate a method to estimate the daily number of new symptomatic COVID-19 infections, at the level of individual U.S. counties, by deconvolving daily reported COVID-19 case counts using an estimated symptom-onset-to-case-report delay distribution. Importantly, we focus on estimating infections in real-time (rather than retrospectively), which poses numerous challenges. To address these, we develop new methodology for both the distribution estimation and deconvolution steps, and we employ a sensor fusion layer (which fuses together predictions from models that are trained to track infections based on auxiliary surveillance streams) in order to improve accuracy and stability.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2021-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43138722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
Statistical Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1