Rejoinder: “Co-citation and Co-authorship Networks of Statisticians”

IF 2.5 2区数学 Q1 ECONOMICS Journal of Business & Economic Statistics Pub Date : 2022-04-03 DOI:10.1080/07350015.2022.2055358

Pengsheng Ji, Jiashun Jin, Z. Ke, Wanshan Li

{"title":"Rejoinder: “Co-citation and Co-authorship Networks of Statisticians”","authors":"Pengsheng Ji, Jiashun Jin, Z. Ke, Wanshan Li","doi":"10.1080/07350015.2022.2055358","DOIUrl":null,"url":null,"abstract":"We thank David Donoho for very encouraging comments. As always, his penetrating vision and deep thoughts are extremely stimulating. We are glad that he summarizes a major philosophical difference between statistics in earlier years (e.g., the time of Francis Galton) and statistics in our time by just a few words: data-first versus model-first. We completely agree with his comment that “each effort by a statistics researcher to understand a newly available type of data enlarges our field; it should be a primary part of the career of statisticians to cultivate an interest in cultivating new types of datasets, so that new methodology can be discovered and developed”; these are exactly the motivations underlying our (several-year) efforts in collecting, cleaning, and analyzing a large-scale high-quality dataset. We would like to add that both traditions have strengths, and combining the strengths of two sides may greatly help statisticians deal with the so-called crisis of the 21st century in statistics we face today. Let us explain the crisis above first. In the model-first tradition, with a particular application problem in mind, we propose a model, develop a method and justify its optimality by some hard-to-prove theorems, and find a dataset to support the approach. In this tradition, we put a lot of faith on our model and our theory: we hope the model is adequate, and we hope our optimality theory warrants the superiority of our method over others. Modern machine learning literature (especially the recent development of deep learning) provides a different approach to justifying the “superiority” of an approach; we compare the proposed approach with existing approaches by the real data results over a dozen of benchmark datasets. To choose an algorithm for their dataset, a practitioner does not necessarily need warranties from a theorem; a superior performance over many benchmark datasets says it all. To some theoretical statisticians, this is rather disappointing, as they come from a long","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"40 1","pages":"499 - 504"},"PeriodicalIF":2.5000,"publicationDate":"2022-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Business & Economic Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/07350015.2022.2055358","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 1

Abstract

We thank David Donoho for very encouraging comments. As always, his penetrating vision and deep thoughts are extremely stimulating. We are glad that he summarizes a major philosophical difference between statistics in earlier years (e.g., the time of Francis Galton) and statistics in our time by just a few words: data-first versus model-first. We completely agree with his comment that “each effort by a statistics researcher to understand a newly available type of data enlarges our field; it should be a primary part of the career of statisticians to cultivate an interest in cultivating new types of datasets, so that new methodology can be discovered and developed”; these are exactly the motivations underlying our (several-year) efforts in collecting, cleaning, and analyzing a large-scale high-quality dataset. We would like to add that both traditions have strengths, and combining the strengths of two sides may greatly help statisticians deal with the so-called crisis of the 21st century in statistics we face today. Let us explain the crisis above first. In the model-first tradition, with a particular application problem in mind, we propose a model, develop a method and justify its optimality by some hard-to-prove theorems, and find a dataset to support the approach. In this tradition, we put a lot of faith on our model and our theory: we hope the model is adequate, and we hope our optimality theory warrants the superiority of our method over others. Modern machine learning literature (especially the recent development of deep learning) provides a different approach to justifying the “superiority” of an approach; we compare the proposed approach with existing approaches by the real data results over a dozen of benchmark datasets. To choose an algorithm for their dataset, a practitioner does not necessarily need warranties from a theorem; a superior performance over many benchmark datasets says it all. To some theoretical statisticians, this is rather disappointing, as they come from a long

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

复辩状：“统计学家的共同引用和合作网络”

我们感谢大卫·多诺霍非常鼓舞人心的评论。一如既往，他锐利的眼光和深邃的思想极具启发性。我们很高兴他总结了早期统计(例如，弗朗西斯·高尔顿的时代)和我们这个时代的统计之间的主要哲学差异，只有几个字:数据优先与模型优先。我们完全同意他的评论:“统计研究人员为理解一种新的可用数据类型所做的每一次努力都扩大了我们的研究领域;培养培养新型数据集的兴趣应该成为统计学家职业生涯的一个主要部分，这样才能发现和发展新的方法”;这些正是我们(数年)努力收集、清理和分析大规模高质量数据集的动机。我们想补充的是，这两种传统都有各自的优势，将双方的优势结合起来，可能会极大地帮助统计学家应对我们今天面临的所谓21世纪统计危机。让我们先解释一下上述危机。在模型优先的传统中，考虑到特定的应用问题，我们提出了一个模型，开发了一种方法，并通过一些难以证明的定理来证明其最优性，并找到一个数据集来支持该方法。在这个传统中，我们对我们的模型和理论有很大的信心:我们希望模型是足够的，我们希望我们的最优性理论保证我们的方法优于其他方法。现代机器学习文献(尤其是深度学习的最新发展)提供了一种不同的方法来证明一种方法的“优越性”;我们通过十几个基准数据集的真实数据结果将所提出的方法与现有方法进行了比较。为了为他们的数据集选择一种算法，从业者不一定需要定理的保证;优于许多基准数据集的优越性能说明了一切。对于一些理论统计学家来说，这是相当令人失望的，因为他们来自一个漫长的

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Business & Economic Statistics 数学-统计学与概率论

CiteScore

5.00

自引率

6.70%

发文量

审稿时长

>12 weeks

期刊介绍： The Journal of Business and Economic Statistics (JBES) publishes a range of articles, primarily applied statistical analyses of microeconomic, macroeconomic, forecasting, business, and finance related topics. More general papers in statistics, econometrics, computation, simulation, or graphics are also appropriate if they are immediately applicable to the journal''s general topics of interest. Articles published in JBES contain significant results, high-quality methodological content, excellent exposition, and usually include a substantive empirical application.

期刊最新文献

A Heteroskedasticity-Robust Overidentifying Restriction Test with High-Dimensional Covariates. A Ridge-Regularized Jackknifed Anderson-Rubin Test. Efficient and Robust Estimation of the Generalized LATE Model Modeling and Forecasting Macroeconomic Downside Risk* Causal inference under outcome-based sampling with monotonicity assumptions