Using propensity scores for racial disparities analysis
Fan Li
Observational Studies 9(1): 59–68 (2022-09-08). DOI: 10.1353/obs.2023.0005
Abstract: The propensity score plays a central role in causal inference, but its use is not limited to causal comparisons. As a covariate balancing tool, the propensity score can also support controlled descriptive comparisons between groups whose memberships are not manipulable; a prominent example is racial disparities in health care. However, conceptual confusion and hesitation persist around using propensity scores in racial disparities studies. In this commentary, we argue that the propensity score, possibly combined with other methods, is an effective tool for racial disparities analysis. We describe the relevant estimands, target populations, and assumptions. In particular, we clarify that a controlled descriptive comparison requires weaker assumptions than a causal comparison. We discuss three common propensity score weighting strategies: overlap weighting, inverse probability weighting, and average treatment effect on the treated (ATT) weighting. We further describe how to combine weighting with the rank-and-replace adjustment method to produce racial disparity estimates concordant with the Institute of Medicine's definition. The method is illustrated by a re-analysis of the Medical Expenditure Panel Survey data.
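The three weighting strategies named in the abstract differ only in the weight assigned to each unit given its propensity score. A minimal numpy sketch with made-up propensity scores (the scores would in practice come from a fitted model; the `weighted_diff` helper is an illustrative addition, not a function from the paper):

```python
import numpy as np

# Illustrative data: group indicator z (1 = group of interest) and
# propensity scores e(x) = P(z = 1 | x), assumed already estimated.
z = np.array([1, 1, 0, 0])
e = np.array([0.8, 0.5, 0.5, 0.2])

# Inverse probability weighting (IPW): targets the combined population.
w_ipw = np.where(z == 1, 1.0 / e, 1.0 / (1.0 - e))

# ATT weighting: reweights the z = 0 group to resemble the z = 1 group.
w_att = np.where(z == 1, 1.0, e / (1.0 - e))

# Overlap weighting: emphasizes units whose group membership is most uncertain.
w_overlap = np.where(z == 1, 1.0 - e, e)

def weighted_diff(y, z, w):
    """Weighted mean difference between groups: the disparity estimate."""
    return (np.average(y[z == 1], weights=w[z == 1])
            - np.average(y[z == 0], weights=w[z == 0]))
```

Each weighting scheme implicitly targets a different population: IPW the combined population, ATT weighting the z = 1 group, and overlap weighting the subpopulation with the most overlap in covariates.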
Revisiting the Propensity Score's Central Role: Towards Bridging Balance and Efficiency in the Era of Causal Machine Learning
N. Hejazi, M. J. van der Laan
Observational Studies 9(1): 23–34 (2022-08-17). DOI: 10.1353/obs.2023.0001
Abstract: About forty years ago, in a now-seminal contribution, Rosenbaum and Rubin (1983) introduced a critical characterization of the propensity score as a central quantity for drawing causal inferences in observational study settings. In the decades since, much progress has been made across several research frontiers in causal inference, notably including the re-weighting and matching paradigms. Focusing on the former, and specifically on its intersection with machine learning and semiparametric efficiency theory, we re-examine the role of the propensity score in modern methodological developments. As the contribution of Rosenbaum and Rubin (1983) spurred a focus on the balancing property of the propensity score, we re-examine the degree to which, and how, this property plays a role in the development of asymptotically efficient estimators of causal effects; moreover, we discuss a connection between the balancing property and efficient estimation in the form of score equations, and propose a score test for evaluating whether an estimator achieves empirical balance.
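A common empirical diagnostic for the balancing property (a standard check, not the score test proposed in the paper) compares weighted covariate means across groups. A sketch:

```python
import numpy as np

def weighted_smd(X, z, w):
    """Weighted standardized mean difference for each column of X.

    X: (n, p) covariate matrix; z: binary group indicator; w: unit weights.
    Values near zero indicate empirical balance on that covariate.
    """
    m1 = np.average(X[z == 1], axis=0, weights=w[z == 1])
    m0 = np.average(X[z == 0], axis=0, weights=w[z == 0])
    # Pool unweighted group variances for the scale, a common convention.
    s = np.sqrt((X[z == 1].var(axis=0) + X[z == 0].var(axis=0)) / 2.0)
    return (m1 - m0) / s
```

In practice, absolute standardized differences below roughly 0.1 are often read as adequate balance, though the paper's point is precisely that such informal checks can be replaced by a formal score test.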
Propensity Score Modeling: Key Challenges When Moving Beyond the No-Interference Assumption
Hyunseung Kang, Chan Park, R. Trane
Observational Studies 9(1): 43–53 (2022-08-13). DOI: 10.1353/obs.2023.0003
Abstract: This paper presents several models for the propensity score. Considerable attention is given to a recently popular, but relatively under-explored, setting in causal inference where the no-interference assumption does not hold. We lay out some key challenges in propensity score modeling under interference and present a few promising models building on existing work on mixed-effects models.
Sensitivity Analysis for the Adjusted Mann-Whitney Test with Observational Studies
Maozhu Dai, Weining Shen, H. Stern
Observational Studies 8(1): 1–29 (2022-06-04). DOI: 10.1353/obs.2022.0002
Abstract: The Mann-Whitney test is a popular nonparametric test for comparing two samples. It was recently extended by Satten et al. (2018) to allow testing for the existence of treatment effects in observational studies. Their proposed adjusted Mann-Whitney test relies on the unconfoundedness assumption, which is untestable in practice. It is therefore important to assess the impact of violating this assumption on the degree to which causal conclusions remain valid. In this paper, we consider a marginal sensitivity analysis framework to address this problem, utilizing a bootstrap approach that provides a sensitivity interval for the estimand with a guaranteed coverage probability as long as the data generating mechanism is included in the set of pre-specified sensitivity models. We develop efficient optimization algorithms for computing the sensitivity interval and further extend our approach to a general class of adjusted multi-sample U-statistics. Simulation studies and two real data applications are discussed to demonstrate the utility of our proposed methodology.
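For reference, the unadjusted Mann-Whitney statistic counts pairs in which one sample exceeds the other; the paper's sensitivity interval brackets an adjusted version of such a statistic over a set of sensitivity models. A sketch of the unadjusted statistic with a plain percentile bootstrap interval (a simplification for intuition, not the paper's procedure):

```python
import numpy as np

def mann_whitney_u(x, y):
    """U statistic: number of pairs with x_i > y_j, counting ties as 1/2."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    return ((x[:, None] > y[None, :]).sum()
            + 0.5 * (x[:, None] == y[None, :]).sum())

def bootstrap_interval(x, y, level=0.95, n_boot=2000, seed=0):
    """Percentile bootstrap interval for the scaled statistic U / (n_x * n_y)."""
    rng = np.random.default_rng(seed)
    stats = []
    for _ in range(n_boot):
        xb = rng.choice(x, size=len(x), replace=True)
        yb = rng.choice(y, size=len(y), replace=True)
        stats.append(mann_whitney_u(xb, yb) / (len(x) * len(y)))
    lo, hi = np.quantile(stats, [(1 - level) / 2, 1 - (1 - level) / 2])
    return float(lo), float(hi)
```

The scaled statistic U / (n_x n_y) estimates P(X > Y) + P(X = Y)/2, which equals 1/2 when the two distributions are exchangeable.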
Evaluation of Language Training Programs in Luxembourg using Principal Stratification
Michela Bia, Alfonso Flores-Lagunes, Andrea Mercatanti
Observational Studies 8(1): 1–44 (2022-06-04). DOI: 10.2139/ssrn.3538309
Abstract: In an increasingly globalized world, multiple language skills can create more employment opportunities. Several countries include language training in their active labor market programs for the unemployed. We analyze the effects of a language training program on the re-employment probability and hourly wages simultaneously, using high-quality administrative data from Luxembourg. We address selection into training with an unconfoundedness assumption and account for the complication that wages are "truncated" by unemployment by adopting a principal stratification framework. Estimation is undertaken with a likelihood-based mixture-model approach. To improve inference, we use the individual's hours worked as a secondary outcome together with a stochastic dominance assumption; these two features considerably ameliorate the multimodality problem commonly encountered in mixture models. We also conduct a sensitivity analysis to assess the unconfoundedness assumption. Our results suggest a positive effect (of up to 12.7 percent) of the language training program on the re-employment probability, but no effect on wages for those who are employed regardless of training participation.
gesttools: General Purpose G-Estimation in R
Daniel Tompsett, S. Vansteelandt, O. Dukes, B. D. De Stavola
Observational Studies 8(1): 1–28 (2022-06-04). DOI: 10.1353/obs.2022.0003
Abstract: In this paper we present gesttools, a set of general-purpose, user-friendly functions for performing g-estimation of structural nested mean models (SNMMs) for time-varying exposures and outcomes in R. The package implements the g-estimation methods of Vansteelandt and Sjolander (2016) and Dukes and Vansteelandt (2018), and is capable of analysing both end-of-study and time-varying outcome data that are either binary or continuous, with exposure variables that are binary, continuous, or categorical. It also allows for the fitting of SNMMs with time-varying causal effects, effect modification by other variables, or both, and supports censored data via inverse weighting. We outline the theory underpinning these methods and describe the SNMMs that can be fitted by the software. The package is demonstrated using simulated and real-world-inspired datasets.
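gesttools is an R package; purely for intuition, here is a Python sketch of g-estimation in the simplest textbook special case: a point exposure A with linear SNMM E[Y(a) − Y(0) | A = a, L] = ψa, where ψ solves the estimating equation Σᵢ (Yᵢ − ψAᵢ)(Aᵢ − E[A | Lᵢ]) = 0. This closed form is not the package's time-varying machinery, just the underlying idea:

```python
import numpy as np

def g_estimate_point(y, a, propensity):
    """Closed-form g-estimate of psi for a point-exposure linear SNMM.

    Solves sum_i (y_i - psi * a_i) * (a_i - e_i) = 0, where e_i = E[A | L_i]
    comes from an assumed correctly specified treatment model.
    """
    resid = a - propensity          # residual of the treatment model
    return float(np.sum(y * resid) / np.sum(a * resid))
```

With y generated exactly as y = 2a and a constant propensity, the estimator recovers ψ = 2; in real use, `propensity` would be fitted from covariates L.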
Causal Inference Challenges with Interrupted Time Series Designs: An Evaluation of an Assault Weapons Ban in California
R. Berk
Observational Studies (2022-04-26). DOI: 10.1353/obs.0.0001
The interrupted time series design was introduced to social scientists in 1963 by Campbell and Stanley; analysis methods were proposed by Box and Tiao in 1975, and more recent treatments are easily found (Box et al., 2016). Despite its popularity, current results in statistics reveal fundamental oversights in the standard statistical methods employed. The adaptive model selection built into recommended practice creates challenging problems for post-model-selection inference. What one might call model cherry-picking can invalidate conventional statistical inference, including statistical tests and confidence intervals, with damaging consequences for causal inference. There are technical developments that can correct for these problems, but these remedies raise conceptual difficulties for causal inference once proper estimands are defined. The issues are illustrated with an analysis of the impact of an assault weapons ban on daily handgun sales in California from 1996 through 2018. Statistically valid regression functionals are obtained, but their causal meaning is unclear. Researchers might be best served by interpreting only the sign of such functionals.
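The standard ITS analysis under scrutiny here is a segmented regression: a level and trend, plus post-intervention changes in both. A minimal OLS sketch (illustrative only, not the paper's analysis of the California data):

```python
import numpy as np

def segmented_regression(y, t, t0):
    """OLS fit of y ~ 1 + t + post + post*(t - t0), with post = 1{t >= t0}.

    Returns coefficients (baseline level, baseline trend,
    post-intervention level change, post-intervention trend change).
    """
    post = (t >= t0).astype(float)
    X = np.column_stack([np.ones_like(t, dtype=float), t, post, post * (t - t0)])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta
```

The paper's caution applies directly: if this model specification was itself chosen after inspecting the same series, the usual tests and confidence intervals for these coefficients are no longer valid.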
Posterior Predictive Propensity Scores and p-Values
Peng Ding, Tianyu Guo
Observational Studies 9(1): 3–18 (2022-02-16). DOI: 10.1353/obs.2023.0015
Abstract: Rosenbaum and Rubin (1983) introduced the notion of the propensity score and discussed its central role in causal inference with observational studies. Their paper, however, created a fundamental incoherence with an earlier paper by Rubin (1978), which showed that the propensity score plays no role in the Bayesian analysis of unconfounded observational studies if the priors on the propensity score and outcome models are independent. Despite serious efforts in the literature, it is generally difficult to reconcile these contradicting results. We offer a simple approach to incorporating the propensity score in Bayesian causal inference based on the posterior predictive p-value. To motivate a simple procedure, we focus on the model with the strong null hypothesis of no causal effects for any units whatsoever. Computationally, the proposed posterior predictive p-value equals the classic p-value based on the Fisher randomization test averaged over the posterior predictive distribution of the propensity score. Moreover, using the studentized doubly robust estimator as the test statistic, the proposed p-value inherits the doubly robust property and is also asymptotically valid for testing the weak null hypothesis of zero average causal effect. Perhaps surprisingly, this Bayesian-motivated p-value can have better frequentist finite-sample performance than the frequentist p-value based on the asymptotic approximation, especially when the propensity scores take extreme values.
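The computational recipe in the abstract — a Fisher randomization test p-value averaged over posterior draws of the propensity score — can be sketched as follows. The difference in means stands in for the paper's studentized doubly robust statistic, and posterior draws `e_draws` are assumed to be supplied by the caller:

```python
import numpy as np

def frt_pvalue(y, z, e, n_draws=1000, seed=0):
    """FRT p-value under the sharp null; assignments redrawn as Bernoulli(e)."""
    rng = np.random.default_rng(seed)
    obs = y[z == 1].mean() - y[z == 0].mean()
    hits, total = 0, 0
    for _ in range(n_draws):
        zs = rng.binomial(1, e)
        if 0 < zs.sum() < len(zs):          # need both groups non-empty
            stat = y[zs == 1].mean() - y[zs == 0].mean()
            hits += abs(stat) >= abs(obs)
            total += 1
    return hits / total

def posterior_predictive_pvalue(y, z, e_draws, **kw):
    """Average the FRT p-value over posterior draws of the propensity score."""
    return float(np.mean([frt_pvalue(y, z, e, **kw) for e in e_draws]))
```

Each posterior draw of the propensity score yields one FRT p-value; averaging over draws propagates uncertainty about the assignment mechanism into the final p-value.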
Melting together prediction and inference
A. Daoud, Devdatt P. Dubhashi
Observational Studies 7(1): 1–7 (2021-10-04). DOI: 10.1353/obs.2021.0035
Abstract: In Leo Breiman's influential article "Statistical Modeling: The Two Cultures," he identified two cultures of statistical practice. The data modeling culture (DMC) denotes practices tailored for statistical inference targeting a quantity of interest, [inline-graphic 01]. The algorithmic modeling culture (AMC) refers to practices defining an algorithm, or machine-learning (ML) procedure, that generates accurate predictions about an outcome of interest, [inline-graphic 02]. Although DMC was the dominant mode, Breiman argued that statisticians should give more attention to AMC. Twenty years later, energized by two revolutions, one in data science and one in causal inference, a hybrid modeling culture (HMC) is rising. HMC fuses the inferential strength of DMC with the predictive power of AMC toward the goal of analyzing cause and effect; thus, HMC's quantity of interest is the causal effect, [inline-graphic 03]. In combining inference and prediction, the result of HMC practices is that the distinction between prediction and inference, taken to its limit, melts away. While this hybrid culture is not yet the default mode of scientific practice, we argue that it offers an intriguing novel path for the applied sciences.
Randomization Tests to Assess Covariate Balance When Designing and Analyzing Matched Datasets
Zach Branson
Observational Studies 7(1): 1–36 (2021-09-09). DOI: 10.1353/obs.2021.0031
Abstract: Causal analyses for observational studies are often complicated by covariate imbalances among treatment groups, and matching methodologies alleviate this complication by finding subsets of treatment groups that exhibit covariate balance. It is widely agreed that covariate balance can serve as evidence that a matched dataset approximates a randomized experiment, but what kind of experiment does a matched dataset approximate? In this work, we develop a randomization test for the hypothesis that a matched dataset approximates a particular experimental design, such as complete randomization, block randomization, or rerandomization. Our test can incorporate any experimental design, and it allows for a graphical display that puts several designs on the same univariate scale, thereby letting researchers pinpoint which design, if any, is most appropriate for a matched dataset. After researchers determine a plausible design, we recommend a randomization-based approach for analyzing the matched data, which can incorporate any design and treatment effect estimator. Through simulation, we find that our test can frequently detect violations of randomized assignment that harm inferential results. Furthermore, through simulation and a real application in political science, we find that matched datasets with high levels of covariate balance tend to approximate balance-constrained designs like rerandomization, and analyzing them as such can lead to precise causal analyses. However, researchers should proceed with caution when assuming a precise design, because doing so can harm inferential results if substantial biases remain due to residual imbalances after matching. Our approach is implemented in the randChecks R package, available on CRAN.
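The basic ingredient of such a randomization test is comparing an observed balance statistic, such as the Mahalanobis distance between group covariate means, with its distribution under a hypothesized design. A sketch for complete randomization (an illustrative reimplementation of the idea, not the randChecks API, which also covers block randomization and rerandomization):

```python
import numpy as np

def mahalanobis_balance(X, z):
    """Mahalanobis distance between treated and control covariate means."""
    d = X[z == 1].mean(axis=0) - X[z == 0].mean(axis=0)
    S = np.cov(X, rowvar=False)
    return float(d @ np.linalg.solve(S, d))

def balance_pvalue(X, z, n_perm=1000, seed=0):
    """P(distance >= observed) under complete randomization (permuted z)."""
    rng = np.random.default_rng(seed)
    obs = mahalanobis_balance(X, z)
    perm = [mahalanobis_balance(X, rng.permutation(z)) for _ in range(n_perm)]
    return float(np.mean([p >= obs for p in perm]))
```

A matched dataset whose observed distance is far smaller than typical complete-randomization draws yields a p-value near 1, suggesting it better approximates a balance-constrained design such as rerandomization; repeating the comparison under several hypothesized designs gives the univariate graphical display described above.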