{"title":"A Dirichlet-Multinomial mixture model of Statistical Science: Mapping the shift of a paradigm","authors":"Massimo Bilancia , Rade Dačević","doi":"10.1016/j.joi.2024.101633","DOIUrl":null,"url":null,"abstract":"<div><div>Using Bayesian natural language processing (NLP) methods and a scalable variational algorithm tailored for mixtures of discrete positive data, we analyzed a large corpus of 111,411 eprints submitted to the arXiv repository between 1994 and 2022 in the Statistics category (the primary classification for these eprints on arXiv). Our objective is to assess the impact of Machine Learning (ML) on the field of Statistics–specifically, to determine whether the introduction of ML has led to a fundamental paradigm shift, transforming traditional statistical problems or creating entirely new ones, or if this perceived revolution is primarily occurring outside the field of Statistics. Our findings suggest that the only significant paradigm shift for Statistics as a scientific discipline remains the Bayesian revolution that began in the early 1990s.</div></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":"19 1","pages":"Article 101633"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Informetrics","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1751157724001457","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Using Bayesian natural language processing (NLP) methods and a scalable variational algorithm tailored for mixtures of discrete positive data, we analyzed a large corpus of 111,411 eprints submitted to the arXiv repository between 1994 and 2022 in the Statistics category (the primary classification for these eprints on arXiv). Our objective is to assess the impact of Machine Learning (ML) on the field of Statistics–specifically, to determine whether the introduction of ML has led to a fundamental paradigm shift, transforming traditional statistical problems or creating entirely new ones, or if this perceived revolution is primarily occurring outside the field of Statistics. Our findings suggest that the only significant paradigm shift for Statistics as a scientific discipline remains the Bayesian revolution that began in the early 1990s.
期刊介绍:
Journal of Informetrics (JOI) publishes rigorous high-quality research on quantitative aspects of information science. The main focus of the journal is on topics in bibliometrics, scientometrics, webometrics, patentometrics, altmetrics and research evaluation. Contributions studying informetric problems using methods from other quantitative fields, such as mathematics, statistics, computer science, economics and econometrics, and network science, are especially encouraged. JOI publishes both theoretical and empirical work. In general, case studies, for instance a bibliometric analysis focusing on a specific research field or a specific country, are not considered suitable for publication in JOI, unless they contain innovative methodological elements.