Alessandro Rovetta , Mohammad Ali Mansournia , Alessandro Vitale
{"title":"For a proper use of frequentist inferential statistics in public health","authors":"Alessandro Rovetta , Mohammad Ali Mansournia , Alessandro Vitale","doi":"10.1016/j.gloepi.2024.100151","DOIUrl":null,"url":null,"abstract":"<div><p>As widely noted in the literature and by international bodies such as the American Statistical Association, severe misinterpretations of <em>P</em>-values, confidence intervals, and statistical significance are sadly common in public health. This scenario poses serious risks concerning terminal decisions such as the approval or rejection of therapies. Cognitive distortions about statistics likely stem from poor teaching in schools and universities, overly simplified interpretations, and – as we suggest – the reckless use of calculation software with predefined standardized procedures. In light of this, we present a framework to recalibrate the role of frequentist-inferential statistics within clinical and epidemiological research. In particular, we stress that statistics is only a set of rules and numbers that make sense only when properly placed within a well-defined scientific context beforehand. Practical examples are discussed for educational purposes. Alongside this, we propose some tools to better evaluate statistical outcomes, such as multiple compatibility or surprisal intervals or tuples of various point hypotheses. Lastly, we emphasize that every conclusion must be informed by different kinds of scientific evidence (e.g., biochemical, clinical, statistical, etc.) and must be based on a careful examination of costs, risks, and benefits.</p></div>","PeriodicalId":36311,"journal":{"name":"Global Epidemiology","volume":"8 ","pages":"Article 100151"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590113324000178/pdfft?md5=4fe0b244a01cff2d2b827052b4f73448&pid=1-s2.0-S2590113324000178-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590113324000178","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As widely noted in the literature and by international bodies such as the American Statistical Association, severe misinterpretations of P-values, confidence intervals, and statistical significance are sadly common in public health. This scenario poses serious risks concerning terminal decisions such as the approval or rejection of therapies. Cognitive distortions about statistics likely stem from poor teaching in schools and universities, overly simplified interpretations, and – as we suggest – the reckless use of calculation software with predefined standardized procedures. In light of this, we present a framework to recalibrate the role of frequentist-inferential statistics within clinical and epidemiological research. In particular, we stress that statistics is only a set of rules and numbers that make sense only when properly placed within a well-defined scientific context beforehand. Practical examples are discussed for educational purposes. Alongside this, we propose some tools to better evaluate statistical outcomes, such as multiple compatibility or surprisal intervals or tuples of various point hypotheses. Lastly, we emphasize that every conclusion must be informed by different kinds of scientific evidence (e.g., biochemical, clinical, statistical, etc.) and must be based on a careful examination of costs, risks, and benefits.
正如文献和美国统计协会等国际机构广泛指出的那样,对 P 值、置信区间和统计意义的严重误读在公共卫生领域十分常见,令人痛心。这种情况会对诸如批准或拒绝疗法等终极决策带来严重风险。对统计学认知的扭曲可能源于学校和大学的不良教学、过度简化的解释,以及--正如我们所建议的--不计后果地使用带有预定义标准化程序的计算软件。有鉴于此,我们提出了一个框架,以重新调整频数-推断统计在临床和流行病学研究中的作用。我们特别强调,统计学只是一套规则和数字,只有事先将其正确置于定义明确的科学环境中才有意义。为了达到教育目的,我们讨论了一些实际案例。与此同时,我们还提出了一些更好地评估统计结果的工具,如多重相容性或意外间隔或各种点假设的元组。最后,我们强调,每一个结论都必须以不同种类的科学证据(如生化、临床、统计等)为依据,并且必须基于对成本、风险和收益的仔细研究。